Commits · f422ac6dccf27a2310d510be1fc4c8b8a7a1e78a · 陈曦 / sub2api

27 Apr, 2026 5 commits
- test: cover filter-target account bulk update · f422ac6d
  KnowSky404 authored Apr 27, 2026
  
  f422ac6d
- docs: add account bulk edit implementation plan · 54de4e00
  KnowSky404 authored Apr 27, 2026
  
  54de4e00
- docs: add account bulk edit scope design · 65c27d2c
  KnowSky404 authored Apr 27, 2026
  
  65c27d2c
- Merge pull request #1996 from Cloud370/fix/claude-code-read-empty-pages · c92b88e3
  Wesley Liddick authored Apr 27, 2026
```
fix(anthropic): drop empty Read.pages in responses-to-anthropic tool input
```
  c92b88e3
- Merge pull request #2006 from gaoren002/pr/openai-images-explicit-session · ed0c85a1
  Wesley Liddick authored Apr 27, 2026
```
fix(openai): avoid implicit image sticky sessions
```
  ed0c85a1
26 Apr, 2026 7 commits
- fix(openai): avoid implicit image sticky sessions · 615557ec
  gaoren002 authored Apr 26, 2026
  
  615557ec
- fix(anthropic): drop empty Read.pages in responses-to-anthropic tool input · 30220903
  Cloud370 authored Apr 26, 2026
  
  30220903
- chore: sync VERSION to 0.1.119 [skip ci] · c056db74
  github-actions[bot] authored Apr 26, 2026
  
  c056db74
- Merge pull request #1973 from Nobody-Zhang/main · a0b5e5bf
  Wesley Liddick authored Apr 26, 2026
```
fix(payment): 修复 Zpay 退款接口调用
```
  a0b5e5bf
- Merge pull request #1970 from deqiying/fix-1754-claude-openai-cache-usage · 41d06573
  Wesley Liddick authored Apr 26, 2026
```
fix(anthropic): 修正缓存 token 的 Anthropic 用量语义
```
  41d06573
- Fix Zpay refund endpoint handling · 1a0cabbf
  Nobody-Zhang authored Apr 26, 2026
  
  1a0cabbf
- feat(affiliate): 完善邀请返利系统 · 9b6dcc57
  shaw authored Apr 26, 2026
```
  - 修复返利不到账的根因：tryClaimAffiliateRebateAudit 中 PostgreSQL 参数类型推断冲突
  - 补全 OAuth 注册路径（LinuxDo/OIDC/WeChat/Pending Flow）的邀请码绑定
  - 前端 OAuth 注册页面传递 aff_code 参数
  - 新增返利冻结期机制：可配置冻结时间，到期后自动解冻（懒解冻）
  - 新增返利有效期：绑定后 N 天内有效，过期不再产生返利
  - 新增单人返利上限：超出上限部分精确截断
  - 增强返利流程 slog 结构化日志，便于排查问题
  - 已邀请用户列表增加返利明细列
```
  9b6dcc57
25 Apr, 2026 22 commits

fix(anthropic): 修正缓存 token 的 Anthropic 用量语义 · b17704d6
deqiying authored Apr 26, 2026

b17704d6

fix(gateway): skip body mimicry for real Claude Code clients to restore prompt caching · 496469ac

shaw authored Apr 25, 2026

PR #1914 unconditionally applied the full mimicry pipeline to all OAuth
accounts, including real Claude Code CLI clients. This replaced the
client's long system prompt (~10K+ tokens with stable cache_control
breakpoints) with a short ~45 token [billing, CC prompt] pair, which
falls below Anthropic's 1024-token minimum cacheable prefix threshold.
The result: every request created a new cache but never hit an existing
one.

Fix: restore the Claude Code client detection gate so that real CC
clients bypass body-level mimicry (system rewrite, message cache
management, tool name obfuscation). Non-CC third-party clients
(opencode, etc.) continue to receive full mimicry.

Also harden the detection logic:
- Make UA regex case-insensitive (align with claude_code_validator.go)
- Validate metadata.user_id format via ParseMetadataUserID() instead of
  just checking non-empty, preventing third-party tools from spoofing
  a claude-cli/* UA with an arbitrary user_id string to bypass mimicry

496469ac

fix(payment): allow Stripe payment pages to bypass router auth guard · c1b52615

shaw authored Apr 25, 2026

Stripe payment routes (/payment/stripe, /payment/stripe-popup) are
reached via hard navigation (window.location.href), which caused
the router guard to block access before the page could load.
Set requiresAuth and requiresPayment to false, consistent with
/payment/result. Backend API still enforces authentication.

c1b52615

style: fix gofmt and ineffassign lint errors · 3af9940b

shaw authored Apr 25, 2026

- gofmt: realign AffiliateDetail struct tags in affiliate_service.go
- ineffassign: remove dead seenCompleted assignment before return in account_test_service.go

3af9940b

Merge pull request #1948 from hungryboy1025/fix/openai-account-test-responses-stream · 22b12775
Wesley Liddick authored Apr 25, 2026
```
fix(openai): tighten responses stream account tests
```
22b12775
Merge pull request #1960 from gaoren002/fix/openai-stream-keepalive-downstream-idle · aff98d5a
Wesley Liddick authored Apr 25, 2026
```
fix(openai): keep responses stream alive during pre-output failover
```
aff98d5a

feat(affiliate): add feature toggle and per-user custom invite settings · 4e1bb2b4

shaw authored Apr 25, 2026

- 在系统设置「功能开关」中新增邀请返利总开关，默认关闭；
  关闭态：菜单隐藏、注册忽略 aff、新充值不返利，但已有 quota 仍可转余额
- 支持管理员为指定用户设置专属邀请码（覆盖随机码，全局唯一）
- 支持管理员为指定用户设置专属返利比例（覆盖全局比例，可单条/批量调整）
- 在系统设置邀请返利卡片内嵌入专属用户管理表格（搜索/编辑/批量/删除），
  删除采用项目通用 ConfirmDialog，会同时清除专属比例并把邀请码重置为系统随机码
- /affiliate 用户页新增「我的返利比例」卡片与动态使用说明，让用户直观看到
  分享后能拿到多少（同源 resolveRebateRatePercent 计算，与实际充值一致）
- 新增数据库迁移 132 添加 aff_rebate_rate_percent 与 aff_code_custom 列
- 新增 admin 路由组 /api/v1/admin/affiliates/users/* 共 5 个端点
- AffiliateService 改为只依赖 *SettingService，去除冗余的 SettingRepository
- 邀请码格式校验放宽到 [A-Z0-9_-]{4,32}，兼容旧 12 位系统码与新自定义码
- 补充单元测试与集成测试覆盖新方法、冲突路径与边界值

4e1bb2b4

fix(openai): keep responses stream alive during pre-output failover · dac6e520
gaoren002 authored Apr 25, 2026

dac6e520
fix(openai): tighten responses stream account tests · 8987e0ba
hungryboy1025 authored Apr 25, 2026

8987e0ba
chore: sync VERSION to 0.1.118 [skip ci] · 9d1751ec
github-actions[bot] authored Apr 25, 2026

9d1751ec
Merge pull request #1943 from AyeSt0/fix/openai-responses-preoutput-failover · 5d1c12e6
Wesley Liddick authored Apr 25, 2026
```
fix(openai): 修复 Responses 流式失败前置事件导致无法 failover
```
5d1c12e6
fix(openai): fail over before responses stream output · 5b63a9b0
AyeSt0 authored Apr 25, 2026

5b63a9b0
Merge pull request #1940 from 4fuu/fix/bump-codex-cli-version-to-0.125.0 · 641e6107
Wesley Liddick authored Apr 25, 2026
```
fix(openai): bump codex CLI version from 0.104.0 to 0.125.0
```
641e6107

feat(openai): port /responses/compact account support flow (PR #1555) · 095f457c

shaw authored Apr 25, 2026

将 vansour/sub2api#1555 的 OpenAI compact 能力建模手工移植到当前 main：账号
级 compact 状态/auto-force_on-force_off 模式、compact-only 模型映射、调度器
tier 分层（已支持 > 未知 > 已知不支持）、管理后台 compact 主动探测，以及对应
i18n/状态徽章。普通 /responses 流量行为不变，无数据库迁移。

095f457c

fix(openai): bump codex CLI version from 0.104.0 to 0.125.0 · 1e57e88e

4fuu authored Apr 25, 2026

The hardcoded codex CLI version (0.104.0) causes upstream rejection
when using gpt-5.5 with compact, as the server treats the request
as an outdated client and returns 400/502.

Update codexCLIVersion, codexCLIUserAgent, and openAICodexProbeVersion
to 0.125.0 to match the current Codex CLI release.

Fixes #1933, #1887, #1865
Related: #1609, #1298, #849

1e57e88e

Merge pull request #1772 from KnowSky404/fix/openai-test-state-reconciliation · b95ffce2
Wesley Liddick authored Apr 25, 2026
```
[codex] reconcile OpenAI admin test rate-limit state
```
b95ffce2

fix(payment): 同时启用易支付和 Stripe 时显示 Stripe 按钮 · 8f28a834

shaw authored Apr 25, 2026

VISIBLE_METHOD_ALIASES 漏了 stripe，导致 getVisibleMethods 把后端返回
的 stripe 过滤掉。点 Stripe 按钮时省略 method 查询参数，让落地页渲染
完整的 Payment Element。

8f28a834

chore: remove unused model IDs · 7424c73b
shaw authored Apr 25, 2026

7424c73b
Merge pull request #1920 from Wuxie233/fix/responses-web-search-tool-types · 1afd81b0
Wesley Liddick authored Apr 25, 2026
```
fix(apicompat): recognize web_search_20250305 / google_search in Responses→Anthropic tool conversion
```
1afd81b0

chore(gateway): fix lint issues from cc-mimicry-parity merge · 732d6495

shaw authored Apr 25, 2026

- staticcheck QF1001: apply De Morgan's law to the OAuth-mimic header
  passthrough guard (`!(a && b)` → `a != ... || !b`).
- unused: drop `isClaudeCodeRequest`, which became dead after PR #1914
  switched both `/v1/messages` and `/count_tokens` paths to unconditional
  `account.IsOAuth()` mimicry. The lowercase helper `isClaudeCodeClient`
  is kept (still referenced by `TestIsClaudeCodeClient`).

732d6495

Merge pull request #1914 from keh4l/feat/cc-mimicry-parity · 6d20ab80
Wesley Liddick authored Apr 25, 2026
```
fix(claude): align Claude Code OAuth mimicry with real CLI traffic
```
6d20ab80

refactor(affiliate): tighten DI and harden inviter code validation · aa8ee33b

shaw authored Apr 25, 2026

- Drop SetAffiliateService setters and ProvideAuthService /
  ProvidePaymentService / ProvideUserHandler wrappers in favor of direct
  Wire constructor injection. AffiliateService has no back-edge to
  Auth/Payment/User, so the indirection was never required.
- Change RegisterWithVerification's variadic affiliateCode to a fixed
  parameter; adjust all call sites.
- Validate aff_code length and charset in BindInviterByCode before any
  DB lookup, eliminating timing-side-channel and useless DB roundtrips
  on malformed input.
- Make affiliate cache invalidation synchronous; surface Redis errors
  via the project logger instead of swallowing them in a detached
  goroutine.
- Add an integration test guarding cross-layer tx propagation in
  AccrueQuota and a unit test pinning the aff_code format rules.

aa8ee33b

24 Apr, 2026 6 commits

fix(apicompat): recognize web_search_20250305 / google_search in Responses to... · 5f630fbb
Wuxie233 authored Apr 25, 2026
```
fix(apicompat): recognize web_search_20250305 / google_search in Responses to Anthropic tool conversion
```
5f630fbb

fix(gateway): skip client header passthrough on OAuth mimicry path · bdbd2916

keh4l authored Apr 25, 2026

Root cause of persistent third-party detection: sub2api's
buildUpstreamRequest transparently forwards client headers via
allowedHeaders whitelist (addHeaderRaw) before applying mimicry
overrides. When third-party clients (opencode, etc.) send their own
anthropic-beta / user-agent / x-stainless-* / x-claude-code-session-id
values, these get appended to the request alongside our injected
headers, creating an inconsistent header set that Anthropic detects.

Parrot's build_upstream_headers constructs exactly 9 headers from
scratch and never forwards anything from the client. This is why
'same opencode version, some users work some don't' — different
opencode configs/versions send different header combinations.

Fix: when tokenType=oauth and mimicClaudeCode=true, skip the
client header passthrough loop entirely. The subsequent
applyClaudeCodeMimicHeaders + ApplyFingerprint + beta merge
pipeline constructs all necessary headers from our controlled values.

Also: remove systemIncludesClaudeCodePrompt gate — OAuth accounts
now unconditionally rewrite system (even if client already sent a
Claude Code-style prompt), ensuring billing attribution block is
always present.

bdbd2916

fix(gateway): always apply full mimicry for OAuth accounts regardless of client identity · 6dc89765

keh4l authored Apr 25, 2026

Before: isClaudeCodeRequest() checked whether the client looks like a
real Claude Code CLI (UA, system prompt, X-App header, metadata format).
If it looked like Claude Code, all mimicry was skipped — the assumption
being that a real CLI needs no help.

Problem: third-party tools like opencode partially impersonate Claude
Code (sending claude-cli UA + claude-code beta + CC system prompt) but
miss critical details (billing attribution block, tool-name obfuscation,
cache breakpoints, full beta set). Some users' opencode instances pass
the isClaudeCodeRequest check, causing sub2api to skip mimicry entirely,
while Anthropic still detects the request as third-party.

This explains why 'same opencode version, some users work, some don't'
— it depends on which opencode features/config trigger the validator.

Fix: OAuth accounts now unconditionally run the full mimicry pipeline,
matching Parrot's behavior (Parrot never checks client identity).
This is safe because our mimicry is strictly more complete than any
third-party client's partial impersonation.

Changed:
  - /v1/messages path: remove isClaudeCode gate
  - /v1/messages/count_tokens path: same

6dc89765

fix(gateway): apply D/E/F mimicry to native /v1/messages and count_tokens paths · f3233db0

keh4l authored Apr 24, 2026

The previous commit only wired stripMessageCacheControl,
addMessageCacheBreakpoints, and tool-name obfuscation into
applyClaudeCodeOAuthMimicryToBody (used by /chat/completions and
/responses). The native /v1/messages path and count_tokens path
have their own independent mimicry code blocks and were missed.

Now all three entry points share the same D/E/F pipeline:
  - /v1/messages (gateway_service.go forwardAnthropic)
  - /v1/messages/count_tokens (gateway_service.go countTokens)
  - OpenAI compat (applyClaudeCodeOAuthMimicryToBody)

f3233db0

feat(gateway): port Parrot tool-name obfuscation + message cache breakpoints · 6e12578b

keh4l authored Apr 24, 2026

Implements the remaining three parity items with Parrot cc_mimicry:

  D) Tool-name obfuscation
     - Dynamic mapping when tools.length > 5 (matches Parrot threshold).
       Fake names follow {prefix}{name[:3]}{i:02d} (e.g. 'manage_bas00').
       Go port of random.Random(hash(tuple(names))) uses fnv64a seed +
       math/rand; byte-exact reproduction is impossible (Python hash vs
       Go hash), but the two invariants that matter are preserved:
         * same input tool_names yield identical mapping (cache hit)
         * prefix pool is shuffled (names look distributed)
     - Static prefix map (sessions_ -> cc_sess_, session_ -> cc_ses_)
       applied as fallback, matching Parrot TOOL_NAME_REWRITES verbatim.
     - Server tools (web_search_20250305, computer_*, etc.) are NOT
       renamed; only type=='function' and type=='custom' tools are.
     - tool_choice.name is rewritten in sync (only when type=='tool').
     - Response side: bytes-level replace on every SSE chunk / JSON
       body at 6 injection points (standard stream/non-stream,
       passthrough stream/non-stream, chat_completions stream +
       non-stream, responses stream + non-stream). Reverse mapping
       applied longest-fake-name-first to prevent substring conflicts
       (parity with Parrot _restore_tool_names_in_chunk).
     - tool_choice is no longer unconditionally deleted in
       normalizeClaudeOAuthRequestBody — Parrot passes it through.

  E) tools[-1] cache_control breakpoint
     - Injected as {type:ephemeral, ttl:<DefaultCacheControlTTL>} when
       the last tool has no cache_control. Client-provided ttl is
       passed through unchanged (repo-wide policy).

  F) messages cache_control strategy
     - stripMessageCacheControl removes every client-provided
       messages[*].content[*].cache_control (multi-turn stability).
     - addMessageCacheBreakpoints then injects two stable breakpoints:
       (1) last message, and (2) second-to-last user turn when
       messages.length >= 4.
     - Combined with the system block breakpoint and tools[-1]
       breakpoint, this gives exactly the 4 breakpoints Anthropic
       allows per request.

Non-trivial implementation details to be aware of when rebasing:

  * Two new files, no upstream collision:
      gateway_tool_rewrite.go       (D + E algorithms)
      gateway_messages_cache.go     (F strip + breakpoints)
  * Two new feature calls bolted onto the tail of
    applyClaudeCodeOAuthMimicryToBody in gateway_service.go — rebase
    conflicts will be ~10 lines maximum.
  * Response-side injection points all wrap their existing write with
    reverseToolNamesIfPresent(c, ...), preserving original behavior
    when no mapping is stored (static prefix rollback still runs).
  * Non-stream chat/responses switched from c.JSON to
    json.Marshal + c.Data so bytes-level replace is possible.
  * Retry bodies (FilterThinkingBlocksForRetry,
    FilterSignatureSensitiveBlocksForRetry, RectifyThinkingBudget)
    only prune blocks — they preserve the already-obfuscated tool
    names, so no extra mapping re-application is needed.

Manual QA: end-to-end scenario verified with 6 tools (above threshold)
and tool_choice.type=='tool'. Obfuscation + restore roundtrip shown
in test logs; then removed the temp test file.

Tests (16 new):
  - buildDynamicToolMap stability + below-threshold guard
  - sanitizeToolName precedence (dynamic > static)
  - restoreToolNamesInBytes longest-first + static rollback
  - applyToolNameRewriteToBody skips server tools + syncs tool_choice
  - applyToolsLastCacheBreakpoint defaults to 5m + passes client ttl
  - stripMessageCacheControl + addMessageCacheBreakpoints in the
    1/4/string-content cases + second-to-last user turn selection
  - buildToolNameRewriteFromBody ReverseOrdered is desc-by-fake-length
  - fake name shape follows Parrot {prefix}{head3}{i:02d}

6e12578b

feat(gateway): align body shape with real Claude Code CLI defaults · a25faeca

keh4l authored Apr 24, 2026

Three field-level alignments in normalizeClaudeOAuthRequestBody to
match real Claude Code CLI traffic byte-for-byte:

  1. temperature: previously deleted unconditionally; now passes
     through client value, defaults to 1 when absent (real CLI
     always sends temperature, default 1).

  2. max_tokens: defaults to 128000 when absent (real CLI default).

  3. context_management: when thinking.type is enabled/adaptive
     and the client did not provide context_management, inject
     {"edits":[{"type":"clear_thinking_20251015","keep":"all"}]}
     to mirror real CLI behavior.

tool_choice removal is unchanged (Claude Code OAuth credentials
do not allow client-supplied tool_choice).

Tests updated:
  - gateway_body_order_test.go: temperature/max_tokens are now
    expected in output; tool_choice still removed.
  - gateway_prompt_test.go: system array is now 2 blocks
    (billing + cc prompt), assertions adjusted.
  - gateway_anthropic_apikey_passthrough_test.go: same 2-block
    assertion.

a25faeca