Commits · fd43be8d0b0e45c146c677709fb455c28deba3b8 · 陈曦 / sub2api

07 Feb, 2026 30 commits

yangjianbo authored Feb 07, 2026



- handleUpstreamError 补齐新增的三个参数 (0, "", false)
- handleStreamingResponse 移除已删除的 nil 参数
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

836ba14b

fix(audit): 第二批审计修复 — P0 生产 Bug、安全加固、性能优化、缓存一致性、代码质量 · 2588fa6a

yangjianbo authored Feb 07, 2026



基于 backend-code-audit 审计报告，修复剩余 P0/P1/P2 共 34 项问题：

P0 生产 Bug：
- 修复 time.Since(time.Now()) 计时逻辑错误 (P0-03)
- generateRandomID 改用 crypto/rand 替代固定索引 (P0-04)
- IncrementQuotaUsed 重写为 Ent 原子操作消除 TOCTOU 竞态 (P0-05)

安全加固：
- gateway/openai handler 错误响应替换为泛化消息，防止内部信息泄露 (P1-14)
- usage_log_repo dateFormat 参数改用白名单映射，防止 SQL 注入 (P1-16)
- 默认配置安全加固：sslmode=prefer、response_headers=true、mode=release (P1-18/19, P2-15)

性能优化：
- gateway handler 循环内 defer 替换为显式 releaseWait 闭包 (P1-02)
- group_repo/promo_code_repo Count 前 Clone 查询避免状态污染 (P1-03)
- usage_log_repo 四个查询添加 LIMIT 10000 防止 OOM (P1-07)
- GetBatchUsageStats 添加时间范围参数，默认最近 30 天 (P1-10)
- ip.go CIDR 预编译为包级变量 (P1-11)
- BatchUpdateCredentials 重构为先验证后更新 (P1-13)

缓存一致性：
- billing_cache 添加 jitteredTTL 防止缓存雪崩 (P2-10)
- DeductUserBalance/UpdateSubscriptionUsage 错误传播修复 (P2-12)
- UserService.UpdateBalance 成功后异步失效 billingCache (P2-13)

代码质量：
- search 截断改为按 rune 处理，支持多字节字符 (P2-01)
- TLS Handshake 改为 HandshakeContext 支持 context 取消 (P2-07)
- CORS 预检添加 Access-Control-Max-Age: 86400 (P2-16)

测试覆盖：
- 新增 user_service_test.go（UpdateBalance 缓存失效 6 个用例）
- 新增 batch_update_credentials_test.go（fail-fast + 类型验证 7 个用例）
- 新增 response_transformer_test.go、ip_test.go、usage_log_repo_unit_test.go、search_truncate_test.go
- 集成测试：IncrementQuotaUsed 并发测试、billing_cache 错误传播测试
- config_test.go 补充 server.mode/sslmode 默认值断言
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2588fa6a

fix: 收敛 Claude Code 探测拦截并补齐回归测试 · 6aaa4aee
shaw authored Feb 07, 2026

6aaa4aee

fix(oauth): SessionStore.Stop() 添加 sync.Once 防重入保护 (P1-05) · f6ca7019

yangjianbo authored Feb 07, 2026

oauth 和 openai 包的 SessionStore.Stop() 直接调用 close(stopCh)，
重复调用会导致 panic。使用 sync.Once 包裹确保幂等安全。

新增单元测试覆盖连续调用和 50 goroutine 并发调用场景。
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

f6ca7019

fix(config): 禁止 server.frontend_url 携带 query/userinfo · a84604dc
yangjianbo authored Feb 07, 2026

a84604dc

fix(security): 修复密码重置链接 Host Header 注入漏洞 (P0-07) · e75d3e35

yangjianbo authored Feb 07, 2026



ForgotPassword 原来从 c.Request.Host 构建重置链接基础 URL，攻击者
可伪造 Host 头将重置链接指向恶意域名窃取 token。

修复方案：
- ServerConfig 新增 frontend_url 配置项
- auth_handler 改为从配置读取前端 URL，未配置时拒绝请求
- Validate() 校验 frontend_url 必须为绝对 HTTP(S) URL
- 新增 TestValidateServerFrontendURL 单元测试
- config.example.yaml 添加配置说明
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

e75d3e35

fix(gateway): harden digest logging and align antigravity ops · 1439eb39

shaw authored Feb 07, 2026

- avoid panic by using safe UUID prefix truncation in Gemini digest fallback logs\n- remove unconditional Antigravity 429 full-body debug logs and honor log truncation config\n- align Antigravity quick preset mappings to opus 4.6-thinking targets only\n- restore scope rate-limit aggregation/output in ops availability stats

1439eb39

perf(service): 优化 model 替换函数，用 gjson/sjson 替代全量 JSON 序列化 · 8226a4ce

yangjianbo authored Feb 07, 2026

SSE 热路径中 replaceModelInSSELine 和 replaceModelInResponseBody 原来
使用 json.Unmarshal/Marshal 对每个事件做全量反序列化再序列化，现改为
gjson.Get/sjson.Set 精确字段操作，消除 O(n) 中间 map 分配，保持 JSON
字段顺序不变。涉及 OpenAIGatewayService 和 GatewayService 两个服务。

新增 23 个单元测试覆盖：顶层/嵌套 model 替换、不匹配跳过、空行/[DONE]/
非法 JSON 等边界情况。

Fixes: P1-08
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

8226a4ce

fix(middleware): 管理员JWT增加TokenVersion校验 · 65c0d8b5
yangjianbo authored Feb 07, 2026
```
管理员改密后旧JWT会被拒绝，并补充单元测试覆盖。
```
65c0d8b5
fix(openai): 修复 usage 为空导致 panic（P0-02） · a9e256ce
yangjianbo authored Feb 07, 2026

a9e256ce
fix(test): update test calls to match method receivers on handleSmartRetry and antigravityRetryLoop · fa28dcbf
erio authored Feb 07, 2026

fa28dcbf

fix(antigravity): fetch default mapping from API and sync Redis on rate limit · 2656320d

erio authored Feb 07, 2026

1. Frontend: replace hardcoded antigravityDefaultMappings with async
fetch from GET /admin/accounts/antigravity/default-model-mapping,
eliminating the duplicate data source that caused frontend/backend
mapping inconsistency.

2. Backend: convert handleSmartRetry and antigravityRetryLoop from
standalone functions to AntigravityGatewayService methods, enabling
Redis cache sync (updateAccountModelRateLimitInCache) after both
rate-limit write paths — long-delay branch and retry-exhausted branch.

2656320d

chore(version): 更新版本号至 0.1.70.2 · 7e1674e4
yangjianbo authored Feb 07, 2026

7e1674e4
style: fix gofmt formatting in gateway_service.go · b4f6c4f9
erio authored Feb 07, 2026
```
Remove extra blank line that caused golangci-lint gofmt check to fail.
```
b4f6c4f9

perf(middleware): 优化订阅模式认证中间件，5次串行调用降至2步同步+1步异步 · 0e514ed8

yangjianbo authored Feb 07, 2026



- 为 GetActiveSubscription 添加 ristretto L1 缓存 + singleflight 防击穿
- 合并 ValidateSubscription + CheckUsageLimits 为纯内存 ValidateAndCheckLimits
- 窗口维护操作（激活/重置）异步化，不再阻塞首字节
- 缓存返回浅拷贝，避免并发 data race 和缓存污染
- 所有管理操作（分配/续期/撤销/扩展/窗口重置）同步失效 L1 缓存
- 新增 SubscriptionCacheConfig 可配置 L1 缓存大小/TTL/抖动
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

0e514ed8

refactor: remove unused IsAntigravityModelSupported function and its tests · 14c6c932
erio authored Feb 07, 2026

14c6c932

test(antigravity): add missing unit tests for upstream and custom model_mapping · 386126b1

erio authored Feb 07, 2026

- Add GetAccessToken upstream branch tests (success/failure/empty/nil)
- Add mapAntigravityModel wildcard-target-equals-request edge case tests
- Add upstream account smart retry test case
- Add GeminiMessagesCompatService custom model_mapping and empty model tests

386126b1

fix(antigravity): support upstream accounts and custom model_mapping in scheduling · de092728

erio authored Feb 07, 2026

- GetAccessToken: add upstream branch to read api_key from credentials
- shouldTriggerAntigravitySmartRetry: relax check from IsOAuth to Platform-based
- isModelSupportedByAccount/WithContext: replace IsAntigravityModelSupported
  whitelist with mapAntigravityModel for unified scheduling/forwarding logic
- mapAntigravityModel: fix edge case where wildcard target equals request model
- Update tests for new behavior and add custom model_mapping test cases

de092728

fix: restore non-failover error passthrough from 7b156489 · edb09370
erio authored Feb 07, 2026

edb09370
fix: restore error passthrough service improvements from 7b156489 · 43a4840d
erio authored Feb 07, 2026

43a4840d

feat(antigravity): comprehensive enhancements - model mapping, rate limiting, scheduling & ops · 5e98445b

erio authored Feb 07, 2026

Key changes:
- Upgrade model mapping: Opus 4.5 → Opus 4.6-thinking with precise matching
- Unified rate limiting: scope-level → model-level with Redis snapshot sync
- Load-balanced scheduling by call count with smart retry mechanism
- Force cache billing support
- Model identity injection in prompts with leak prevention
- Thinking mode auto-handling (max_tokens/budget_tokens fix)
- Frontend: whitelist mode toggle, model mapping validation, status indicators
- Gemini session fallback with Redis Trie O(L) matching
- Ops: enhanced concurrency monitoring, account availability, retry logic
- Migration scripts: 049-051 for model mapping unification

5e98445b

fix(antigravity): reduce 429 fallback cooldown from 5min to 30s · 8917afab

erio authored Feb 07, 2026

The default fallback cooldown when rate limit reset time cannot be
parsed was 5 minutes, which is too aggressive and causes accounts
to be unnecessarily locked out. Reduce to 30 seconds for faster
recovery. Config override still works (unit remains minutes).

8917afab

fix(antigravity): auto-fix max_tokens <= budget_tokens causing 400 error · 49233ec2

erio authored Feb 07, 2026

When extended thinking is enabled, Claude API requires max_tokens >
thinking.budget_tokens. If misconfigured, this auto-adjusts max_tokens
to budget_tokens + 1000 instead of returning a 400 error.

- Add ensureMaxTokensGreaterThanBudget helper function
- Extract Gemini25FlashThinkingBudgetLimit constant (24576)
- Log adjustment for debugging

49233ec2

fix: 账号测试根据类型使用不同的 beta header · 39a5b17d

shaw authored Feb 07, 2026

- OAuth 账号：使用完整的 DefaultBetaHeader 和 Claude Code 客户端 headers
- API Key 账号：使用 APIKeyBetaHeader（不含 oauth beta）

39a5b17d

chore(version): 更新版本号至 0.1.70.1 · 782a54a8
yangjianbo authored Feb 07, 2026

782a54a8
fix: ix: antigravity 添加 aude-opus-4-6-thinking 模型支持 · 5299f3dc
shaw authored Feb 07, 2026

5299f3dc
fix: make error passthrough effective for non-failover upstream errors · 7b156489
shaw authored Feb 07, 2026

7b156489
test(codex): 清理无用的 opencode 缓存测试 · 4e01126f
yangjianbo authored Feb 07, 2026
```
移除不再需要的 setupCodexCache 调用与辅助函数（已不再回源/读写缓存）
```
4e01126f

feat(codex): 移除 opencode 指令回源与缓存 · 55b56328

yangjianbo authored Feb 07, 2026

- 不再从 GitHub 拉取 opencode codex_header.txt\n- 删除 ~/.opencode 缓存与异步刷新逻辑\n- 所有 instructions 统一使用内置 codex_cli_instructions.md

55b56328

feat(gateway): 支持强制 Codex CLI 模式并伪装 UA · ce764bf2

yangjianbo authored Feb 07, 2026

- Codex CLI 请求仅使用内置 instructions，不再读取 opencode 缓存/回源\n- 新增 gateway.force_codex_cli（环境变量 GATEWAY_FORCE_CODEX_CLI）\n- ForceCodexCLI=true 时转发上游强制 User-Agent=codex_cli_rs/0.0.0\n- 更新 deploy 示例配置

ce764bf2

06 Feb, 2026 10 commits

perf(service): SSE Scanner buffer 改用 sync.Pool 复用，减少高并发 GC 压力 · d71537d4

yangjianbo authored Feb 06, 2026



将流式响应中 bufio.Scanner 的 64KB buffer 从每次 make 分配改为
sync.Pool 复用，统一切片表达式为 [:0]、变量命名为 scanBuf，
并补充对应的单元测试。
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

d71537d4

perf(service): jitterTTL 改用 rand/v2 并移除锁 · ae1ba453
yangjianbo authored Feb 06, 2026

ae1ba453
perf(service): 移除 jitter 随机数全局锁 · c4182f8c
yangjianbo authored Feb 06, 2026

c4182f8c
chore(gateway): 提升 max_idle_conns 并补齐 env · 8672b2f3
yangjianbo authored Feb 06, 2026

8672b2f3
chore(deploy): 补齐连接池默认与 8G 参数 · de753a14
yangjianbo authored Feb 06, 2026

de753a14
feat: 优化codex冷启动, 还有连接池数据库配置信息 · 2d4bbbf4
yangjianbo authored Feb 06, 2026

2d4bbbf4

fix(ops): 添加 token 相关字段白名单避免误脱敏 · 9f4c1ef9

shaw authored Feb 06, 2026

在敏感字段检测中添加白名单，排除 API 参数和用量统计字段：
- max_tokens, max_completion_tokens, max_output_tokens
- completion_tokens, prompt_tokens, total_tokens
- input_tokens, output_tokens
- cache_creation_input_tokens, cache_read_input_tokens

这些字段名虽然包含 "token" 但只是数值参数，不应被脱敏处理。

9f4c1ef9

fix(gateway): 移除 PR #316 引入的工具名转换逻辑 · d182ef03

shaw authored Feb 06, 2026

移除响应阶段的工具名/schema/description 转换逻辑，修复第三方工具调用时
工具名被错误转换的问题（如 Task → task）。

移除内容：
- 工具名相关正则变量（toolPrefixRe, toolNameBoundaryRe 等）
- openCodeToolOverrides 和 claudeToolNameOverrides 映射表
- 工具名转换函数（normalizeToolNameForClaude, normalizeToolNameForOpenCode 等）
- 响应体工具名替换函数（replaceToolNamesInText, replaceToolNamesInResponseBody 等）
- 参数名转换函数（normalizeParamNameForOpenCode, rewriteParamKeysInValue）
- 工具描述清理函数（sanitizeToolDescription）
- 输入 schema 转换函数（normalizeToolInputSchema）
- 模型 ID 正则替换函数（replaceModelIDInText）

保留内容：
- 系统提示词清理（sanitizeSystemText）
- Claude Code 指纹 headers 处理
- 模型 ID 映射（通过 JSON 对象操作）

d182ef03

test(backend): 修复 usage 类型断言未检查 · ee01f80d
yangjianbo authored Feb 06, 2026

ee01f80d

fix(兼容): 将 Kimi cached_tokens 映射到 Claude 标准 cache_read_input_tokens · f33a9501

yangjianbo authored Feb 06, 2026

Kimi 等 Claude 兼容 API 返回缓存信息使用 OpenAI 风格的 cached_tokens 字段，
而非 Claude 标准的 cache_read_input_tokens，导致客户端收不到缓存命中信息且
内部计费缓存折扣为 0。

新增 reconcileCachedTokens 辅助函数，在 cache_read_input_tokens == 0 且
cached_tokens > 0 时自动填充，覆盖流式（message_start/message_delta）和
非流式两种响应路径。对 Claude 原生上游无影响。
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

f33a9501