Commits · e4d74ae11dd494512ed7d08858429854034aa3f6 · 陈曦 / sub2api

08 Feb, 2026 7 commits

shaw authored Feb 08, 2026

优化 /admin/users 页面的并发数列，显示「当前/最大」格式，
参考 AccountCapacityCell 的设计风格。

- 后端 UserHandler 注入 ConcurrencyService，批量查询用户当前并发数
- 新增 UserConcurrencyCell 组件，支持颜色状态（空闲灰/使用中黄/满载红）
- 前端 AdminUser 类型添加 current_concurrency 字段

e4d74ae1

fix: remove unused upstreamHopByHopHeaders variable to pass golangci-lint · 69816f86
erio authored Feb 08, 2026

69816f86
fix: apikey类型账号test去掉oauth-2025-04-20 · b4ec6578
shaw authored Feb 08, 2026

b4ec6578

refactor(upstream): replace upstream account type with apikey, auto-append /antigravity · fb58560d

erio authored Feb 08, 2026

Upstream accounts now use the standard APIKey type instead of a dedicated
upstream type. GetBaseURL() and new GetGeminiBaseURL() automatically append
/antigravity for Antigravity platform APIKey accounts, eliminating the need
for separate upstream forwarding methods.

- Remove ForwardUpstream, ForwardUpstreamGemini, testUpstreamConnection
- Remove upstream branch guards in Forward/ForwardGemini/TestConnection
- Add migration 052 to convert existing upstream accounts to apikey
- Update frontend CreateAccountModal to create apikey type
- Add unit tests for GetBaseURL and GetGeminiBaseURL

fb58560d

fix(upstream): passthrough response body directly instead of parsing SSE · 6ab77f5e

erio authored Feb 08, 2026

ForwardUpstream/ForwardUpstreamGemini should pipe the upstream response
directly to the client (headers + body), not parse it as SSE stream.

6ab77f5e

fix: add nil guard for gin.Context in header passthrough to satisfy staticcheck SA5011 · 4f57d7f7
erio authored Feb 08, 2026

4f57d7f7

feat(upstream): passthrough all client headers instead of manual header setting · 1563bd3d

erio authored Feb 08, 2026

Replace manual header setting (Content-Type, anthropic-version, anthropic-beta)
with full client header passthrough in ForwardUpstream/ForwardUpstreamGemini.
Only authentication headers (Authorization, x-api-key) are overridden with
upstream account credentials. Hop-by-hop headers are excluded.

Add unit tests covering header passthrough, auth override, and hop-by-hop filtering.

1563bd3d

07 Feb, 2026 22 commits

fix(gateway): restore upstream account forwarding with dedicated methods · 77b66653

erio authored Feb 08, 2026

v0.1.74 merged upstream accounts into the OAuth path, causing requests
to hit the wrong protocol and endpoint. Add three upstream-specific
methods (testUpstreamConnection, ForwardUpstream, ForwardUpstreamGemini)
that use base_url + apiKey auth and passthrough the original body, while
reusing the existing response handling and error/retry logic.

77b66653

feat: smart retry max 1 attempt + clear sticky session on failure · 3077fd27

erio authored Feb 07, 2026

- Change antigravitySmartRetryMaxAttempts from 3 to 1 to prevent
  repeated rate limiting and long waits
- Clear sticky session binding (DeleteSessionAccountID) after smart
  retry exhaustion, so subsequent requests don't hit the same
  rate-limited account
- Add flow diagrams to Forward/ForwardGemini doc comments
- Add comprehensive unit tests covering:
  - Sticky session cleared on retry failure (429, 503, network error)
  - Sticky session NOT cleared on retry success
  - Sticky session NOT cleared for non-sticky requests (empty hash)
  - Sticky session NOT cleared on long delay path (handled by handler)
  - Nil cache safety (no panic)
  - MaxAttempts constant verification
  - End-to-end retryLoop → switchError propagation with session clear

3077fd27

fix: 收敛 Claude Code 探测拦截并补齐回归测试 · 6aaa4aee
shaw authored Feb 07, 2026

6aaa4aee
fix(lint): handle errcheck for strings.Builder.WriteString · e3748da8
erio authored Feb 07, 2026

e3748da8

refactor: remove Anthropic digest chain from Messages handler · 86b503f8

erio authored Feb 07, 2026

The digest chain fallback is only needed for Gemini endpoints, not
for the Anthropic Messages API path. Remove the handler integration
while keeping the reusable service/repository layer for future use.

86b503f8

feat: add Anthropic sticky session digest chain matching via Trie · 50a783ff

erio authored Feb 07, 2026

The previous fallback (step 3) in GenerateSessionHash hashed system +
all messages together, producing a different hash each round as the
conversation grew ([a] -> [a,b] -> [a,b,c]). This made fallback sticky
sessions ineffective for multi-turn conversations.

Implement per-message Trie digest chain matching (reusing Gemini's Trie
infrastructure) so that the previous round's chain is always a prefix
of the current round's chain, enabling reliable session affinity.

50a783ff

fix(gateway): harden digest logging and align antigravity ops · 1439eb39

shaw authored Feb 07, 2026

- avoid panic by using safe UUID prefix truncation in Gemini digest fallback logs\n- remove unconditional Antigravity 429 full-body debug logs and honor log truncation config\n- align Antigravity quick preset mappings to opus 4.6-thinking targets only\n- restore scope rate-limit aggregation/output in ops availability stats

1439eb39

refactor: simplify sticky session rate limit handling — switch immediately on any rate limit · e1a68497

erio authored Feb 07, 2026

Remove threshold-based waiting in both sticky session and antigravity
pre-check paths. When a model is rate-limited, immediately clear the
sticky session and switch accounts instead of waiting for short durations.

e1a68497

fix(test): update test calls to match method receivers on handleSmartRetry and antigravityRetryLoop · fa28dcbf
erio authored Feb 07, 2026

fa28dcbf

fix(antigravity): fetch default mapping from API and sync Redis on rate limit · 2656320d

erio authored Feb 07, 2026

1. Frontend: replace hardcoded antigravityDefaultMappings with async
fetch from GET /admin/accounts/antigravity/default-model-mapping,
eliminating the duplicate data source that caused frontend/backend
mapping inconsistency.

2. Backend: convert handleSmartRetry and antigravityRetryLoop from
standalone functions to AntigravityGatewayService methods, enabling
Redis cache sync (updateAccountModelRateLimitInCache) after both
rate-limit write paths — long-delay branch and retry-exhausted branch.

2656320d

style: fix gofmt formatting in gateway_service.go · b4f6c4f9
erio authored Feb 07, 2026
```
Remove extra blank line that caused golangci-lint gofmt check to fail.
```
b4f6c4f9
refactor: remove unused IsAntigravityModelSupported function and its tests · 14c6c932
erio authored Feb 07, 2026

14c6c932

test(antigravity): add missing unit tests for upstream and custom model_mapping · 386126b1

erio authored Feb 07, 2026

- Add GetAccessToken upstream branch tests (success/failure/empty/nil)
- Add mapAntigravityModel wildcard-target-equals-request edge case tests
- Add upstream account smart retry test case
- Add GeminiMessagesCompatService custom model_mapping and empty model tests

386126b1

fix(antigravity): support upstream accounts and custom model_mapping in scheduling · de092728

erio authored Feb 07, 2026

- GetAccessToken: add upstream branch to read api_key from credentials
- shouldTriggerAntigravitySmartRetry: relax check from IsOAuth to Platform-based
- isModelSupportedByAccount/WithContext: replace IsAntigravityModelSupported
  whitelist with mapAntigravityModel for unified scheduling/forwarding logic
- mapAntigravityModel: fix edge case where wildcard target equals request model
- Update tests for new behavior and add custom model_mapping test cases

de092728

fix: restore non-failover error passthrough from 7b156489 · edb09370
erio authored Feb 07, 2026

edb09370
fix: restore error passthrough service improvements from 7b156489 · 43a4840d
erio authored Feb 07, 2026

43a4840d

feat(antigravity): comprehensive enhancements - model mapping, rate limiting, scheduling & ops · 5e98445b

erio authored Feb 07, 2026

Key changes:
- Upgrade model mapping: Opus 4.5 → Opus 4.6-thinking with precise matching
- Unified rate limiting: scope-level → model-level with Redis snapshot sync
- Load-balanced scheduling by call count with smart retry mechanism
- Force cache billing support
- Model identity injection in prompts with leak prevention
- Thinking mode auto-handling (max_tokens/budget_tokens fix)
- Frontend: whitelist mode toggle, model mapping validation, status indicators
- Gemini session fallback with Redis Trie O(L) matching
- Ops: enhanced concurrency monitoring, account availability, retry logic
- Migration scripts: 049-051 for model mapping unification

5e98445b

fix(antigravity): reduce 429 fallback cooldown from 5min to 30s · 8917afab

erio authored Feb 07, 2026

The default fallback cooldown when rate limit reset time cannot be
parsed was 5 minutes, which is too aggressive and causes accounts
to be unnecessarily locked out. Reduce to 30 seconds for faster
recovery. Config override still works (unit remains minutes).

8917afab

fix(antigravity): auto-fix max_tokens <= budget_tokens causing 400 error · 49233ec2

erio authored Feb 07, 2026

When extended thinking is enabled, Claude API requires max_tokens >
thinking.budget_tokens. If misconfigured, this auto-adjusts max_tokens
to budget_tokens + 1000 instead of returning a 400 error.

- Add ensureMaxTokensGreaterThanBudget helper function
- Extract Gemini25FlashThinkingBudgetLimit constant (24576)
- Log adjustment for debugging

49233ec2

fix: 账号测试根据类型使用不同的 beta header · 39a5b17d

shaw authored Feb 07, 2026

- OAuth 账号：使用完整的 DefaultBetaHeader 和 Claude Code 客户端 headers
- API Key 账号：使用 APIKeyBetaHeader（不含 oauth beta）

39a5b17d

fix: ix: antigravity 添加 aude-opus-4-6-thinking 模型支持 · 5299f3dc
shaw authored Feb 07, 2026

5299f3dc
fix: make error passthrough effective for non-failover upstream errors · 7b156489
shaw authored Feb 07, 2026

7b156489

06 Feb, 2026 6 commits

fix(ops): 添加 token 相关字段白名单避免误脱敏 · 9f4c1ef9

shaw authored Feb 06, 2026

在敏感字段检测中添加白名单，排除 API 参数和用量统计字段：
- max_tokens, max_completion_tokens, max_output_tokens
- completion_tokens, prompt_tokens, total_tokens
- input_tokens, output_tokens
- cache_creation_input_tokens, cache_read_input_tokens

这些字段名虽然包含 "token" 但只是数值参数，不应被脱敏处理。

9f4c1ef9

fix(gateway): 移除 PR #316 引入的工具名转换逻辑 · d182ef03

shaw authored Feb 06, 2026

移除响应阶段的工具名/schema/description 转换逻辑，修复第三方工具调用时
工具名被错误转换的问题（如 Task → task）。

移除内容：
- 工具名相关正则变量（toolPrefixRe, toolNameBoundaryRe 等）
- openCodeToolOverrides 和 claudeToolNameOverrides 映射表
- 工具名转换函数（normalizeToolNameForClaude, normalizeToolNameForOpenCode 等）
- 响应体工具名替换函数（replaceToolNamesInText, replaceToolNamesInResponseBody 等）
- 参数名转换函数（normalizeParamNameForOpenCode, rewriteParamKeysInValue）
- 工具描述清理函数（sanitizeToolDescription）
- 输入 schema 转换函数（normalizeToolInputSchema）
- 模型 ID 正则替换函数（replaceModelIDInText）

保留内容：
- 系统提示词清理（sanitizeSystemText）
- Claude Code 指纹 headers 处理
- 模型 ID 映射（通过 JSON 对象操作）

d182ef03

test(backend): 修复 usage 类型断言未检查 · ee01f80d
yangjianbo authored Feb 06, 2026

ee01f80d

fix(兼容): 将 Kimi cached_tokens 映射到 Claude 标准 cache_read_input_tokens · f33a9501

yangjianbo authored Feb 06, 2026

Kimi 等 Claude 兼容 API 返回缓存信息使用 OpenAI 风格的 cached_tokens 字段，
而非 Claude 标准的 cache_read_input_tokens，导致客户端收不到缓存命中信息且
内部计费缓存折扣为 0。

新增 reconcileCachedTokens 辅助函数，在 cache_read_input_tokens == 0 且
cached_tokens > 0 时自动填充，覆盖流式（message_start/message_delta）和
非流式两种响应路径。对 Claude 原生上游无影响。
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

f33a9501

chore: 前端增加opus4.6模型映射 · 01b08e1e
shaw authored Feb 06, 2026

01b08e1e

fix(兼容): 将 Kimi cached_tokens 映射到 Claude 标准 cache_read_input_tokens · c6a456c7

yangjianbo authored Feb 06, 2026

c6a456c7

05 Feb, 2026 5 commits

fix(计费): gpt-5.3-codex 定价回退到 gpt-5.2-codex · a38bd413
yangjianbo authored Feb 06, 2026

a38bd413
feat(模型): 添加 gpt-5.3 Codex 映射与价格配置 · 9e1535e2
yangjianbo authored Feb 06, 2026

9e1535e2
fix: 修复了 codex 更新用量窗口异常的 bug · 037a4099
iBenzene authored Feb 06, 2026

037a4099

fix: 修复管理页面活跃会话数始终显示为0的问题 · ae1934f7

shaw authored Feb 05, 2026

问题原因：Redis Pipeline 执行 Lua 脚本时出现 NOSCRIPT 错误，
因为 redis.NewScript 使用 EVALSHA 执行脚本，当 Redis 重启或
脚本未被缓存时，Pipeline 模式无法自动回退到 EVAL。

解决方案：在 NewSessionLimitCache 初始化时预加载所有 Lua 脚本
到 Redis，确保后续 Pipeline 执行时脚本已被缓存。

ae1934f7

feat: 新增全局错误透传规则功能 · 39e05a2d

shaw authored Feb 05, 2026

支持管理员配置上游错误如何返回给客户端：
- 新增 ErrorPassthroughRule 数据模型和 Ent Schema
- 实现规则的 CRUD API（/admin/error-passthrough-rules）
- 支持按错误码、关键词匹配，支持 any/all 匹配模式
- 支持按平台过滤（anthropic/openai/gemini/antigravity）
- 支持透传或自定义响应状态码和错误消息
- 实现两级缓存（Redis + 本地内存）和多实例同步
- 集成到 gateway_handler 的错误处理流程
- 新增前端管理界面组件
- 新增单元测试覆盖核心匹配逻辑

优化：
- 移除 refreshLocalCache 中的冗余排序（数据库已排序）
- 后端 Validate() 增加匹配条件非空校验

39e05a2d