Commits · beceb45d23f568ab9557dd51891c5961c2ae920e · 陈曦 / sub2api

17 Feb, 2026 1 commit

feat: add Cache TTL Override per account + bump VERSION to 0.1.83 · 3d1f03c2

John Doe authored Feb 17, 2026

- Account-level cache TTL override: rewrite Anthropic cache_creation
  token classification (5m↔

1h) in streaming/non-streaming responses
- New DB field cache_ttl_overridden in usage_log for billing tracking
- Migration 055_add_cache_ttl_overridden
- Frontend: CacheTTL override toggle in account create/edit modals
- Ent schema regenerated for new usage_log fields
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

3d1f03c2

11 Feb, 2026 2 commits

feat(admin): Add group filtering for account listings · fe1d46a8

kyx236 authored Feb 12, 2026

- Add groupID parameter to ListAccounts and ListWithFilters methods
- Implement account filtering by group ID in repository query
- Add group query parameter parsing in account handler
- Update all ListAccounts/ListWithFilters call sites with groupID parameter
- Add group filter UI component to AccountTableFilters
- Add i18n translations for group filter label in English and Chinese
- Update API contract and test stubs to reflect new signature
- Enable filtering accounts by their assigned groups in admin panel

fe1d46a8

feat(admin): Add email search and rate limit filtering for accounts and redeem codes · 04a1a7c2

kyx236 authored Feb 11, 2026

- Add used_by_email column to redeem code export CSV for better user identification
- Implement rate_limited status filter in account listing with RateLimitResetAt check
- Extend redeem code search to include user email in addition to code matching
- Add API key search capability to user listing filters
- Display user email in redeem code table used_by column for improved visibility
- Update search placeholders in UI to reflect expanded search capabilities (email, username, notes, API key)
- Improve Chinese and English localization strings for search hints

04a1a7c2

10 Feb, 2026 3 commits

feat(antigravity): 支持 Refresh Token 批量导入创建 OAuth 账号 · c8f87a9c

Tian authored Feb 10, 2026

后端新增 ValidateRefreshToken service 方法和 POST /oauth/refresh-token 端点，
前端新增 API/Composable/UI 集成，OAuthAuthorizationFlow i18n 动态化，
支持在 Antigravity 创建账号时批量粘贴 Refresh Token 自动验证并创建账号。

c8f87a9c

fix: 修复错误透传规则 skip_monitoring 未生效的问题 · 2d4236f7

Edric Li authored Feb 10, 2026

- ops_error_logger: status < 400 分支增加 OpsSkipPassthroughKey 检查
- ops_upstream_context: 新增 checkSkipMonitoringForUpstreamEvent，中间重试/故障转移事件也能触发跳过标记
- gateway_handler/openai_gateway_handler/gemini_v1beta_handler: handleFailoverExhausted 匹配规则后设置 OpsSkipPassthroughKey
- antigravity_gateway_service: writeMappedClaudeError 增加 applyErrorPassthroughRule 调用

2d4236f7

feat: 错误透传规则支持 skip_monitoring 跳过运维监控记录 · d95e04fd

Edric Li authored Feb 10, 2026

在每条错误透传规则上新增 skip_monitoring 选项，开启后匹配该规则的错误
不会被记录到 ops_error_logs，减少监控噪音。默认关闭，不影响现有规则。

d95e04fd

09 Feb, 2026 6 commits

feat: same-account retry before failover for transient errors · d6c2921f

Edric Li authored Feb 10, 2026

For retryable transient errors (Google 400 "invalid project resource name"
and empty stream responses), retry on the same account up to 2 times
(with 500ms delay) before switching to another account.

- Add RetryableOnSameAccount field to UpstreamFailoverError
- Add same-account retry loop in both Gemini and Claude/OpenAI handler paths
- Move temp-unschedule from service layer to handler layer (only after
  all same-account retries exhausted)
- Reduce temp-unschedule cooldown from 30 minutes to 1 minute

d6c2921f

test: 添加单账号 503 退避重试机制的单元测试 · e4bc3515

Rose Ding authored Feb 09, 2026



覆盖 Service 层和 Handler 层的所有新增逻辑：
- isSingleAccountRetry context 标记检查
- handleSmartRetry 中 503 + SingleAccountRetry 分支
- handleSingleAccountRetryInPlace 原地重试逻辑
- antigravityRetryLoop 预检查跳过限流
- sleepAntigravitySingleAccountBackoff 固定延迟退避
- 端到端集成场景验证
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

e4bc3515

fix: 单账号分组首次 503 不设模型限流标记，避免后续请求雪崩 · 021abfca

Rose Ding authored Feb 09, 2026

单账号 antigravity 分组收到 503 (MODEL_CAPACITY_EXHAUSTED) 时，
原逻辑会设置 ~29s 模型限流标记。由于只有一个账号无法切换，
后续所有新请求在预检查时命中限流 → 几毫秒内直接返回 503，
导致约 30 秒的雪崩窗口。

修复：在 Handler 入口处检查分组是否只有单个 antigravity 账号，
如果是则提前设置 SingleAccountRetry context 标记，让 Service 层
首次 503 就走原地重试逻辑（不设限流标记），避免污染后续请求。

021abfca

feat: 添加 Antigravity 单账号 503 退避重试机制 · f6cfab99

Rose Ding authored Feb 09, 2026

当分组内只有一个可用账号且上游返回 503 (MODEL_CAPACITY_EXHAUSTED) 时，
不再设置模型限流+切换账号（因为切换回来还是同一个账号），而是在 Service 层
原地等待+重试，避免双重等待问题。

主要变更：
- Handler 层：检测单账号 503 场景，清除排除列表并设置 SingleAccountRetry 标记
- Service 层：新增 handleSingleAccountRetryInPlace 原地重试逻辑
- Service 层：预检查跳过单账号模式下的限流检查
- 新增 ctxkey.SingleAccountRetry 上下文标记

f6cfab99

feat(admin): 新增 CRS 同步预览和账号选择功能 · 5e0d7894

QTom authored Feb 09, 2026

- 后端新增 PreviewFromCRS 接口，允许用户先预览 CRS 中的账号
- 后端支持在同步时选择特定账号，不选中的账号将被跳过
- 前端重构 SyncFromCrsModal 为三步向导：输入凭据 → 预览账号 → 执行同步
- 改进表单无障碍性：添加 for/id 关联和 required 属性
- 修复 Back 按钮返回时的状态清理
- 新增 buildSelectedSet 和 shouldCreateAccount 的单元测试
- 完整的向后兼容性：旧客户端不发送 selected_account_ids 时行为不变

5e0d7894

refactor: replace scope-level rate limiting with model-level rate limiting · fc095bf0

erio authored Feb 09, 2026

Merge functional changes from develop branch:
- Remove AntigravityQuotaScope system (claude/gemini_text/gemini_image)
- Replace with per-model rate limiting using resolveAntigravityModelKey
- Remove model load statistics (IncrModelCallCount/GetModelLoadBatch)
- Simplify account selection to unified priority→load→LRU algorithm
- Remove SetAntigravityQuotaScopeLimit from AccountRepository
- Clean up scope-related UI indicators and API fields

fc095bf0

08 Feb, 2026 9 commits

refactor: replace Trie-based digest session store with flat cache · b889d501
erio authored Feb 09, 2026

b889d501
fix: ensure sticky session failover triggers cache billing exemption · 72b08f9c
erio authored Feb 09, 2026

72b08f9c
feat: add linear delay between Antigravity account failover switches · 681950da
erio authored Feb 09, 2026

681950da

fix: parse Gemini native request format in ParseGatewayRequest for correct session hash generation · 35598d56

erio authored Feb 09, 2026

ParseGatewayRequest only parsed Anthropic format (system/messages),
ignoring Gemini native format (systemInstruction/contents). This caused
GenerateSessionHash to produce identical hashes for all Gemini sessions.

Add protocol parameter to ParseGatewayRequest to branch between
Anthropic and Gemini parsing. Update GenerateSessionHash message
traversal to extract text from both formats.

35598d56

fix: prevent sessionHash collision for different users with same messages · 5c76b9e4

erio authored Feb 09, 2026

Mix SessionContext (ClientIP, UserAgent, APIKeyID) into
GenerateSessionHash 3rd-level fallback to differentiate requests
from different users sending identical content.

Also switch hashContent from SHA256-truncated to XXHash64 for
better performance, and optimize Trie Lua script to match from
longest prefix first.

5c76b9e4

fix: clean thoughtSignature for all clients, not just CLI · 0b8fea4c

erio authored Feb 09, 2026

Previously, thoughtSignature cleanup only applied to Gemini CLI
requests (detected via x-gemini-api-privileged-user-id header or
tmp dir pattern). This caused 400 errors for non-CLI clients when
session cache expired and they sent stale signatures.

Remove the isGeminiCLIRequest guard so all clients benefit from
proactive thoughtSignature cleanup on session binding miss.

0b8fea4c

feat(admin): add drag-and-drop group sort order · bac9e2bf

bayma888 authored Feb 08, 2026

- Add `sort_order` field to groups table with migration
- Add `PUT /api/v1/admin/groups/sort-order` API for batch update
- Implement drag-and-drop UI using vue-draggable-plus
- All queries now order groups by sort_order
- Add i18n support (en/zh) for sort-related UI text
- Update test stubs to satisfy new interface methods

bac9e2bf

feat(ui): 用户列表页显示当前并发数 · e4d74ae1

shaw authored Feb 08, 2026

优化 /admin/users 页面的并发数列，显示「当前/最大」格式，
参考 AccountCapacityCell 的设计风格。

- 后端 UserHandler 注入 ConcurrencyService，批量查询用户当前并发数
- 新增 UserConcurrencyCell 组件，支持颜色状态（空闲灰/使用中黄/满载红）
- 前端 AdminUser 类型添加 current_concurrency 字段

e4d74ae1

refactor(upstream): replace upstream account type with apikey, auto-append /antigravity · fb58560d

erio authored Feb 08, 2026

Upstream accounts now use the standard APIKey type instead of a dedicated
upstream type. GetBaseURL() and new GetGeminiBaseURL() automatically append
/antigravity for Antigravity platform APIKey accounts, eliminating the need
for separate upstream forwarding methods.

- Remove ForwardUpstream, ForwardUpstreamGemini, testUpstreamConnection
- Remove upstream branch guards in Forward/ForwardGemini/TestConnection
- Add migration 052 to convert existing upstream accounts to apikey
- Update frontend CreateAccountModal to create apikey type
- Add unit tests for GetBaseURL and GetGeminiBaseURL

fb58560d

07 Feb, 2026 7 commits

fix: 收敛 Claude Code 探测拦截并补齐回归测试 · 6aaa4aee
shaw authored Feb 07, 2026

6aaa4aee

refactor: remove Anthropic digest chain from Messages handler · 86b503f8

erio authored Feb 07, 2026

The digest chain fallback is only needed for Gemini endpoints, not
for the Anthropic Messages API path. Remove the handler integration
while keeping the reusable service/repository layer for future use.

86b503f8

feat: add Anthropic sticky session digest chain matching via Trie · 50a783ff

erio authored Feb 07, 2026

The previous fallback (step 3) in GenerateSessionHash hashed system +
all messages together, producing a different hash each round as the
conversation grew ([a] -> [a,b] -> [a,b,c]). This made fallback sticky
sessions ineffective for multi-turn conversations.

Implement per-message Trie digest chain matching (reusing Gemini's Trie
infrastructure) so that the previous round's chain is always a prefix
of the current round's chain, enabling reliable session affinity.

50a783ff

fix(gateway): harden digest logging and align antigravity ops · 1439eb39

shaw authored Feb 07, 2026

- avoid panic by using safe UUID prefix truncation in Gemini digest fallback logs\n- remove unconditional Antigravity 429 full-body debug logs and honor log truncation config\n- align Antigravity quick preset mappings to opus 4.6-thinking targets only\n- restore scope rate-limit aggregation/output in ops availability stats

1439eb39

fix: restore non-failover error passthrough from 7b156489 · edb09370
erio authored Feb 07, 2026

edb09370

feat(antigravity): comprehensive enhancements - model mapping, rate limiting, scheduling & ops · 5e98445b

erio authored Feb 07, 2026

Key changes:
- Upgrade model mapping: Opus 4.5 → Opus 4.6-thinking with precise matching
- Unified rate limiting: scope-level → model-level with Redis snapshot sync
- Load-balanced scheduling by call count with smart retry mechanism
- Force cache billing support
- Model identity injection in prompts with leak prevention
- Thinking mode auto-handling (max_tokens/budget_tokens fix)
- Frontend: whitelist mode toggle, model mapping validation, status indicators
- Gemini session fallback with Redis Trie O(L) matching
- Ops: enhanced concurrency monitoring, account availability, retry logic
- Migration scripts: 049-051 for model mapping unification

5e98445b

fix: make error passthrough effective for non-failover upstream errors · 7b156489
shaw authored Feb 07, 2026

7b156489

05 Feb, 2026 10 commits

feat: 新增全局错误透传规则功能 · 39e05a2d

shaw authored Feb 05, 2026

支持管理员配置上游错误如何返回给客户端：
- 新增 ErrorPassthroughRule 数据模型和 Ent Schema
- 实现规则的 CRUD API（/admin/error-passthrough-rules）
- 支持按错误码、关键词匹配，支持 any/all 匹配模式
- 支持按平台过滤（anthropic/openai/gemini/antigravity）
- 支持透传或自定义响应状态码和错误消息
- 实现两级缓存（Redis + 本地内存）和多实例同步
- 集成到 gateway_handler 的错误处理流程
- 新增前端管理界面组件
- 新增单元测试覆盖核心匹配逻辑

优化：
- 移除 refreshLocalCache 中的冗余排序（数据库已排序）
- 后端 Validate() 增加匹配条件非空校验

39e05a2d

fix: remove unused listAllAccounts · 029994a8
LLLLLLiulei authored Feb 05, 2026

029994a8
fix: harden import/export flow · 37047919
LLLLLLiulei authored Feb 05, 2026

37047919
perf: batch fetch proxies for account export · 0b45d48e
LLLLLLiulei authored Feb 05, 2026

0b45d48e
feat: refine proxy export and toolbar layout · 0c660f83
LLLLLLiulei authored Feb 05, 2026

0c660f83
feat: add proxy import flow · ce9a247a
LLLLLLiulei authored Feb 05, 2026

ce9a247a
feat: add data import/export bundle · b4bd46d0
LLLLLLiulei authored Feb 05, 2026

b4bd46d0
feat: 支持用户专属分组倍率配置 · 2b192f7d
shaw authored Feb 05, 2026

2b192f7d

feat(auth): 实现 Refresh Token 机制 · 49a3c437

shaw authored Feb 05, 2026

- 新增 Access Token + Refresh Token 双令牌认证
- 支持 Token 自动刷新和轮转
- 添加登出和撤销所有会话接口
- 前端实现无感刷新和主动刷新定时器

49a3c437

feat(gateway): filter /v1/usage stats by API Key instead of UserID · fa3ea5ee

JIA-ss authored Feb 05, 2026



Previously the /v1/usage endpoint aggregated usage stats (today/total
tokens, cost, RPM/TPM) across all API Keys belonging to the user.
This made it impossible to distinguish usage from different API Keys
(e.g. balance vs subscription keys).

Now the usage stats are filtered by the current request's API Key ID,
so each key only sees its own usage data. The balance/remaining fields
are unaffected and still reflect the user-level wallet balance.

Changes:
- Add GetAPIKeyDashboardStats to repository interface and implementation
- Add getPerformanceStatsByAPIKey helper (also fixes TPM to include
  cache_creation_tokens and cache_read_tokens)
- Add GetAPIKeyDashboardStats to UsageService
- Update Usage handler to call GetAPIKeyDashboardStats(apiKey.ID)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

fa3ea5ee

03 Feb, 2026 2 commits
- chore: fix gofmt formatting · df1c2383
  shaw authored Feb 03, 2026
  
  df1c2383
- fix(test): add IncrementQuotaUsed to all APIKeyRepository test stubs · be7bc658
  bayma888 authored Feb 03, 2026
```
- Add missing IncrementQuotaUsed method to stubApiKeyRepo in api_contract_test.go
- Fix gofmt formatting issues in api_key_service.go, dto/types.go, api_key_handler.go
```
  be7bc658