Commits · 4b41e898a41e11a8cf8443c465a508eabd4fa667 · 陈曦 / sub2api

16 Mar, 2026 6 commits

feat(dashboard): add per-user drill-down for group, model, and endpoint distributions · 4b41e898

erio authored Mar 16, 2026

Click on a group name, model name, or endpoint name in the distribution
tables to expand and show per-user usage breakdown (requests, tokens,
actual cost, standard cost).

Backend: new GET /admin/dashboard/user-breakdown API with group_id,
model, endpoint, endpoint_type filters.
Frontend: clickable rows with expand/collapse sub-table in all three
distribution charts.

4b41e898

fix(antigravity): add stream keepalive to prevent connection drops · d7957343

kunish authored Mar 16, 2026

Antigravity streaming handlers were missing the keepalive mechanism
that exists in the standard gateway, causing proxy/CDN idle timeouts
to break connections during long thinking phases (e.g. claude-opus-4-6).
This resulted in truncated responses with missing tool calls.

Add StreamKeepaliveInterval support to all three Antigravity streaming
paths: Claude SSE, Gemini SSE, and upstream passthrough.

d7957343

fix: always attach OpenAI 5h/7d window stats regardless of zero values · fa782e70

Ethan0x0000 authored Mar 16, 2026

Removes hasMeaningfulWindowStats guard so the /usage endpoint consistently
returns WindowStats for both time windows. The frontend now controls
zero-value display filtering at the component level.

fa782e70

fix: allow empty extra payload to clear account quota limits · afd72abc

Ethan0x0000 authored Mar 16, 2026

UpdateAccount previously required len(input.Extra) > 0, causing explicit
empty payloads (extra:{}) to be silently skipped. Change condition to
input.Extra != nil so clearing quota keys actually persists.

afd72abc

fix(gateway): WS 连接池条件式 MarkBroken 防止跨请求串流 · 3741617e

QTom authored Mar 16, 2026

正常终端事件（response.completed 等）退出后连接归还复用，
仅异常路径（读写错误、error 事件、客户端断连）MarkBroken 销毁。

Generate 模式:
- 引入 cleanExit 标记，仅在 isTerminalEvent break 时设置 true
- defer 中根据 cleanExit 决定是否 MarkBroken
- 所有异常路径已在各自分支中提前调用 MarkBroken

Ingress 模式:
- 引入 lastTurnClean 标记，sendAndRelay 正常完成时设为 true
- releaseSessionLease 根据 lastTurnClean 决定是否 MarkBroken
- 错误路径重置 lastTurnClean = false
- 客户端断连后 drain 仍保守 MarkBroken（L2916）

3741617e

fix(gateway): 防止 OpenAI Codex 跨用户串流 · ab4e8b2c

QTom authored Mar 16, 2026

根因：多个用户共享同一 OAuth 账号时，conversation_id/session_id 头
未做用户隔离，导致上游 chatgpt.com 将不同用户的请求关联到同一会话。

HTTP SSE 修复:
- 新增 isolateOpenAISessionID(apiKeyID, raw)，将 API Key ID 混入
  session 标识符（xxhash），确保不同 Key 的用户产生不同上游会话
- buildUpstreamRequest: OAuth 分支先 Del 客户端透传的 session 头，
  再用隔离值覆盖
- buildUpstreamRequestOpenAIPassthrough: 透传路径同样隔离
- ForwardAsAnthropic: Anthropic Messages 兼容路径同步修复
- buildOpenAIWSHeaders: WS 路径的 OAuth session 头同步隔离

ab4e8b2c

15 Mar, 2026 16 commits

fix: resolve golangci-lint issues (gofmt, errcheck) · 552a4b99

erio authored Mar 16, 2026

- Fix gofmt alignment in admin_service.go and trailing newline in
  antigravity_credits_overages.go
- Suppress errcheck for fmt.Sscanf in client.go GetMinimumAmount

552a4b99

fix: remove ClaudeMax references not yet in upstream/main · 0d2061b2

erio authored Mar 16, 2026

Remove SimulateClaudeMaxEnabled field and related logic from
admin_service.go, and remove applyClaudeMaxCacheBillingPolicyToUsage,
applyClaudeMaxNonStreamingRewrite, setupClaudeMaxStreamingHook calls
from antigravity_gateway_service.go. These symbols are not yet
available in upstream/main.

0d2061b2

refactor: replace sync.Map credits state with AICredits rate limit key · 8a260def

erio authored Mar 16, 2026

Replace process-memory sync.Map + per-model runtime state with a single
"AICredits" key in model_rate_limits, making credits exhaustion fully
isomorphic with model-level rate limiting.

Scheduler: rate-limited accounts with overages enabled + credits available
are now scheduled instead of excluded.

Forwarding: when model is rate-limited + credits available, inject credits
proactively without waiting for a 429 round trip.

Storage: credits exhaustion stored as model_rate_limits["AICredits"] with
5h duration, reusing SetModelRateLimit/isRateLimitActiveForKey.

Frontend: show credits_active (yellow ⚡) when model rate-limited but
credits available, credits_exhausted (red) when AICredits key active.

Tests: add unit tests for shouldMarkCreditsExhausted, injectEnabledCreditTypes,
clearCreditsExhausted, and update existing overages tests.

8a260def

feat: enhance Antigravity account overages handling and improve UI credit display · f3f19d35
SilentFlower authored Mar 16, 2026

f3f19d35
feat: add AI Credits balance handling and update model status indicators · ced90e1d
SilentFlower authored Mar 15, 2026

ced90e1d
feat: implement resolveCreditsOveragesModelKey function to stabilize model key... · 17e40333
SilentFlower authored Mar 15, 2026
```
feat: implement resolveCreditsOveragesModelKey function to stabilize model key resolution for credit overages
```
17e40333
fix: suppress SA4006 unused value warning in Path A branch · 044d3a01
erio authored Mar 16, 2026

044d3a01

feat: unified OAuth token refresh API with distributed locking · 1fc9dd7b

erio authored Mar 16, 2026

Introduce OAuthRefreshAPI as the single entry point for all OAuth token
refresh operations, eliminating the race condition where background
refresh and inline refresh could simultaneously use the same
refresh_token (fixes #1035).

Key changes:
- Add OAuthRefreshExecutor interface extending TokenRefresher with CacheKey
- Add OAuthRefreshAPI.RefreshIfNeeded with lock → DB re-read → double-check flow
- Add ProviderRefreshPolicy / BackgroundRefreshPolicy strategy types
- Simplify all 4 TokenProviders to delegate to OAuthRefreshAPI
- Rewrite TokenRefreshService.refreshWithRetry to use unified API path
- Add MergeCredentials and BuildClaudeAccountCredentials helpers
- Add 40 unit tests covering all new and modified code paths

1fc9dd7b

feat: add InboundEndpoint/UpstreamEndpoint fields to non-OpenAI usage records · 1b79b0f3

Ethan0x0000 authored Mar 15, 2026

Extend RecordUsageInput and RecordUsageLongContextInput structs with InboundEndpoint and UpstreamEndpoint so that Claude, Gemini, and Sora handlers can record endpoint info alongside OpenAI handlers.

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

1b79b0f3

fix: 重置密码功能新增UI配置发送邮件域名 · ae44a943
shaw authored Mar 15, 2026

ae44a943
fix: 移除 Gemini 不支持的 patternProperties 字段 #795 · 90b38381
IanShaw027 authored Mar 15, 2026

90b38381

feat(ops): add ignore insufficient balance errors toggle and extract error constants · cfe72159

erio authored Mar 15, 2026

- Add 5th error filter switch IgnoreInsufficientBalanceErrors to suppress
  upstream insufficient balance / insufficient_quota errors from ops log
- Extract hardcoded error strings into package-level constants for
  shouldSkipOpsErrorLog, normalizeOpsErrorType, classifyOpsPhase, and
  classifyOpsIsBusinessLimited
- Define ErrNoAvailableAccounts sentinel error and replace all
  errors.New("no available accounts") call sites
- Update tests to use require.ErrorIs with the sentinel error

cfe72159

fix(billing): allow clearing group quota limits and treat 0 as zero-limit · 5899784a

erio authored Mar 15, 2026

Previously, v-model.number produced "" when input was cleared, causing
JSON decode errors on the backend. Also, normalizeLimit treated 0 as
"unlimited" which prevented setting a zero quota. Now "" is converted
to null (unlimited) in frontend, and 0 is preserved as a valid limit.

Closes Wei-Shaw/sub2api#1021

5899784a

fix(billing): treat nil rate limit window as expired to prevent usage accumulation · 9e8959c5

erio authored Mar 15, 2026

When Redis cache is populated from DB with a NULL window_1d_start, the
Lua increment script only updates usage counters without setting window
timestamps. IsWindowExpired(nil) previously returned false, so the
accumulated usage was never reset across time windows, effectively
turning usage_1d into a lifetime counter. Once this exceeded
rate_limit_1d the key was incorrectly blocked with "日限额已用完".

Fixes Wei-Shaw/sub2api#1022

9e8959c5

fix: extract and log Claude output_config.effort in usage records · 1bff2292

YanzheL authored Mar 15, 2026

Claude's output_config.effort parameter (low/medium/high/max) was not
being extracted from requests or logged in the reasoning_effort column
of usage logs. Only the OpenAI path populated this field.

Changes:
- Extract output_config.effort in ParseGatewayRequest
- Add ReasoningEffort field to ForwardResult
- Populate reasoning_effort in both RecordUsage and RecordUsageWithLongContext
- Guard against overwriting service-set effort values in handler
- Update stale comments that described reasoning_effort as OpenAI-only
- Add unit tests for extraction, normalization, and persistence

1bff2292

feat: 完善使用记录端点可观测性与分布统计 · eefab159

Ethan0x0000 authored Mar 15, 2026

将入站、上游与路径三类端点分布统一到使用记录页的一致化卡片交互中，并补齐端点元数据与统计链路，提升排障与流量分析效率。

eefab159

14 Mar, 2026 10 commits

fix: remove unused saveRecords method to pass lint · 39f8bd91
shaw authored Mar 14, 2026

39f8bd91

fix(ops): tune aggregation constants to prevent PG overload · f59b66b7

erio authored Mar 14, 2026

Increase MAX(bucket_start) query timeout from 3s to 5s to reduce
timeout-induced fallbacks. Shrink backfill window from 30 days to
1 hour so that fallback recomputation stays lightweight instead of
scanning the entire retention range.

f59b66b7

fix: 按 review 意见重构数据库备份服务（安全性 + 架构 + 健壮性） · 1047f973

Rose Ding authored Mar 14, 2026

1. S3 凭证加密存储：使用 SecretEncryptor (AES-256-GCM) 加密 SecretAccessKey，
防止备份文件中泄露 S3 凭证，兼容旧的未加密数据
2. 修复 saveRecord 竞态条件：添加 recordsMu 互斥锁保护 records 的 load/save
3. 恢复操作增加服务端验证：handler 层要求重新输入管理员密码，通过 bcrypt
校验，前端弹出密码输入框
4. pg_dump/psql/S3 操作抽象为接口：定义 DBDumper 和 BackupObjectStore 接口，
实现放入 repository 层，遵循项目依赖注入架构规范
5. 改为流式处理避免大数据库 OOM：备份时 pg_dump stdout -> gzip -> io.Pipe ->
S3 upload；恢复时 S3 download -> gzip reader -> psql stdin，不再全量加载
6. loadRecords 区分"无数据"和"数据损坏"场景：JSON 解析失败返回明确错误
7. 添加 18 个核心逻辑单元测试：覆盖加密、并发、流式备份/恢复、错误处理等
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

1047f973

feat: 实现固定时间重置模式的 SQL 表达式，并添加相关单元测试 · b5f78ec1
wucm667 authored Mar 14, 2026

b5f78ec1
style: fix gofmt formatting for account type constants · e0f290fd
SsageParuders authored Mar 14, 2026

e0f290fd

refactor: merge bedrock-apikey into bedrock with auth_mode credential · 4644af2c

SsageParuders authored Mar 14, 2026

Consolidate two separate channel types (bedrock + bedrock-apikey) into
a single "AWS Bedrock" channel. Authentication mode is now distinguished
by credentials.auth_mode ("sigv4" | "apikey") instead of separate types.

Backend:
- Remove AccountTypeBedrockAPIKey constant
- IsBedrock() simplified; IsBedrockAPIKey() checks auth_mode
- Add IsAPIKeyOrBedrock() helper to eliminate repeated type checks
- Extend pool mode, quota scheduling, and billing to bedrock
- Add RetryableOnSameAccount to handleBedrockUpstreamErrors
- Add "bedrock" scope to Beta Policy for independent control

Frontend:
- Merge two buttons into one "AWS Bedrock" with auth mode radio
- Badge displays "Anthropic | AWS"
- Pool mode and quota limit UI available for bedrock
- Quota display in account list (usage bars, capacity badges, reset)
- Remove all bedrock-apikey type references

4644af2c

fix: stop rewriting native responses input ids · ca42a458
ius authored Mar 14, 2026

ca42a458
fix: remove unused wildcard mapping helper · a377e990
Wang Lvyuan authored Mar 14, 2026

a377e990
fix: handle invalid encrypted content error and retry logic. · 2666422b
InCerry authored Mar 14, 2026

2666422b
fix: honor account model mapping before group fallback · 4e8615f2
Wang Lvyuan authored Mar 14, 2026

4e8615f2

13 Mar, 2026 8 commits

fix: restore OAuth 401 temp-unschedulable for Gemini, update Antigravity tests · 45456fa2

erio authored Mar 14, 2026

The 403 detection PR changed the 401 handler condition from
`account.Type == AccountTypeOAuth` to
`account.Type == AccountTypeOAuth && account.Platform == PlatformOpenAI`,
which accidentally excluded Gemini OAuth from the temp-unschedulable path.

Fix: use `!= PlatformAntigravity` instead, preserving Gemini behavior
while correctly excluding Antigravity (whose 401 is handled by
applyErrorPolicy's temp_unschedulable_rules).

Update tests to reflect Antigravity's new 401 semantics:
- HandleUpstreamError: Antigravity OAuth 401 now uses SetError
- CheckErrorPolicy: Antigravity 401 second hit stays TempUnscheduled
- DB fallback: split into Gemini (escalates) and Antigravity (stays temp)

45456fa2

fix lint · e90ec847
Ylarod authored Mar 13, 2026

e90ec847

feat(antigravity): add 403 forbidden status detection, classification and display · 6344fa2a

erio authored Mar 13, 2026

Backend:
- Detect and classify 403 responses into three types:
  validation (account needs Google verification),
  violation (terms of service / banned),
  forbidden (generic 403)
- Extract verification/appeal URLs from 403 response body
  (structured JSON parsing with regex fallback)
- Add needs_verify, is_banned, needs_reauth, error_code fields
  to UsageInfo (omitempty for zero impact on other platforms)
- Handle 403 in request path: classify and permanently set account error
- Save validation_url in error_message for degraded path recovery
- Enrich usage with account error on both success and degraded paths
- Add singleflight dedup for usage requests with independent context
- Differentiate cache TTL: success/403 → 3min, errors → 1min
- Return degraded UsageInfo instead of HTTP 500 on quota fetch errors

Frontend:
- Display forbidden status badges with color coding (red for banned,
  amber for needs verification, gray for generic)
- Show clickable verification/appeal URL links
- Display needs_reauth and degraded error states in usage cell
- Add Antigravity tier label badge next to platform type

Tests:
- Comprehensive unit tests for classifyForbiddenType (7 cases)
- Unit tests for extractValidationURL (8 cases including unicode escapes)
- Integration test for FetchQuota forbidden path

6344fa2a

feat(ops): allow hiding alert events · 29b0e4a8
Peter authored Mar 13, 2026

29b0e4a8
sub2api: add bedrock support · 11f7b835
Ylarod authored Mar 12, 2026

11f7b835
fix: golangci-lint 修复（gofmt 格式化 + errcheck 返回值检查） · f7177be3
Rose Ding authored Mar 13, 2026
```
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
```
f7177be3
refactor: 将 ComputeQuotaResetAt 和 ValidateQuotaResetConfig 函数中的 map 类型从... · 2573107b
wucm667 authored Mar 13, 2026
```
refactor: 将 ComputeQuotaResetAt 和 ValidateQuotaResetConfig 函数中的 map 类型从 map[string]interface{} 修改为 map[string]any
```
2573107b

feat: 账号配额支持固定时间重置模式 · 5b850059

wucm667 authored Mar 13, 2026

- 后端新增 rolling/fixed 两种配额重置模式，支持日配额和周配额
- fixed 模式下可配置重置时刻（小时）、重置星期几（周配额）及时区（IANA）
- 在 account_repo.go 中使用 SQL 表达式适配两种模式的过期判断与重置时间推进
- 新增 ComputeQuotaResetAt / ValidateQuotaResetConfig 等辅助函数
- DTO 层新增相关字段并在 mappers 中完整映射
- 前端 QuotaLimitCard 新增 rolling/fixed 切换 UI、时区选择器
- CreateAccountModal / EditAccountModal 透传新配置字段
- i18n（zh/en）同步新增相关翻译词条

5b850059