Commits · 4b1ffc23f55ba275fc319e6818258592a31565f0 · 陈曦 / sub2api

24 Mar, 2026 6 commits

feat(openai): Mobile RT 补全 plan_type、精确匹配账号、刷新时自动设置隐私 · 91b1d812

QTom authored Mar 24, 2026

1. accounts/check 补全 plan_type：当 id_token 缺少 plan_type（如 Mobile RT），
自动调用 accounts/check 端点获取订阅类型
2. orgID 精确匹配账号：从 JWT 提取 poid 匹配正确账号，避免 Go map
遍历顺序随机导致 plan_type 不稳定
3. RT 刷新时设置隐私：调用 disableOpenAITraining 关闭训练数据共享，
结果存入 extra.privacy_mode，后续跳过重复设置
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

91b1d812

feat: 支持自定义端点配置与展示 · 995bee14
shaw authored Mar 24, 2026

995bee14
refactor(test): improve type assertions in ops endpoint context tests · f10e56be
Ethan0x0000 authored Mar 24, 2026

f10e56be

fix(service): preserve anthropic usage fields across compat endpoints · 2f8e10db

Ethan0x0000 authored Mar 24, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

2f8e10db

fix(service): normalize user agent for gemini session reuse · 5418e15e

Ethan0x0000 authored Mar 24, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

5418e15e

fix(service): normalize user agent for sticky session hashes · bcf84cc1

Ethan0x0000 authored Mar 24, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

bcf84cc1

23 Mar, 2026 6 commits

fix(openai): persist passthrough 429 rate limits · ce8520c9
qingyuzhang authored Mar 24, 2026

ce8520c9

feat(routes): add platform-based routing split for /v1/responses and /v1/chat/completions · d927c0e4

Ethan0x0000 authored Mar 23, 2026

Mirror the existing /v1/messages platform split pattern:
- OpenAI groups → OpenAIGateway handlers (existing, unchanged)
- Non-OpenAI groups → Gateway handlers (new Anthropic-upstream path)

Updated both /v1 prefixed routes and non-prefixed alias routes
(/responses, /chat/completions). WebSocket route (/v1/responses GET)
remains OpenAI-only as Anthropic has no WebSocket equivalent.

d927c0e4

feat(handler): add Responses/ChatCompletions handlers on GatewayHandler · 31660c4c

Ethan0x0000 authored Mar 23, 2026

New HTTP handlers for Anthropic platform groups accepting OpenAI-format
endpoints:

- GatewayHandler.Responses: /v1/responses for non-OpenAI groups
- GatewayHandler.ChatCompletions: /v1/chat/completions for non-OpenAI groups

Both handlers include:
- Claude Code only restriction (403 reject when claude_code_only enabled,
  since these endpoints are never Claude Code clients)
- Full auth → billing → user/account concurrency → failover loop
- Ops error/endpoint context propagation
- Async usage recording via worker pool

Error responses use each endpoint's native format (Responses API format
for /v1/responses, CC format for /v1/chat/completions).

31660c4c

feat(service): add ForwardAsResponses/ForwardAsChatCompletions on GatewayService · 4321adab

Ethan0x0000 authored Mar 23, 2026

New forwarding methods on GatewayService for Anthropic platform groups:

- ForwardAsResponses: accept Responses body → convert to Anthropic →
  forward to upstream → convert response back to Responses format.
  Supports both streaming (SSE event-by-event conversion) and buffered
  (accumulate then convert) response modes.
- ForwardAsChatCompletions: chain CC→Responses→Anthropic for request,
  Anthropic→Responses→CC for response. Streaming uses dual state machine
  chain with [DONE] marker.

Both methods reuse existing GatewayService infrastructure:
buildUpstreamRequest, Claude Code mimicry, cache control enforcement,
model mapping, and return UpstreamFailoverError for handler-level retry.

4321adab

feat(apicompat): add Responses

↔

Anthropic bidirectional format conversion · 68f151f5

Ethan0x0000 authored Mar 23, 2026

Add reverse-direction converters for Anthropic platform groups to accept
OpenAI-format requests:

- ResponsesToAnthropicRequest: Responses API input → Anthropic Messages
  request with system extraction, tool/toolChoice mapping, reasoning
  effort conversion, image data URI↔base64, and consecutive role merging
- AnthropicToResponsesResponse: Anthropic response → Responses response
  with content block→output item mapping, usage, stop_reason→status
- AnthropicEventToResponsesEvents: stateful SSE stream converter
  (Anthropic streaming protocol → Responses streaming protocol)
- FinalizeAnthropicResponsesStream: synthetic termination for
  incomplete streams

68f151f5

feat(admin): add account privacy mode filter · 4838ab74
weak-fox authored Mar 23, 2026

4838ab74

22 Mar, 2026 3 commits

fix(openai): recheck runtime state from db before final account selection · fef9259a
Wang Lvyuan authored Mar 23, 2026

fef9259a
fix(account): preserve runtime state during credentials-only updates · ad7c1072
Wang Lvyuan authored Mar 23, 2026

ad7c1072

fix(gateway): strip empty text blocks from nested tool_result content · 70a9d0d3

alfadb authored Mar 22, 2026

Empty text blocks inside tool_result.content were not being filtered,
causing upstream 400 errors: 'text content blocks must be non-empty'.

Changes:
- Add stripEmptyTextBlocksFromSlice helper for recursive content filtering
- FilterThinkingBlocksForRetry now recurses into tool_result nested content
- Add StripEmptyTextBlocks pre-filter on initial request path to avoid
  unnecessary 400+retry round-trips
- Add unit tests for nested empty text block scenarios

70a9d0d3

21 Mar, 2026 8 commits
- test(ops): add tests for setOpsEndpointContext and safeUpstreamURL · 7cd38248
  Ethan0x0000 authored Mar 21, 2026
  
  7cd38248
- feat(ops): propagate endpoint/request-type context in handlers; add... · db9021f9
  Ethan0x0000 authored Mar 21, 2026
```
feat(ops): propagate endpoint/request-type context in handlers; add UpstreamURL to upstream error events
```
  db9021f9
- feat(ops): adapt repository INSERT/SELECT + add setOpsEndpointContext in error logger middleware · a2418c60
  Ethan0x0000 authored Mar 21, 2026
  
  a2418c60
- fix(settings): prevent SMTP config overwrite and stabilize test after refresh · 1fb29d59
  Eilen6316 authored Mar 21, 2026
  
  1fb29d59
- feat(ops): add endpoint/model/request_type fields to error log structs + safeUpstreamURL · 8c4a217f
  Ethan0x0000 authored Mar 21, 2026
  
  8c4a217f
- fix(apicompat): support array content for system and tool messages · 4feacf22
  mutuyihao authored Mar 21, 2026
  
  4feacf22
- fix(dto): fallback to legacy model in usage mapping · 27948c77
  Ethan0x0000 authored Mar 21, 2026
```
Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>
```
  27948c77
- fix: correct log levels for thinking block signature retry flow · c64ed46d
  Dave King authored Mar 21, 2026
```
LegacyPrintf uses inferStdLogLevel() to infer log level from message
text. Any message containing the word "error" is classified as ERROR
level, causing the entire signature-retry recovery flow (which succeeds)
to produce spurious ERROR log entries.

Changes:
- Remove noisy [SignatureCheck] debug logs inside isThinkingBlockSignatureError
  that were logging every detected signature check as ERROR
- Change retry-start log to WARN level via [warn] prefix
- Change retry-success log to INFO level by removing "error" from message
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
```
  c64ed46d
20 Mar, 2026 14 commits

refactor(dto): split admin usage upstream model exposure · 095200bd

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

095200bd

fix(provider): retain upstream model for gemini compat and ws · 2c667a15

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

2c667a15

fix(provider): preserve requested model in antigravity and sora · bac40804

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

bac40804

fix(usage): preserve requested model in gateway billing paths · 4edcfe1f

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

4edcfe1f

test(repo): cover requested model repository semantics · 9259dcb6

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

9259dcb6

feat(repo): persist requested model in usage log queries · 7ef933c7

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

7ef933c7

feat(usage): add requested model usage metadata helpers · 7d312822

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

7d312822

fix(ops_alert): wg.Add 竞态修复 + leader lock release context 泄漏 · 5c39e6f2

QTom authored Mar 12, 2026

1. Start() 中 wg.Add(1) 从 run() goroutine 内部移到 go s.run() 之前，
防止 Stop().wg.Wait() 在 Add 之前返回导致孤儿 goroutine。
2. tryAcquireLeaderLock 返回的 release 闭包改用独立的
context.Background()+5s 超时，避免捕获的 evaluateOnce ctx
在 defer 执行时已过期导致锁释放失败（最长阻塞 90s TTL）。

5c39e6f2

Fix OpenAI default model forwarding · 4617ef2b
Jiahao Luo authored Mar 20, 2026

4617ef2b

fix(apicompat): 修正 Anthropic→OpenAI 推理级别映射 · 8afa8c10

alfadb authored Mar 20, 2026

旧映射错误地将所有级别上移一档（medium→high, high→xhigh），
导致 effort=max 被原样透传到 OpenAI 上游并返回 400 错误。

根据两边官方 API 定义对齐：
- Anthropic: low, medium, high（默认）, max
- OpenAI:    low, medium, high（默认）, xhigh

新的 1:1 映射：low→low, medium→medium, high→high, max→xhigh

8afa8c10

fix: format gpt-5.4 mini fallback pricing · 578608d3
Remx authored Mar 20, 2026

578608d3

fix: quota display shows stale cumulative usage after daily/weekly reset · 0d45d866

wucm667 authored Mar 20, 2026

The quota reset mechanism is lazy — quota_daily_used/quota_weekly_used
in the database are only reset on the next IncrementQuotaUsed call.
The scheduling layer (IsQuotaExceeded) correctly checks period expiry
before enforcing limits, so the account remains usable. However, the
API response mapper reads the raw DB value without checking expiry,
causing the frontend to display cumulative usage (e.g. 110%) even
after the reset period has passed.

Add IsDailyQuotaPeriodExpired/IsWeeklyQuotaPeriodExpired methods and
use them in the mapper to return used=0 when the period has expired.

0d45d866

fix: add max_claude_code_version to API contract test expected output · 4f7629a4
shaw authored Mar 20, 2026

4f7629a4

feat: add max_claude_code_version setting and disable auto-upgrade env var · 01d8286b

shaw authored Mar 20, 2026

Add maximum Claude Code version limit to complement the existing minimum
version check. Refactor the version cache from single-value to unified
bounds struct (min+max) with a single atomic.Value and singleflight group.

- Backend: new constant, struct field, cache refactor, validation (semver
  format + cross-validation max >= min), gateway enforcement, audit diff
- Frontend: settings UI input, TypeScript types, zh/en i18n
- Add CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1 to all Claude Code
  tutorials on /keys page (unix/cmd/powershell/vscode settings.json)

01d8286b

19 Mar, 2026 3 commits

fix(antigravity): correctly mark credits exhausted on "Resource has been exhausted" 429 · 21b6f2d5

erio authored Mar 20, 2026

shouldMarkCreditsExhausted was blocked by isURLLevelRateLimit check when
credit overages retry returned "Resource has been exhausted (e.g. check quota).",
causing credits to never be marked as exhausted. This led to an infinite loop
where each request injected credits, bypassed model rate limits, and failed again.

- Remove isURLLevelRateLimit guard from shouldMarkCreditsExhausted (only called
  for credit retry responses — if credits retry fails, mark exhausted)
- Add "resource has been exhausted" to creditsExhaustedKeywords
- Update tests to match corrected behavior

21b6f2d5

fix(antigravity): fast-fail on proxy unavailable, temp-unschedule account · 528ff5d2

erio authored Mar 19, 2026

## Problem

When a proxy is unreachable, token refresh retries up to 4 times with
30s timeout each, causing requests to hang for ~2 minutes before
failing with a generic 502 error. The failed account is not marked,
so subsequent requests keep hitting it.

## Changes

### Proxy connection fast-fail
- Set TCP dial timeout to 5s and TLS handshake timeout to 5s on
  antigravity client, so proxy connectivity issues fail within 5s
  instead of 30s
- Reduce overall HTTP client timeout from 30s to 10s
- Export `IsConnectionError` for service-layer use
- Detect proxy connection errors in `RefreshToken` and return
  immediately with "proxy unavailable" error (no retries)

### Token refresh temp-unschedulable
- Add 8s context timeout for token refresh on request path
- Mark account as temp-unschedulable for 10min when refresh fails
  (both background `TokenRefreshService` and request-path
  `GetAccessToken`)
- Sync temp-unschedulable state to Redis cache for immediate
  scheduler effect
- Inject `TempUnschedCache` into `AntigravityTokenProvider`

### Account failover
- Return `UpstreamFailoverError` on `GetAccessToken` failure in
  `Forward`/`ForwardGemini` to trigger handler-level account switch
  instead of returning 502 directly

### Proxy probe alignment
- Apply same 5s dial/TLS timeout to shared `httpclient` pool
- Reduce proxy probe timeout from 30s to 10s

528ff5d2

feat(admin): 用户管理新增分组列、分组筛选与专属分组一键替换 · ba7d2aec

QTom authored Mar 18, 2026

- 新增分组列：展示用户的专属/公开分组，支持 hover 查看详情
- 新增分组筛选：下拉选择或模糊搜索分组名过滤用户
- 专属分组替换：点击专属分组弹出操作菜单，选择目标分组后
  自动授予新分组权限、迁移绑定的 Key、移除旧分组权限
- 后端新增 POST /admin/users/:id/replace-group 端点，事务内
  完成分组替换并失效认证缓存

ba7d2aec