Commits · a738688287c6faa0834cad770c4a34a2c0f38696 · 陈曦 / sub2api

27 Apr, 2026 8 commits
- 补充openai、gemini以及流失请求的采集数据以及nfs落库 · 3b7a5fff
  陈曦 authored Apr 27, 2026
  
  3b7a5fff
- fix(openai): avoid implicit image sticky sessions · 95a4f473
  gaoren002 authored Apr 26, 2026 and 陈曦 committed Apr 27, 2026
  
  95a4f473
- fix(openai): tighten responses stream account tests · 95f06794
  hungryboy1025 authored Apr 25, 2026 and 陈曦 committed Apr 27, 2026
  
  95f06794
- fix(openai): keep responses stream alive during pre-output failover · 872aa3f2
  gaoren002 authored Apr 25, 2026 and 陈曦 committed Apr 27, 2026
  
  872aa3f2
- fix(openai): fail over before responses stream output · 79bb941d
  AyeSt0 authored Apr 25, 2026 and 陈曦 committed Apr 27, 2026
  
  79bb941d
- fix(openai): bump codex CLI version from 0.104.0 to 0.125.0 · fa5dc5b6
  4fuu authored Apr 25, 2026 and 陈曦 committed Apr 27, 2026
```
The hardcoded codex CLI version (0.104.0) causes upstream rejection
when using gpt-5.5 with compact, as the server treats the request
as an outdated client and returns 400/502.

Update codexCLIVersion, codexCLIUserAgent, and openAICodexProbeVersion
to 0.125.0 to match the current Codex CLI release.

Fixes #1933, #1887, #1865
Related: #1609, #1298, #849
```
  fa5dc5b6
- feat(openai): port /responses/compact account support flow (PR #1555) · 7a786be1
  shaw authored Apr 25, 2026 and 陈曦 committed Apr 27, 2026
```
将 vansour/sub2api#1555 的 OpenAI compact 能力建模手工移植到当前 main：账号
级 compact 状态/auto-force_on-force_off 模式、compact-only 模型映射、调度器
tier 分层（已支持 > 未知 > 已知不支持）、管理后台 compact 主动探测，以及对应
i18n/状态徽章。普通 /responses 流量行为不变，无数据库迁移。
```
  7a786be1
- fix(openai): handle codex spark model limitations · 40b643d6
  gaoren002 authored Apr 24, 2026 and 陈曦 committed Apr 27, 2026
  
  40b643d6
24 Apr, 2026 1 commit

fix(openai): preserve image outputs when text content serialization fails · ca204ddd

shaw authored Apr 24, 2026

In reconstructResponseOutputFromSSE, text content Marshal/Unmarshal
failure previously caused an early return that silently discarded
already-extracted image_generation_call outputs. Now serialization
errors are tolerated so image results still reach the client.

ca204ddd

23 Apr, 2026 7 commits

fix: bridge codex image generation over responses · 5f418997
gaoren002 authored Apr 23, 2026

5f418997

revert: remove fork-only changes from release sync · 67518a59

erio authored Apr 23, 2026

Revert payment/wechat, sora/claude-max cleanup, fork-only migrations,
and cosmetic changes that were brought in by the release sync commit.
Keep only channel-monitor related improvements:
- PublicSettingsInjectionPayload named struct with drift test
- ChannelMonitorRunner graceful shutdown in wire
- image_output_price in SupportedModelChip
- Simplified buildSelfNavItems in AppSidebar
- Gateway WARN logs for 503 branches

67518a59

sync: bring over remaining release/custom-0.1.115 changes · 748a84d8

erio authored Apr 23, 2026

- Extract PublicSettingsInjectionPayload named struct with drift test
- Add channel_monitor_default_interval_seconds to SSR injection
- Add image_output_price to SupportedModelChip
- Simplify AppSidebar buildSelfNavItems (admins see available channels)
- Add gateway WARN logs for 503 no-available-accounts branches
- Wire ChannelMonitorRunner into provideCleanup for graceful shutdown
- Add migrations 130/131 (CC template userid fix + mimicry field cleanup)
- Clean up fork-only features (sora, claude max simulation, client affinity)
- Remove ~320 obsolete i18n keys
- Add codexUsage utility, WechatServiceButton, BulkEditAccountModal
- Tidy go.sum

748a84d8

fix: 修复 golangci-lint 报告的 36 个问题 · ef967d8f
shaw authored Apr 23, 2026

ef967d8f
修复计费问题以及模型回显 · 9e5a6351
wx-11 authored Apr 23, 2026

9e5a6351
修改403逻辑: 先临时冷却，再根据连续次数决定是否判坏号 · 11cf23da
wx-11 authored Apr 23, 2026

11cf23da
fix openai image request handling · 00778dca
meteor041 authored Apr 23, 2026

00778dca

22 Apr, 2026 1 commit

feat(openai): 同步生图 API 支持并接入图片计费调度 · c5480219

lucas morgan authored Apr 22, 2026

- 同步 OpenAI 图片生成与编辑接口
- 接入图片请求解析、账号调度、转发与用量记录
- 接入图片计费与图片用量落库
- 限制 OAuth 生图仅支持无显式模型和尺寸的基础请求

c5480219

17 Apr, 2026 1 commit

refactor: extract ReadUpstreamResponseBody to deduplicate upstream response... · c0b2cacb

erio authored Apr 16, 2026 and

陈曦 committed Apr 17, 2026

refactor: extract ReadUpstreamResponseBody to deduplicate upstream response read + too-large error handling

Consolidates 9 call sites of resolveUpstreamResponseReadLimit + readUpstreamResponseBodyLimited + ErrUpstreamResponseBodyTooLarge error handling into a single ReadUpstreamResponseBody function with TooLargeWriter callback for API-format-specific error responses (Anthropic, OpenAI, countTokens).

c0b2cacb

15 Apr, 2026 2 commits

refactor: extract ReadUpstreamResponseBody to deduplicate upstream response... · 10699eeb

erio authored Apr 16, 2026

refactor: extract ReadUpstreamResponseBody to deduplicate upstream response read + too-large error handling

Consolidates 9 call sites of resolveUpstreamResponseReadLimit + readUpstreamResponseBodyLimited + ErrUpstreamResponseBodyTooLarge error handling into a single ReadUpstreamResponseBody function with TooLargeWriter callback for API-format-specific error responses (Anthropic, OpenAI, countTokens).

10699eeb

修复 OpenAI 账号限流回流误判：7d 窗口可用时不因 5h 窗口为 0 回写 429 · 7451b6f9
Wesley Liddick authored Apr 15, 2026

7451b6f9

14 Apr, 2026 6 commits

fix: merge general improvements from release branch · 63f539b3

erio authored Apr 14, 2026

Backend:
- gateway_handler: pass subject.UserID instead of int64(0) for user-level routing
- setting_handler: add missing BalanceLowNotifyRechargeURL to UpdateSettings response
- openai_gateway_service: use applyAccountStatsCost for account stats pricing integration
- embed_on: add local file override (data/public/) for embedded frontend assets

Frontend:
- useTableSelection: add batchUpdate method for batch operations
- AccountsView: virtual scrolling params, Set-based isSelected, swipe virtualization
- ProxiesView: add batchUpdate to selection and swipe-select
- BulkEditAccountModal: fix submit handler to prevent event object as argument
- SettingsView: move payload construction outside try block
- i18n: add general translation keys (saved, deleted, view, validation, allowUserRefund)
- api/client: reorder error fields for consistency
- stores/payment: clarify pollOrderStatus JSDoc

63f539b3

fix: correct account stats pricing priority order · 98c9d517

erio authored Apr 13, 2026

Priority was wrong:
- Before: custom rules → LiteLLM (when ApplyPricingToAccountStats) → nil
- After:  custom rules → totalCost (when ApplyPricingToAccountStats) → LiteLLM → nil

When ApplyPricingToAccountStats is enabled, use the request's actual
client billing cost (before multiplier) as account_stats_cost, instead
of recalculating from LiteLLM per-token prices which produced incorrect
values for per-request billing mode.

LiteLLM model pricing is now the final fallback (priority 3), used only
when neither custom rules nor ApplyPricingToAccountStats apply.

98c9d517

feat: WebSearch tri-state, account stats pricing fix, quota cache fix, usage tooltip · 1262654d

erio authored Apr 13, 2026

WebSearch tri-state switch:
- Account-level web_search_emulation changed from bool to tri-state
  string: "default" (follow channel) / "enabled" / "disabled"
- shouldEmulateWebSearch checks channel config when account is "default"
- SQL migration converts old bool values
- Frontend select replaces toggle in Edit/CreateAccountModal

Account stats pricing:
- resolveAccountStatsCost uses upstream model (post-mapping) for matching
- Priority: custom rules → model pricing file (when toggle on) → default
- Custom rules always configurable, independent of toggle
- Account ID field changed to searchable selector filtered by platform
- Description updated to reflect new behavior

Quota notification cache fix:
- CheckAccountQuotaAfterIncrement fetches real-time account from DB
- Reconstructs pre-increment usage for accurate threshold crossing detection
- New AccountQuotaReader interface (minimal: GetByID only)

Usage tooltip:
- Per-request/image billing shows per-request price instead of $0 token price
- Token billing continues to show input/output price per million tokens

1262654d

fix(channel): use upstream model for account stats pricing and remove channel pricing fallback · 11c46068

erio authored Apr 13, 2026

- resolveAccountStatsCost now uses the final upstream model (after
  account-level mapping) to match custom pricing rules, fixing the
  issue where requested model (e.g. claude-sonnet-4-5) didn't match
  rules configured for upstream model (e.g. claude-opus-4-6)
- Remove tryChannelPricing fallback — only custom rules are applied,
  unmatched requests use default formula (total_cost × rate)
- Remove unused billingService and serviceTier parameters
- Update description: "启用后将支持自定义账号统计的模型价格"

11c46068

feat(notify): add balance low & account quota notification system · b32d1a2c

erio authored Apr 12, 2026

- User balance low notification: email alert when balance drops below
  configurable threshold (user email + verified extra emails)
- Account quota notification: broadcast email to admin-configured
  recipients when daily/weekly/total quota usage exceeds alert threshold
- Admin settings: global enable/disable, default threshold, quota
  notification email list (Email Settings tab)
- User profile: enable/disable, custom threshold, add/remove extra
  notification emails with verification code flow
- Account quota: per-dimension alert toggle and threshold in quota
  control card
- Trigger logic: first-crossing only (old >= threshold && new < threshold
  for balance; old < threshold && new >= threshold for quota), naturally
  prevents duplicate notifications without Redis dedup

b32d1a2c

feat(channels): add custom account stats pricing rules · 7535e312

erio authored Apr 11, 2026

Allow channels to configure independent model pricing for account
statistics cost calculation, decoupled from user billing.

Backend:
- Migration 101: channels.apply_pricing_to_account_stats toggle,
  channel_account_stats_pricing_rules/model_pricing tables,
  usage_logs.account_stats_cost column
- resolveAccountStatsCost: match rules by group/account, then channel
  pricing, fallback to original formula when unconfigured
- Integrate into both GatewayService.recordUsageCore and
  OpenAIGatewayService.RecordUsage
- Update 8 account stats SQL queries to use
  COALESCE(account_stats_cost, total_cost) * account_rate_multiplier
- 23 unit tests for matching, pricing lookup, and cost calculation

Frontend:
- Channel edit dialog: toggle + custom rules UI with group/account
  multi-select and pricing entry cards
- API types and i18n (zh/en)

7535e312

08 Apr, 2026 7 commits

fix: 优化调度快照缓存以避免 Redis 大 MGET · 265687b5
ius authored Apr 08, 2026

265687b5
fix(openai): sanitize empty base64 input images · fb233463
YanzheL authored Apr 01, 2026 and 陈曦 committed Apr 08, 2026

fb233463

fix(gateway): add content-based session hash fallback for non-Codex clients · 16c7bd31

YanzheL authored Apr 02, 2026 and

陈曦 committed Apr 08, 2026

When no explicit session signals (session_id, conversation_id, prompt_cache_key)
are provided, derive a stable session seed from the request body content
(model + tools + system prompt + first user message) to enable sticky routing
and prompt caching for non-Codex clients using the Chat Completions API.

This mirrors the content-based fallback already present in GatewayService.
GenerateSessionHash, adapted for the OpenAI gateway's request formats (both
Chat Completions messages and Responses API input).

JSON fragments are canonicalized via normalizeCompatSeedJSON to ensure
semantically identical requests produce the same seed regardless of
whitespace or key ordering.

Closes #1421

16c7bd31

fix: 非流式响应路径扩展SSE检测至所有账号类型 (#1493) · 70836c70

Elysia authored Apr 07, 2026 and

陈曦 committed Apr 08, 2026



当上游返回SSE格式响应(如sub2api链路)时，API Key账号的非流式路径
未检测SSE，导致终态事件中空output直接透传给客户端。

- 将Content-Type SSE检测从仅OAuth扩展至所有账号类型
- 重命名handleOAuthSSEToJSON为handleSSEToJSON（无OAuth专属逻辑）
- 为透传路径新增handlePassthroughSSEToJSON，支持SSE转JSON及空output重建
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

70836c70

fix(openai): do not normalize API token based accounts · e7439c32
Alex authored Apr 07, 2026 and 陈曦 committed Apr 08, 2026

e7439c32

fix: 非流式路径在上游终态事件output为空时从delta事件重建响应内容 · b85ab201

shaw authored Apr 07, 2026 and

陈曦 committed Apr 08, 2026

上游API近期更新后，response.completed终态SSE事件的output字段可能为空，
实际内容仅通过response.output_text.delta等增量事件下发。流式路径不受影响，
但chat_completions非流式路径和responses OAuth非流式路径只依赖终态事件的
output，导致返回空响应。

新增BufferedResponseAccumulator累积器，在SSE扫描过程中收集delta事件内容
（文本、function_call、reasoning），当终态output为空时补充重建。

同时修复handleChatBufferedStreamingResponse遗漏response.done事件类型的问题。

b85ab201

fix(openai): fail over passthrough 429 and 529 · cec5a3bf
qingyuzhang authored Mar 30, 2026 and 陈曦 committed Apr 08, 2026

cec5a3bf

07 Apr, 2026 3 commits

fix: 非流式响应路径扩展SSE检测至所有账号类型 (#1493) · 9e515ea7

Elysia authored Apr 07, 2026



当上游返回SSE格式响应(如sub2api链路)时，API Key账号的非流式路径
未检测SSE，导致终态事件中空output直接透传给客户端。

- 将Content-Type SSE检测从仅OAuth扩展至所有账号类型
- 重命名handleOAuthSSEToJSON为handleSSEToJSON（无OAuth专属逻辑）
- 为透传路径新增handlePassthroughSSEToJSON，支持SSE转JSON及空output重建
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

9e515ea7

fix: 非流式路径在上游终态事件output为空时从delta事件重建响应内容 · b2e379cf

shaw authored Apr 07, 2026

上游API近期更新后，response.completed终态SSE事件的output字段可能为空，
实际内容仅通过response.output_text.delta等增量事件下发。流式路径不受影响，
但chat_completions非流式路径和responses OAuth非流式路径只依赖终态事件的
output，导致返回空响应。

新增BufferedResponseAccumulator累积器，在SSE扫描过程中收集delta事件内容
（文本、function_call、reasoning），当终态output为空时补充重建。

同时修复handleChatBufferedStreamingResponse遗漏response.done事件类型的问题。

b2e379cf

fix(openai): do not normalize API token based accounts · 7eecc49c
Alex authored Apr 07, 2026

7eecc49c

05 Apr, 2026 1 commit

fix(billing): prevent channel_mapped override from reverting BillingModel when channel did not map · f585a15e

shaw authored Apr 05, 2026

When a channel has no model mapping for the requested model, ChannelMappedModel
equals OriginalModel (the user's arbitrary input). Combined with the default
BillingModelSource="channel_mapped", this incorrectly overrides the BillingModel
set by the OpenAI format conversion layer (e.g., gpt-5.4 from DefaultMappedModel)
back to the unmapped original model (e.g., glm) which has no pricing — resulting
in zero-cost billing.

Add guard condition so the channel_mapped override only fires when the channel
actually changed the model (ChannelMappedModel != OriginalModel).

f585a15e

04 Apr, 2026 3 commits

refactor: remove resolveOpenAIUpstreamModel, use normalizeCodexModel directly · e27b0adb

erio authored Apr 04, 2026

Eliminates unnecessary indirection layer. The wrapper function only
called normalizeCodexModel with a special case for "gpt 5.3 codex spark"
(space-separated variant) that is no longer needed.

All call sites now use normalizeCodexModel directly.

e27b0adb

feat(channel): improve cache strategy and add restriction logging · 58f758c8

erio authored Apr 03, 2026

- Change channel cache TTL from 60s to 10min (reduce unnecessary DB queries)
- Actively rebuild cache after CRUD instead of lazy invalidation
- Add slog.Warn logging for channel pricing restriction blocks (4 places)

58f758c8

fix: resolve 5 audit findings in channel/credits/scheduling · 71f61bbc

erio authored Apr 02, 2026

P0-1: Credits degraded response retry + fail-open
- Add isAntigravityDegradedResponse() to detect transient API failures
- Retry up to 3 times with exponential backoff (500ms/1s/2s)
- Invalidate singleflight cache between retries
- Fail-open after exhausting retries instead of 5h circuit break

P1-1: Fix channel restriction pre-check timing conflict
- Swap checkClaudeCodeRestriction before checkChannelPricingRestriction
- Ensures channel restriction is checked against final fallback groupID

P1-2: Add interval pricing validation (frontend + backend)
- Backend: ValidateIntervals() with boundary, price, overlap checks
- Frontend: validateIntervals() with Chinese error messages
- Rules: MinTokens>=0, MaxTokens>MinTokens, prices>=0, no overlap

P2: Fix cross-platform same-model pricing/mapping override
- Store cache keys using original platform instead of group platform
- Lookup across matching platforms (antigravity→anthropic→gemini)
- Prevents anthropic/gemini same-name models from overwriting each other

71f61bbc