Commits · 77ba9e728d51ed5f8395d8bed3fddf3816660602 · 陈曦 / sub2api

01 Apr, 2026 1 commit

fix(gateway): add content-based session hash fallback for non-Codex clients · c5aac125

YanzheL authored Apr 02, 2026

When no explicit session signals (session_id, conversation_id, prompt_cache_key)
are provided, derive a stable session seed from the request body content
(model + tools + system prompt + first user message) to enable sticky routing
and prompt caching for non-Codex clients using the Chat Completions API.

This mirrors the content-based fallback already present in GatewayService.
GenerateSessionHash, adapted for the OpenAI gateway's request formats (both
Chat Completions messages and Responses API input).

JSON fragments are canonicalized via normalizeCompatSeedJSON to ensure
semantically identical requests produce the same seed regardless of
whitespace or key ordering.

Closes #1421

c5aac125

28 Mar, 2026 1 commit

fix(billing): 计费始终使用用户请求的原始模型，而非映射后的上游模型 · f5764d8d

wucm667 authored Mar 28, 2026

当账号配置了模型映射（如 claude-sonnet-4-6 → glm-5.0）时，系统错误地
使用映射后的上游模型名计算费用。由于上游模型（如 glm-5.0）在定价系统中
没有价格配置，导致计费失败后被静默置为 0，用户不被扣费。

修改 forwardResultBillingModel 优先返回请求模型名，并移除 OpenAI 路径
中 BillingModel 字段对计费模型的覆盖逻辑。

f5764d8d

24 Mar, 2026 1 commit
- refactor: improve model resolution and normalization logic for OpenAI integration · 995ef134
  InCerry authored Mar 24, 2026
  
  995ef134
23 Mar, 2026 1 commit
- fix(openai): persist passthrough 429 rate limits · ce8520c9
  qingyuzhang authored Mar 24, 2026
  
  ce8520c9
22 Mar, 2026 1 commit
- fix(openai): recheck runtime state from db before final account selection · fef9259a
  Wang Lvyuan authored Mar 23, 2026
  
  fef9259a
20 Mar, 2026 1 commit

fix(usage): preserve requested model in gateway billing paths · 4edcfe1f

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent

)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

4edcfe1f

17 Mar, 2026 1 commit

feat(service): record upstream model across all gateway paths · 2e4ac88a

Ethan0x0000 authored Mar 17, 2026

Propagate UpstreamModel through ForwardResult and OpenAIForwardResult in Anthropic direct, API-key passthrough, Bedrock, and OpenAI gateway flows. Extract optionalNonEqualStringPtr and optionalTrimmedStringPtr into usage_log_helpers.go. Store upstream_model only when it differs from the requested model.

Also introduces anthropicPassthroughForwardInput struct to reduce parameter count.

2e4ac88a

16 Mar, 2026 1 commit

fix(gateway): 防止 OpenAI Codex 跨用户串流 · ab4e8b2c

QTom authored Mar 16, 2026

根因：多个用户共享同一 OAuth 账号时，conversation_id/session_id 头
未做用户隔离，导致上游 chatgpt.com 将不同用户的请求关联到同一会话。

HTTP SSE 修复:
- 新增 isolateOpenAISessionID(apiKeyID, raw)，将 API Key ID 混入
  session 标识符（xxhash），确保不同 Key 的用户产生不同上游会话
- buildUpstreamRequest: OAuth 分支先 Del 客户端透传的 session 头，
  再用隔离值覆盖
- buildUpstreamRequestOpenAIPassthrough: 透传路径同样隔离
- ForwardAsAnthropic: Anthropic Messages 兼容路径同步修复
- buildOpenAIWSHeaders: WS 路径的 OAuth session 头同步隔离

ab4e8b2c

15 Mar, 2026 2 commits

feat(ops): add ignore insufficient balance errors toggle and extract error constants · cfe72159

erio authored Mar 15, 2026

- Add 5th error filter switch IgnoreInsufficientBalanceErrors to suppress
  upstream insufficient balance / insufficient_quota errors from ops log
- Extract hardcoded error strings into package-level constants for
  shouldSkipOpsErrorLog, normalizeOpsErrorType, classifyOpsPhase, and
  classifyOpsIsBusinessLimited
- Define ErrNoAvailableAccounts sentinel error and replace all
  errors.New("no available accounts") call sites
- Update tests to use require.ErrorIs with the sentinel error

cfe72159

feat: 完善使用记录端点可观测性与分布统计 · eefab159

Ethan0x0000 authored Mar 15, 2026

将入站、上游与路径三类端点分布统一到使用记录页的一致化卡片交互中，并补齐端点元数据与统计链路，提升排障与流量分析效率。

eefab159

14 Mar, 2026 1 commit
- fix: handle invalid encrypted content error and retry logic. · 2666422b
  InCerry authored Mar 14, 2026
  
  2666422b
12 Mar, 2026 1 commit
- feat: decouple billing correctness from usage log batching · 611fd884
  ius authored Mar 12, 2026
  
  611fd884
11 Mar, 2026 6 commits

fix: 修复流水线golangci-lint 的 errcheck · eb0b77bf
CoolCoolTomato authored Mar 11, 2026

eb0b77bf

refactor: 重构 Chat Completions 端点，采用类型安全的 Responses API 转换 · 9d814679

shaw authored Mar 11, 2026

将 /v1/chat/completions 端点从 ResponseWriter 劫持模式重构为独立的
类型安全转换路径，与 Anthropic Messages 端点架构对齐：

- 在 apicompat 包新增 Chat Completions 完整类型定义和双向转换器
- 新增 ForwardAsChatCompletions service 方法，走 Responses API 上游
- Handler 改为独立的账号选择/failover 循环，不再劫持 Responses handler
- 提取 handleCompatErrorResponse 为 Chat Completions 和 Messages 共用
- 删除旧的 forwardChatCompletions 直传路径及相关死代码

9d814679

fix: 修复gpt-5.2以上模型映射到gpt-5.2以下时verbosity参数引发的报错 · fd8ccaf0
CoolCoolTomato authored Mar 11, 2026

fd8ccaf0
Fix Codex exhausted snapshot propagation · 2fc6aaf9
ius authored Mar 11, 2026

2fc6aaf9
Reduce DB write amplification on quota and account extra updates · 26941494
ius authored Mar 11, 2026

26941494

feat: 添加 OpenAI Chat Completions 兼容端点 · 656a77d5

7976723 authored Mar 11, 2026



基于 @yulate 在 PR #648 (commit 0bb6a392) 的工作，解决了与最新
main 分支的合并冲突。

原始功能（@yulate）:
- 添加 /v1/chat/completions 和 /chat/completions 兼容端点
- 将 Chat Completions 请求转换为 Responses API 格式并转换回来
- 添加 API Key 直连转发支持
- 包含单元测试
Co-authored-by: yulate <yulate@users.noreply.github.com>

656a77d5

09 Mar, 2026 3 commits

fix: OpenAI临时性400错误支持池模式同账号重试 & HelpTooltip层级修复 · 5fa22fdf

kyx236 authored Mar 10, 2026

1. 识别OpenAI "An error occurred while processing your request" 临时性400错误
并触发failover，同时在池模式下标记RetryableOnSameAccount，允许同账号重试
2. ForwardAsAnthropic路径同步支持临时性400错误的识别和同账号重试
3. HelpTooltip组件使用Teleport渲染到body，修复在dialog内被裁切的问题

5fa22fdf

fix: 修复gpt->claude转换无法命中codex缓存问题 · a461538d
shaw authored Mar 09, 2026

a461538d

fix(billing): 修复 OpenAI fast 档位计费并补齐展示 · 87f4ed59

yangjianbo authored Mar 08, 2026



- 打通 service_tier 在 OpenAI HTTP、WS、passthrough 与 usage 记录中的传递
- 修正 priority/flex 计费逻辑，并将 fast 归一化为 priority
- 在用户端和管理端补齐服务档位与计费明细展示
- 补齐前后端测试，并修复 WS 限流信号重复持久化导致的全量回归失败
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

87f4ed59

08 Mar, 2026 1 commit
- feat: 支持 API Key 上游池模式同账号重试次数配置与自定义错误策略 · e643fc38
  kyx236 authored Mar 08, 2026
  
  e643fc38
07 Mar, 2026 5 commits
- fix: 补齐旧账号的 OpenAI 限流补偿 · 1307d604
  神乐 authored Mar 08, 2026
  
  1307d604
- fix: 修复 OpenAI WS 限流状态与调度同步 · 45d57018
  神乐 authored Mar 07, 2026
  
  45d57018
- fix: 限流账号自动退出调度并优化提示文案 · 101ef0cf
  神乐 authored Mar 07, 2026
  
  101ef0cf
- fix(openai): detect official codex client by headers · da89583c
  admin authored Mar 07, 2026
  
  da89583c
- fix: /response端点移除强制注入大量instructions内容 · ebd5253e
  shaw authored Mar 07, 2026
  
  ebd5253e
06 Mar, 2026 5 commits

feat: /v1/messages端点适配codex账号池 · 92159994
shaw authored Mar 06, 2026

92159994
fix(openai): restore ws usage window display · 838ada88
神乐 authored Mar 06, 2026

838ada88
fix(openai): support remote compact task · 34039093
神乐 authored Mar 06, 2026

34039093

fix(openai): 统一专属倍率计费链路并补齐回归测试 · a18bbb5f

yangjianbo authored Mar 06, 2026

抽取共享的用户分组专属倍率解析器，统一缓存、singleflight 与回退逻辑。\n\n让 OpenAI 独立计费链路复用专属倍率解析，修复 usage 记录与实际扣费未命中用户专属倍率的问题。\n\n补齐 OpenAI 计费与解析器单元测试，并修复全量回归中暴露的 lint 阻塞项。\n\nCo-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

a18bbb5f

feat(openai): add /v1/messages endpoint and API compatibility layer · ff1f1149

alfadb authored Mar 06, 2026

Add Anthropic Messages API support for OpenAI platform groups, enabling
clients using Claude-style /v1/messages format to access OpenAI accounts
through automatic protocol conversion.

- Add apicompat package with type definitions and bidirectional converters
  (Anthropic ↔ Chat, Chat ↔ Responses, Anthropic ↔

 Responses)
- Implement /v1/messages endpoint for OpenAI gateway with streaming support
- Add model mapping UI for OpenAI OAuth accounts (whitelist + mapping modes)
- Support prompt caching fields and codex OAuth transforms
- Fix tool call ID conversion for Responses API (fc_ prefix)
- Ensure function_call_output has non-empty output field
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ff1f1149

05 Mar, 2026 4 commits

feat: add independent load_factor field for scheduling load calculation · 0d6c1c77
erio authored Mar 06, 2026

0d6c1c77

refactor: unify post-usage billing logic and fix account quota calculation · 02dea7b0

erio authored Mar 06, 2026

- Extract postUsageBilling() to consolidate billing logic across
  GatewayService.RecordUsage, RecordUsageWithLongContext, and
  OpenAIGatewayService.RecordUsage, eliminating ~120 lines of
  duplicated code
- Fix account quota to use TotalCost × accountRateMultiplier
  (was using raw TotalCost, inconsistent with account cost stats)
- Fix RecordUsageWithLongContext API Key quota only updating in
  balance mode (now updates regardless of billing type)
- Fix WebSocket client disconnect detection on Windows by adding
  "an established connection was aborted" to known disconnect errors

02dea7b0

feat: add quota limit for API key accounts · 05527b13

erio authored Mar 05, 2026

- Add configurable spending limit (quota_limit) for apikey-type accounts
- Atomic quota accumulation via PostgreSQL JSONB operations on TotalCost
- Scheduler filters out over-quota accounts with outbox-triggered snapshot refresh
- Display quota usage ($used / $limit) in account capacity column
- Add "Reset Quota" action in account menu to reset usage to zero
- Editing account settings preserves quota_used (no accidental reset)
- Covers all 3 billing paths: Anthropic, Gemini, OpenAI RecordUsage

chore: bump version to 0.1.90.4

05527b13

feat(openai-ws): 合并 WS v2 透传模式与前端 ws mode · 1d0872e7

yangjianbo authored Mar 05, 2026

新增 OpenAI WebSocket v2 passthrough relay 数据面与服务适配层，
支持按账号 ws mode 在 ctx_pool 与 passthrough 间路由。

同步调整前端 OpenAI ws mode 选项为 off/ctx_pool/passthrough，
并补充 i18n 文案与对应单测。

新增 Caddyfile.dmit 与 docker-compose-aicodex.yml 部署配置，
用于宿主机场景下的反向代理与服务编排。
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

1d0872e7

03 Mar, 2026 2 commits

feat: apikey支持5h/1d/7d速率控制 · a80ec5d8
shaw authored Mar 03, 2026

a80ec5d8

fix(gateway): 分组隔离 — 禁止未分组账号被跨组调度 · 530a1629

QTom authored Mar 03, 2026

当 API Key 无分组时，调度仅从未分组账号池中选取。
修复 isAccountInGroup 在 groupID==nil 时的逻辑，
同时补全 scheduler_snapshot_service 和 gemini_compat_service
中的 SimpleMode 保护，确保分组隔离在所有调度路径生效。

新增 ListSchedulableUngroupedByPlatform/s 方法，
使用 Ent 的 Not(HasAccountGroups()) 谓词实现未分组账号隔离。
新增 17 个单元和端到端隔离测试，覆盖所有分支和边界条件。

530a1629

28 Feb, 2026 1 commit
- feat(sync): full code sync from release · bb664d9b
  yangjianbo authored Feb 28, 2026
  
  bb664d9b
22 Feb, 2026 1 commit

fix(codex): 修复额度窗口过期展示并补齐高覆盖测试 · 10636d8a

yangjianbo authored Feb 22, 2026

- 后端新增绝对重置时间字段计算（codex_5h_reset_at/codex_7d_reset_at）

- 前端统一窗口解析逻辑：绝对时间优先，updated_at+seconds 回退，过期自动归零

- 新增后端与前端单元测试，覆盖关键边界与异常场景

10636d8a