Commits · 3718d6dcd4559ef3ecc5e63d5ddb217e732032f7 · 陈曦 / sub2api

15 Mar, 2026 1 commit

fix: extract and log Claude output_config.effort in usage records · 1bff2292

YanzheL authored Mar 15, 2026

Claude's output_config.effort parameter (low/medium/high/max) was not
being extracted from requests or logged in the reasoning_effort column
of usage logs. Only the OpenAI path populated this field.

Changes:
- Extract output_config.effort in ParseGatewayRequest
- Add ReasoningEffort field to ForwardResult
- Populate reasoning_effort in both RecordUsage and RecordUsageWithLongContext
- Guard against overwriting service-set effort values in handler
- Update stale comments that described reasoning_effort as OpenAI-only
- Add unit tests for extraction, normalization, and persistence
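
The extraction and normalization step described above could look roughly like the sketch below. The helper name `extractClaudeEffort`, the struct shape, and the rule that unrecognized values are dropped are assumptions for illustration, not the project's actual `ParseGatewayRequest` code.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// claudeRequest models only the part of the request body needed here.
type claudeRequest struct {
	OutputConfig *struct {
		Effort string `json:"effort"`
	} `json:"output_config"`
}

// extractClaudeEffort (hypothetical helper) returns the normalized effort,
// or "" when the field is absent, malformed, or not a known level.
func extractClaudeEffort(body []byte) string {
	var req claudeRequest
	if err := json.Unmarshal(body, &req); err != nil || req.OutputConfig == nil {
		return ""
	}
	effort := strings.ToLower(strings.TrimSpace(req.OutputConfig.Effort))
	switch effort {
	case "low", "medium", "high", "max":
		return effort
	default:
		// Assumption: unknown values are not logged rather than passed through.
		return ""
	}
}

func main() {
	body := []byte(`{"model":"example-model","output_config":{"effort":" High "}}`)
	fmt.Println(extractClaudeEffort(body))
}
```

Returning an empty string for missing or unknown values lets the caller leave the reasoning_effort column unset instead of storing arbitrary client input.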


14 Mar, 2026 2 commits

refactor: merge bedrock-apikey into bedrock with auth_mode credential · 4644af2c

SsageParuders authored Mar 14, 2026

Consolidate two separate channel types (bedrock + bedrock-apikey) into
a single "AWS Bedrock" channel. Authentication mode is now distinguished
by credentials.auth_mode ("sigv4" | "apikey") instead of separate types.

Backend:
- Remove AccountTypeBedrockAPIKey constant
- IsBedrock() simplified; IsBedrockAPIKey() checks auth_mode
- Add IsAPIKeyOrBedrock() helper to eliminate repeated type checks
- Extend pool mode, quota scheduling, and billing to bedrock
- Add RetryableOnSameAccount to handleBedrockUpstreamErrors
- Add "bedrock" scope to Beta Policy for independent control

Frontend:
- Merge two buttons into one "AWS Bedrock" with auth mode radio
- Badge displays "Anthropic | AWS"
- Pool mode and quota limit UI available for bedrock
- Quota display in account list (usage bars, capacity badges, reset)
- Remove all bedrock-apikey type references
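
A minimal sketch of how the merged type check might work once authentication mode lives in credentials. The `account` struct and the lower-case method names are simplified stand-ins, and treating a missing `auth_mode` as "sigv4" is an assumption, not something the commit message states.

```go
package main

import "fmt"

// account is a simplified stand-in for the project's account model.
type account struct {
	Type        string
	Credentials map[string]any
}

// isBedrock reports whether the account uses the merged "bedrock" channel type.
func (a account) isBedrock() bool { return a.Type == "bedrock" }

// isBedrockAPIKey reports whether a bedrock account authenticates with an API key.
// Assumption: a missing or non-string auth_mode falls back to SigV4.
func (a account) isBedrockAPIKey() bool {
	if !a.isBedrock() {
		return false
	}
	mode, _ := a.Credentials["auth_mode"].(string)
	return mode == "apikey"
}

func main() {
	sigv4 := account{Type: "bedrock", Credentials: map[string]any{"auth_mode": "sigv4"}}
	apikey := account{Type: "bedrock", Credentials: map[string]any{"auth_mode": "apikey"}}
	fmt.Println(sigv4.isBedrockAPIKey(), apikey.isBedrockAPIKey())
}
```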


fix: handle invalid encrypted content error and retry logic. · 2666422b
InCerry authored Mar 14, 2026


13 Mar, 2026 2 commits
- fix lint · e90ec847
  Ylarod authored Mar 13, 2026
  
- sub2api: add bedrock support · 11f7b835
  Ylarod authored Mar 12, 2026
  
12 Mar, 2026 3 commits
- fix: harden usage billing idempotency and backpressure · 6a685727
  ius authored Mar 12, 2026
  
- fix: remove unused gateway usage helpers · 8d4d3b03
  ius authored Mar 12, 2026
  
- feat: decouple billing correctness from usage log batching · 611fd884
  ius authored Mar 12, 2026
  
11 Mar, 2026 1 commit

fix: add downstream keepalive ping to Anthropic Messages API streaming forwarding · 6e90ec61

amberwarden authored Mar 11, 2026

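The streaming forward path for the Anthropic Messages API (gateway_service.go)
sends nothing downstream while the upstream stays silent for a long time (for
example during Opus extended thinking), so proxies such as Cloudflare Tunnel
drop the connection as idle.

Reuse the existing StreamKeepaliveInterval setting (default 10 seconds) and add
a keepalive branch to the select loop that periodically sends a ping event in
Anthropic's native format, matching the implementation pattern of the
OpenAI-compatible path.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>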


10 Mar, 2026 1 commit
- feat: configurable anthropic-beta policy for the Anthropic platform · 00a0a121
  shaw authored Mar 10, 2026
  
09 Mar, 2026 3 commits
- fix: gpt->claude conversion failing to hit the codex cache · a461538d
  shaw authored Mar 09, 2026
  
- fix: align effort mapping and fast in gpt->claude format conversion · ebe6f418
  shaw authored Mar 09, 2026
  
- fix: increase SSE scanner max line size from 40MB to 500MB · 91ef085d
  erio authored Mar 09, 2026
  
  4K image base64 data can exceed 40MB limit, causing "bufio.Scanner:
  token too long" errors. Scanner is adaptive (starts at 64KB, grows
  as needed), so increasing the cap has no impact on normal responses.
  
08 Mar, 2026 1 commit
- feat: support configurable same-account retry count and custom error policy for API Key upstream pool mode · e643fc38
  kyx236 authored Mar 08, 2026
  
07 Mar, 2026 2 commits

feat: support enabling or disabling the rectifier switch from the admin settings · a3791104
shaw authored Mar 07, 2026


feat(account): add daily/weekly periodic quota limits for API Key accounts · 1ee17383

erio authored Mar 07, 2026



Extend the existing total quota limit with daily and weekly periodic
dimensions. Each dimension is independently configurable and uses lazy
reset — when the period expires, usage is automatically reset to zero on
the next increment. Any dimension exceeding its limit will pause the
account from scheduling.

Backend:
- Add GetQuotaDailyLimit/Used, GetQuotaWeeklyLimit/Used, HasAnyQuotaLimit
- Rewrite IncrementQuotaUsed with atomic CTE SQL for 3-dimension update
- Rewrite ResetQuotaUsed to clear all dimensions and period timestamps
- Update postUsageBilling to use HasAnyQuotaLimit()
- Preserve daily/weekly used values on account edit

Frontend:
- Refactor QuotaLimitCard from single v-model to 3-dimension props
- Add QuotaBadge component for compact D/W/$ display
- Update AccountCapacityCell with per-dimension badges
- Update Create/Edit modals with daily/weekly quota fields
- Update AccountActionMenu hasQuotaLimit to check all dimensions
- Add i18n strings for daily/weekly/total quota labels
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>


06 Mar, 2026 4 commits

fix(openai): unify the user-specific rate multiplier billing path and add regression tests · a18bbb5f

yangjianbo authored Mar 06, 2026

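Extract a shared resolver for user-group-specific rate multipliers, unifying the caching, singleflight, and fallback logic.

Make the standalone OpenAI billing path reuse that resolver, fixing usage records and actual charges that did not apply the user-specific rate multiplier.

Add unit tests for OpenAI billing and the resolver, and fix the lint blockers exposed by the full regression run.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>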


fix(openai): remove misplaced passthrough check from isModelSupportedByAccount · c28f691f

erio authored Mar 06, 2026

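isModelSupportedByAccount is not called on the OpenAI scheduling path.
OpenAI /responses and /chat/completions go through
openai_account_scheduler.go, and the passthrough short-circuit was already
added to that file correctly in the second commit of PR #806.

The check here is redundant dead code, because OpenAI accounts never reach
this branch of isModelSupportedByAccount.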


feat(openai): add /v1/messages endpoint and API compatibility layer · ff1f1149

alfadb authored Mar 06, 2026

Add Anthropic Messages API support for OpenAI platform groups, enabling
clients using Claude-style /v1/messages format to access OpenAI accounts
through automatic protocol conversion.

- Add apicompat package with type definitions and bidirectional converters
  (Anthropic ↔ Chat, Chat ↔ Responses, Anthropic ↔ Responses)
- Implement /v1/messages endpoint for OpenAI gateway with streaming support
- Add model mapping UI for OpenAI OAuth accounts (whitelist + mapping modes)
- Support prompt caching fields and codex OAuth transforms
- Fix tool call ID conversion for Responses API (fc_ prefix)
- Ensure function_call_output has non-empty output field
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>


fix: OpenAI passthrough accounts bypass model mapping check · 79ae15d5

erio authored Mar 06, 2026

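Passthrough-mode accounts only replace authentication and should let every
model through. Previously, isModelSupportedByAccount in the scheduling stage
was unaware of passthrough mode, so new models not configured in model_mapping
(such as gpt-5.4) were rejected with a 503.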
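
The short-circuit amounts to checking the passthrough flag before consulting the mapping. The sketch below uses a hypothetical `modelAllowed` function and a simplified `account` struct; it assumes a non-passthrough account accepts only models present in its mapping.

```go
package main

import "fmt"

// account is a simplified stand-in for an OpenAI account record.
type account struct {
	Passthrough  bool
	ModelMapping map[string]string
}

// modelAllowed (hypothetical) reports whether the account may serve model.
func modelAllowed(a account, model string) bool {
	if a.Passthrough {
		// Passthrough accounts only swap credentials, so any model is accepted.
		return true
	}
	_, ok := a.ModelMapping[model]
	return ok
}

func main() {
	mapped := account{ModelMapping: map[string]string{"gpt-4o": "gpt-4o"}}
	passthrough := account{Passthrough: true}
	fmt.Println(modelAllowed(mapped, "gpt-5.4"), modelAllowed(passthrough, "gpt-5.4"))
}
```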


05 Mar, 2026 5 commits

feat: add independent load_factor field for scheduling load calculation · 0d6c1c77
erio authored Mar 06, 2026


refactor: unify post-usage billing logic and fix account quota calculation · 02dea7b0

erio authored Mar 06, 2026

- Extract postUsageBilling() to consolidate billing logic across
  GatewayService.RecordUsage, RecordUsageWithLongContext, and
  OpenAIGatewayService.RecordUsage, eliminating ~120 lines of
  duplicated code
- Fix account quota to use TotalCost × accountRateMultiplier
  (was using raw TotalCost, inconsistent with account cost stats)
- Fix RecordUsageWithLongContext API Key quota only updating in
  balance mode (now updates regardless of billing type)
- Fix WebSocket client disconnect detection on Windows by adding
  "an established connection was aborted" to known disconnect errors


feat: add quota limit for API key accounts · 05527b13

erio authored Mar 05, 2026

- Add configurable spending limit (quota_limit) for apikey-type accounts
- Atomic quota accumulation via PostgreSQL JSONB operations on TotalCost
- Scheduler filters out over-quota accounts with outbox-triggered snapshot refresh
- Display quota usage ($used / $limit) in account capacity column
- Add "Reset Quota" action in account menu to reset usage to zero
- Editing account settings preserves quota_used (no accidental reset)
- Covers all 3 billing paths: Anthropic, Gemini, OpenAI RecordUsage

chore: bump version to 0.1.90.4


fix: claude apikey account requests were missing the beta=true query parameter · 9d70c385
shaw authored Mar 05, 2026

feat: apply model mapping to the /v1/messages/count_tokens endpoint · aeb464f3
shaw authored Mar 05, 2026


03 Mar, 2026 2 commits

feat: apikey supports 5h/1d/7d rate control · a80ec5d8
shaw authored Mar 03, 2026


fix(gateway): group isolation, prevent ungrouped accounts from being scheduled across groups · 530a1629

QTom authored Mar 03, 2026

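When an API Key has no group, scheduling selects only from the pool of
ungrouped accounts. Fix the isAccountInGroup logic for groupID==nil, and add
the missing SimpleMode guards in scheduler_snapshot_service and
gemini_compat_service so that group isolation holds on every scheduling path.

Add ListSchedulableUngroupedByPlatform/s methods, which use Ent's
Not(HasAccountGroups()) predicate to isolate ungrouped accounts.
Add 17 unit and end-to-end isolation tests covering all branches and edge cases.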
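
The groupID==nil rule can be sketched as follows. The `account` struct and this standalone `isAccountInGroup` are simplified assumptions; the real function also has to account for SimpleMode, which is omitted here.

```go
package main

import "fmt"

// account is a simplified stand-in carrying only group membership.
type account struct {
	ID       int64
	GroupIDs []int64
}

// isAccountInGroup reports whether the account may serve a key bound to groupID.
// A nil groupID means the API Key is ungrouped, so only ungrouped accounts match.
func isAccountInGroup(a account, groupID *int64) bool {
	if groupID == nil {
		return len(a.GroupIDs) == 0
	}
	for _, id := range a.GroupIDs {
		if id == *groupID {
			return true
		}
	}
	return false
}

func main() {
	g := int64(7)
	grouped := account{ID: 1, GroupIDs: []int64{7}}
	ungrouped := account{ID: 2}
	fmt.Println(isAccountInGroup(grouped, nil), isAccountInGroup(ungrouped, nil))
	fmt.Println(isAccountInGroup(grouped, &g), isAccountInGroup(ungrouped, &g))
}
```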


02 Mar, 2026 1 commit

feat(gateway): dual-mode user message queue, serial queue + soft rate limiting · a9285b8a

QTom authored Mar 03, 2026

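Add dual-mode support for the UMQ (User Message Queue):
- serialize: account-level distributed serial lock + RPM-adaptive delay (strict rate limiting)
- throttle: RPM-adaptive pre-request delay only, without blocking concurrency (soft rate limiting)

Backend:
- config: add a Mode field, keep Enabled for backward compatibility
- service: add UserMessageQueueService (Lua lock / delay algorithm / cleanup worker)
- repository: add UserMsgQueueCache (Redis Lua acquire/release/force-release)
- handler: add UserMsgQueueHelper (SSE ping + wait loop + throttle)
- gateway: integrate serialize/throttle logic, branching on mode
- lint: fix gofmt rewrite rules, errcheck type assertions, staticcheck QF1012

Frontend:
- Three-state selector UI (off / soft rate limiting / serial queue) replaces the toggle switch
- BulkEdit supports null semantics (leave unchanged)
- i18n copy in Chinese and English

Passed 6 rounds of expert review (42 reviews), golangci-lint, unit tests, and integration tests.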


28 Feb, 2026 8 commits

fix: round-3 review fixes for RPM limiting · 2491e9b5

QTom authored Feb 28, 2026

- Add sanitizeExtraBaseRPM to BulkUpdate handler (was missing)
- Add WindowCost scheduling checks to legacy non-sticky selection
  paths (4 sites), matching existing sticky + load-aware coverage
- Export ParseExtraInt from service package, remove duplicate
  parseExtraIntForValidation from admin handler


fix: address deep code review issues for RPM limiting · e63c8395

QTom authored Feb 28, 2026

- Move IncrementRPM after Forward success to prevent phantom RPM
  consumption during account switch retries
- Add base_rpm input sanitization (clamp to 0-10000) in Create/Update
- Add WindowCost scheduling checks to legacy path sticky sessions
  (4 check sites + 4 prefetch sites), fixing pre-existing gap
- Clean up rpm_strategy/rpm_sticky_buffer when disabling RPM in
  BulkEditModal (JSONB merge cannot delete keys, use empty values)
- Add json.Number test cases to TestGetBaseRPM/TestGetRPMStickyBuffer
- Document TOCTOU race as accepted soft-limit design trade-off


fix: move RPM prefetch before routing segment in legacy/mixed paths · ff9683b0

QTom authored Feb 28, 2026

Ensures isAccountSchedulableForRPM calls within the routing segment
hit the prefetch cache instead of querying Redis individually.


fix: address code review issues for RPM limiting feature · 60723757

QTom authored Feb 28, 2026

- Use TxPipeline (MULTI/EXEC) instead of Pipeline for atomic INCR+EXPIRE
- Filter negative values in GetBaseRPM(), update test expectation
- Add RPM batch query (GetRPMBatch) to account List API
- Add warn logs for RPM increment failures in gateway handler
- Reset enableRpmLimit on BulkEditAccountModal close
- Use union type 'tiered' | 'sticky_exempt' for rpmStrategy refs
- Add design decision comments for rdb.Time() RTT trade-off


feat: increment RPM counter before request forwarding · f648b8e0
QTom authored Feb 28, 2026

feat: integrate RPM scheduling checks into account selection flow · 678c3ae1
QTom authored Feb 28, 2026

feat: wire RPMCache into GatewayService and AccountHandler · c1c31ed9
QTom authored Feb 28, 2026

feat(sync): full code sync from release · bb664d9b
yangjianbo authored Feb 28, 2026


27 Feb, 2026 1 commit

feat: replace gemini-3-pro-image with gemini-3.1-flash-image · a6f9f9f9

erio authored Feb 27, 2026

- Add migration 060 to update model_mapping for all antigravity accounts
- Remove gemini-3-pro-image and gemini-3-pro-image-preview mappings
- Add gemini-3.1-flash-image and gemini-3.1-flash-image-preview mappings
- Update frontend usage window to show GImage for new model
- Update isImageGenerationModel to support new model


26 Feb, 2026 3 commits

fix: address review - fix log wording and add response body assertion in test · e6969acb
alfadb authored Feb 26, 2026


fix(gateway): return 404 instead of fake 200 for unsupported count_tokens endpoint · 94895314

alfadb authored Feb 26, 2026

PR #635 returned HTTP 200 with {"input_tokens": 0} when upstream doesn't
support count_tokens (404). This caused Claude Code CLI to trust the zero
value, believing context uses 0 tokens, so auto-compression never triggers.

Fix: return 404 with proper error body so CLI falls back to its local
tokenizer for accurate estimation. Return nil (not error) to avoid
polluting ops error metrics with expected 404s.

Affected paths:
- Passthrough APIKey accounts: upstream 404 now passed through as 404
- Antigravity accounts: same fix (was also returning fake 200)


fix: temporarily remove fast-mode-2026-02-01 to avoid 429 errors · 4ac57b4e
shaw authored Feb 26, 2026
