Commits · b20e142249fd752e2a5471d4cc21d3b4b90f4739 · 陈曦 / sub2api

26 Mar, 2026 1 commit

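feat: preserve gateway request header wire casing, add forwarding behavior toggles, enhance debug logging, and restore accept-encoding · b20e1422

shaw authored Mar 26, 2026

- Add header_util.go, which uses setHeaderRaw/getHeaderRaw/addHeaderRaw to bypass
  Go's canonical-case normalization and keep the header casing captured from real
  Claude CLI traffic (e.g. "x-app" rather than "X-App", "X-Stainless-OS" rather
  than "X-Stainless-Os")
- Add admin toggles: fingerprint unification (on by default) and metadata
  passthrough (off by default), cached with the atomic.Value + singleflight
  pattern and a 60s TTL
- Upgrade debug logging from printing the body to the console to full file-level
  snapshots (headers in real wire order + formatted JSON body + context metadata)
- Restore accept-encoding to the allowlist and add decompressResponseBody in
  http_upstream.go to handle gzip/brotli/deflate decompression (Go does not
  decompress automatically when Accept-Encoding is set explicitly)
- Update the OAuth service axios UA from 1.8.4 to 1.13.6
- Switch test assertions to getHeaderRaw to match the raw header storage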
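
As a rough illustration of the raw-header idea, the sketch below writes to the http.Header map directly instead of going through Set/Get, which canonicalize the key. The function names are taken from the commit message, but their bodies here are assumptions and not the actual contents of header_util.go.

```go
package main

import (
	"fmt"
	"net/http"
)

// setHeaderRaw stores a header under the exact key given, skipping the
// canonicalization that http.Header.Set applies ("x-app" -> "X-App").
func setHeaderRaw(h http.Header, key, value string) {
	h[key] = []string{value}
}

// getHeaderRaw reads a header by its exact key. http.Header.Get would
// canonicalize the key first and miss a raw lowercase entry.
func getHeaderRaw(h http.Header, key string) string {
	if v := h[key]; len(v) > 0 {
		return v[0]
	}
	return ""
}

func main() {
	h := http.Header{}
	setHeaderRaw(h, "x-app", "cli")
	h.Set("x-stainless-os", "Linux") // Set rewrites the key to "X-Stainless-Os"

	fmt.Println(getHeaderRaw(h, "x-app"))          // raw key is preserved
	fmt.Println(h.Get("x-app") == "")              // Get looks up "X-App" and finds nothing
	fmt.Println(getHeaderRaw(h, "X-Stainless-Os")) // Set stored the canonical form
}
```

This also shows why the tests had to change: once headers are stored raw, a lookup through Get no longer finds them.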


24 Mar, 2026 2 commits
- feat: support configuring and displaying custom endpoints · 995bee14
  shaw authored Mar 24, 2026
  
- refactor(test): improve type assertions in ops endpoint context tests · f10e56be
  Ethan0x0000 authored Mar 24, 2026
  
23 Mar, 2026 2 commits

feat(handler): add Responses/ChatCompletions handlers on GatewayHandler · 31660c4c

Ethan0x0000 authored Mar 23, 2026

New HTTP handlers for Anthropic platform groups accepting OpenAI-format
endpoints:

- GatewayHandler.Responses: /v1/responses for non-OpenAI groups
- GatewayHandler.ChatCompletions: /v1/chat/completions for non-OpenAI groups

Both handlers include:
- Claude Code only restriction (rejects with 403 when claude_code_only is
  enabled, since requests to these endpoints never come from Claude Code clients)
- Full auth → billing → user/account concurrency → failover loop
- Ops error/endpoint context propagation
- Async usage recording via worker pool

Error responses use each endpoint's native format (Responses API format
for /v1/responses, Chat Completions format for /v1/chat/completions).


feat(admin): add account privacy mode filter · 4838ab74
weak-fox authored Mar 23, 2026


21 Mar, 2026 5 commits
- test(ops): add tests for setOpsEndpointContext and safeUpstreamURL · 7cd38248
  Ethan0x0000 authored Mar 21, 2026
  
- feat(ops): propagate endpoint/request-type context in handlers; add... · db9021f9
  Ethan0x0000 authored Mar 21, 2026
```
feat(ops): propagate endpoint/request-type context in handlers; add UpstreamURL to upstream error events
```
- feat(ops): adapt repository INSERT/SELECT + add setOpsEndpointContext in error logger middleware · a2418c60
  Ethan0x0000 authored Mar 21, 2026
  
- fix(settings): prevent SMTP config overwrite and stabilize test after refresh · 1fb29d59
  Eilen6316 authored Mar 21, 2026
  
- fix(dto): fallback to legacy model in usage mapping · 27948c77
  Ethan0x0000 authored Mar 21, 2026
```
Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>
```
20 Mar, 2026 4 commits

refactor(dto): split admin usage upstream model exposure · 095200bd

Ethan0x0000 authored Mar 21, 2026

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>


Fix OpenAI default model forwarding · 4617ef2b
Jiahao Luo authored Mar 20, 2026


fix: quota display shows stale cumulative usage after daily/weekly reset · 0d45d866

wucm667 authored Mar 20, 2026

The quota reset mechanism is lazy — quota_daily_used/quota_weekly_used
in the database are only reset on the next IncrementQuotaUsed call.
The scheduling layer (IsQuotaExceeded) correctly checks period expiry
before enforcing limits, so the account remains usable. However, the
API response mapper reads the raw DB value without checking expiry,
causing the frontend to display cumulative usage (e.g. 110%) even
after the reset period has passed.

Add IsDailyQuotaPeriodExpired/IsWeeklyQuotaPeriodExpired methods and
use them in the mapper to return used=0 when the period has expired.
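
A minimal sketch of the mapper-side check described above. The struct and its field names are hypothetical stand-ins for the stored quota data; only the idea (report 0 once the period has expired, even if the lazy reset has not run yet) comes from the commit message.

```go
package main

import (
	"fmt"
	"time"
)

// quotaState is a hypothetical stand-in for the account's stored quota fields.
type quotaState struct {
	DailyUsed       float64
	DailyPeriodEnds time.Time // assumed: end of the current daily window
}

// isDailyQuotaPeriodExpired reports whether the stored window has already ended.
func (q quotaState) isDailyQuotaPeriodExpired(now time.Time) bool {
	return !now.Before(q.DailyPeriodEnds)
}

// displayedDailyUsed is what a response mapper could return: the raw value
// while the window is live, and 0 once it has expired but not yet been reset.
func (q quotaState) displayedDailyUsed(now time.Time) float64 {
	if q.isDailyQuotaPeriodExpired(now) {
		return 0
	}
	return q.DailyUsed
}

func main() {
	now := time.Now()
	stale := quotaState{DailyUsed: 110, DailyPeriodEnds: now.Add(-time.Hour)}
	live := quotaState{DailyUsed: 40, DailyPeriodEnds: now.Add(time.Hour)}

	fmt.Println(stale.displayedDailyUsed(now)) // window ended an hour ago
	fmt.Println(live.displayedDailyUsed(now))  // window still open
}
```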


feat: add max_claude_code_version setting and disable auto-upgrade env var · 01d8286b

shaw authored Mar 20, 2026

Add maximum Claude Code version limit to complement the existing minimum
version check. Refactor the version cache from single-value to unified
bounds struct (min+max) with a single atomic.Value and singleflight group.

- Backend: new constant, struct field, cache refactor, validation (semver
  format + cross-validation max >= min), gateway enforcement, audit diff
- Frontend: settings UI input, TypeScript types, zh/en i18n
- Add CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1 to all Claude Code
  tutorials on /keys page (unix/cmd/powershell/vscode settings.json)
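
The validation step (semver format plus the max >= min cross-check) could look roughly like the sketch below. All function names are hypothetical, an empty string is assumed to mean "no limit", and only plain MAJOR.MINOR.PATCH versions are handled.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseSemver parses "MAJOR.MINOR.PATCH" into three integers.
// Pre-release and build suffixes are not handled in this sketch.
func parseSemver(s string) ([3]int, error) {
	var v [3]int
	parts := strings.Split(s, ".")
	if len(parts) != 3 {
		return v, fmt.Errorf("invalid version %q: want MAJOR.MINOR.PATCH", s)
	}
	for i, p := range parts {
		n, err := strconv.Atoi(p)
		if err != nil || n < 0 {
			return v, fmt.Errorf("invalid version %q: bad component %q", s, p)
		}
		v[i] = n
	}
	return v, nil
}

// compareSemver returns -1, 0, or 1 when a is below, equal to, or above b.
func compareSemver(a, b [3]int) int {
	for i := range a {
		if a[i] < b[i] {
			return -1
		}
		if a[i] > b[i] {
			return 1
		}
	}
	return 0
}

// validateVersionBounds checks each non-empty bound and, when both are set,
// that the maximum is not below the minimum.
func validateVersionBounds(minVer, maxVer string) error {
	var lo, hi [3]int
	var err error
	if minVer != "" {
		if lo, err = parseSemver(minVer); err != nil {
			return err
		}
	}
	if maxVer != "" {
		if hi, err = parseSemver(maxVer); err != nil {
			return err
		}
	}
	if minVer != "" && maxVer != "" && compareSemver(hi, lo) < 0 {
		return fmt.Errorf("max version %s is below min version %s", maxVer, minVer)
	}
	return nil
}

func main() {
	fmt.Println(validateVersionBounds("2.0.0", "2.1.5")) // valid pair
	fmt.Println(validateVersionBounds("2.1.0", "2.0.5")) // max below min
	fmt.Println(validateVersionBounds("2.1", ""))        // malformed min
}
```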


19 Mar, 2026 4 commits

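feat(admin): add group column, group filter, and one-click exclusive group replacement to user management · ba7d2aec

QTom authored Mar 18, 2026

- Add group column: shows a user's exclusive/public groups, with details on hover
- Add group filter: filter users by group via dropdown selection or fuzzy search
  on the group name
- Exclusive group replacement: clicking an exclusive group opens an action menu;
  once a target group is selected, the new group's permission is granted, bound
  keys are migrated, and the old group's permission is removed automatically
- Backend: add a POST /admin/users/:id/replace-group endpoint that performs the
  group replacement inside a transaction and invalidates the auth cache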


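feat: passive usage sampling for Anthropic accounts; pages show passive data by default · 525cdb88

shaw authored Mar 19, 2026

Passively collect 5h/7d utilization from upstream /v1/messages response headers
and store it in Account.Extra; on page load, read the local data directly instead
of calling the external Usage API. Users can click the "Query" button to actively
fetch the latest data, and active query results are written back to the passive
cache automatically.

Backend:
- UpdateSessionWindow merges collection of the 5h + 7d headers into a single DB write
- Add GetPassiveUsage to build UsageInfo from Extra (reusing estimateSetupTokenUsage)
- After an active query, GetUsage calls syncActiveToPassive to write back to the passive cache
- Register the passive_usage_ prefix as scheduler-neutral

Frontend:
- Anthropic accounts default to source=passive on mount/refresh
- Add a "Passive sampling" label and a "Query" button (with loading animation)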


feat: add ungrouped filter to account · 8027531d
Hg authored Mar 19, 2026


fix: record original upstream status code when failover exhausted (#1128) · 1fd1a58a

haruka authored Mar 19, 2026

When all failover accounts are exhausted, handleFailoverExhausted maps
the upstream status code (e.g. 403) to a client-facing code (e.g. 502).
Previously it did not write the original code to the gin context, so ops
error logs showed the mapped code instead of the real upstream code.

Call SetOpsUpstreamError before mapUpstreamError in all failover-
exhausted paths so that ops_error_logger captures the true upstream
status code and message.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
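
The ordering fix can be pictured with the simplified sketch below. It uses a plain map in place of the gin context, and the helper names and the status mapping are assumptions made for illustration, not the project's code.

```go
package main

import "fmt"

// requestContext is a hypothetical stand-in for the per-request gin context.
type requestContext map[string]int

// setOpsUpstreamStatus records the real upstream status for the ops logger.
func setOpsUpstreamStatus(ctx requestContext, status int) {
	ctx["ops_upstream_status"] = status
}

// mapUpstreamStatus converts an upstream status into the client-facing one.
// The mapping here is assumed for illustration only.
func mapUpstreamStatus(status int) int {
	if status == 429 {
		return 429
	}
	return 502
}

// handleFailoverExhausted records the original status first, then maps it,
// so the ops log and the client response can differ.
func handleFailoverExhausted(ctx requestContext, upstreamStatus int) int {
	setOpsUpstreamStatus(ctx, upstreamStatus)
	return mapUpstreamStatus(upstreamStatus)
}

func main() {
	ctx := requestContext{}
	clientStatus := handleFailoverExhausted(ctx, 403)
	fmt.Println("client sees:", clientStatus)
	fmt.Println("ops log sees:", ctx["ops_upstream_status"])
}
```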


18 Mar, 2026 5 commits

feat: add 529 overload cooldown toggle and duration settings in admin gateway page · bf3d6c0e

shaw authored Mar 18, 2026

Move 529 overload cooldown configuration from config file to admin
settings UI. Adds an enable/disable toggle and configurable cooldown
duration (1-120 min) under /admin/settings gateway tab, stored as
JSON in the settings table.

When disabled, 529 errors are logged but accounts are no longer
paused from scheduling. Falls back to config file value when DB
is unreachable or settingService is nil.
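
A hedged sketch of the fallback behavior: the stored JSON is used when it can be read and parsed, otherwise the config-file value applies. The struct, its JSON field names, the clamping to the 1-120 minute range, and the use of an empty string for "settings unavailable" are all assumptions.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// overloadCooldown is a hypothetical shape for the stored setting.
type overloadCooldown struct {
	Enabled bool `json:"enabled"`
	Minutes int  `json:"cooldown_minutes"`
}

// resolveOverloadCooldown returns the DB-backed setting when raw holds valid
// JSON, and the config-file fallback otherwise (raw == "" models an
// unreachable DB or a nil settings service).
func resolveOverloadCooldown(raw string, fallback overloadCooldown) overloadCooldown {
	var c overloadCooldown
	if raw == "" || json.Unmarshal([]byte(raw), &c) != nil {
		return fallback
	}
	// Keep the duration inside the range the admin UI allows.
	if c.Minutes < 1 {
		c.Minutes = 1
	}
	if c.Minutes > 120 {
		c.Minutes = 120
	}
	return c
}

func main() {
	fallback := overloadCooldown{Enabled: true, Minutes: 10} // assumed config-file value

	fmt.Println(resolveOverloadCooldown(`{"enabled":false,"cooldown_minutes":30}`, fallback))
	fmt.Println(resolveOverloadCooldown("", fallback))
}
```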


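fix: fix CI failure caused by an invalid metadata.user_id format in hotpath tests · 7414bdf0

shaw authored Mar 18, 2026

The session ID "abc-123" used in the test data does not match the
36-character UUID format required by ParseMetadataUserID; replace it
with a valid UUID.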


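feat(admin): add capacity column to group management (real-time concurrency/session/RPM aggregation) · d4cc9871

QTom authored Mar 18, 2026

Reuse GroupCapacityService to add a capacity column to the admin group list,
showing each group's real-time concurrency/session/RPM usage and limits.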
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>


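feat(admin): add usage columns and account count breakdown to the group management list · 961c30e7

QTom authored Mar 17, 2026

Group management list enhancements:

1. Today/cumulative usage columns:
   - Add a dedicated endpoint GET /admin/groups/usage-summary
   - Return today's cost and cumulative cost (actual_cost) for all groups in a single query
   - The frontend loads the data asynchronously and merges it into the group list

2. Account count split into available/rate-limited/total:
   - Change the account count column from a single total to a multi-line display inside a badge
   - Available: number of active + schedulable accounts (green)
   - Rate-limited: number of rate_limit/overload/temp_unschedulable accounts (orange; hidden when none are rate-limited)
   - Total: number of all associated accounts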
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>


feat: add platform type filter to subscription management page · 50a3c7fa

Gemini Wen authored Mar 18, 2026



Add a platform filter dropdown to the admin subscriptions view, allowing
filtering subscriptions by platform (Anthropic, OpenAI, Gemini, etc.)
through the group association.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>


17 Mar, 2026 2 commits

test(backend): add tests for upstream model tracking and model source filtering · eeff451b

Ethan0x0000 authored Mar 17, 2026

Cover IsValidModelSource/NormalizeModelSource, resolveModelDimensionExpression SQL expressions, invalid model_source 400 responses on both GetModelStats and GetUserBreakdown, upstream_model in scan/insert SQL mock expectations, and updated passthrough/billing test signatures.


feat(api): expose model_source filter in dashboard endpoints · 56fcb20f

Ethan0x0000 authored Mar 17, 2026

Add model_source query parameter to GetModelStats and GetUserBreakdown handlers with explicit IsValidModelSource validation. Include model_source in cache key to prevent cross-source cache hits. Expose upstream_model in usage log DTO with omitempty semantics.


16 Mar, 2026 3 commits

test(dashboard): add unit tests for user-breakdown API · e0286e50

erio authored Mar 16, 2026

Handler tests (9 cases): group_id/model/endpoint filters, default
endpoint_type, custom limit, limit clamping, response format,
empty result, no-filter pass-through.

Repository test: resolveEndpointColumn mapping for inbound/upstream/path.


feat(dashboard): add per-user drill-down for group, model, and endpoint distributions · 4b41e898

erio authored Mar 16, 2026

Click on a group name, model name, or endpoint name in the distribution
tables to expand and show per-user usage breakdown (requests, tokens,
actual cost, standard cost).

Backend: new GET /admin/dashboard/user-breakdown API with group_id,
model, endpoint, endpoint_type filters.
Frontend: clickable rows with expand/collapse sub-table in all three
distribution charts.


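feat(backup): make backup/restore asynchronous to fix 504 timeouts · c1fab7f8

QTom authored Mar 16, 2026

POST /backups and POST /backups/:id/restore are now asynchronous: they return
HTTP 202 immediately, a background goroutine independently runs pg_dump → gzip →
S3 upload, and the frontend polls the status every 2s.

Backend:
- Add StartBackup/StartRestore methods; the background goroutine does not depend on the HTTP connection
- Graceful shutdown waits for active operations to finish; orphaned running records are cleaned up at startup
- Add progress/restore_status fields to BackupRecord to track progress and restore status

Frontend:
- After creating a backup/restore, poll GET /backups/:id until it completes or fails
- Pause/resume polling on tab switch; clear timers when the component unmounts
- Correctly handle 409 (backup in progress) and polling timeouts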
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>


15 Mar, 2026 12 commits

fix(admin): polish spending ranking and usage defaults · 8147866c
Peter authored Mar 16, 2026


refactor: migrate all handlers to shared endpoint normalization middleware · 7bd1972f

Ethan0x0000 authored Mar 15, 2026

- Apply InboundEndpointMiddleware to all gateway route groups
- Replace normalizedOpenAIInboundEndpoint/normalizedOpenAIUpstreamEndpoint and normalizedGatewayInboundEndpoint/normalizedGatewayUpstreamEndpoint with GetInboundEndpoint/GetUpstreamEndpoint
- Remove 4 old constants and 4 old normalization functions (-70 lines)
- Migrate existing endpoint normalization test to new API

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>


refactor: add unified endpoint normalization infrastructure · 2c9dcfe2

Ethan0x0000 authored Mar 15, 2026

Introduce endpoint.go with shared constants, NormalizeInboundEndpoint, DeriveUpstreamEndpoint, InboundEndpointMiddleware, and context helpers. This replaces the two separate normalization implementations (OpenAI and Gateway) with a single source of truth. Includes comprehensive test coverage.

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>


fix: use half-open date ranges for DST-safe usage queries · c637e6cf

Ethan0x0000 authored Mar 15, 2026

Replace t.Add(24*time.Hour - time.Nanosecond) with t.AddDate(0, 0, 1) and use SQL < instead of <= for end-of-day boundaries. This avoids edge-case misses around DST transitions.
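
To see why the fixed 24-hour offset goes wrong, the sketch below builds a half-open [start, end) range with AddDate and measures a local day across a DST change. The helper names are hypothetical and the time zone is chosen only for illustration.

```go
package main

import (
	"fmt"
	"time"
	_ "time/tzdata" // embeds the IANA zone database so LoadLocation works without system files
)

// dayRange returns the half-open interval [start, end) covering the local
// calendar day that contains t. Queries would use ts >= start AND ts < end.
func dayRange(t time.Time) (start, end time.Time) {
	y, m, d := t.Date()
	start = time.Date(y, m, d, 0, 0, 0, 0, t.Location())
	return start, start.AddDate(0, 0, 1)
}

// localDayLength reports how long the given calendar day lasts in zone.
func localDayLength(zone string, y int, m time.Month, d int) (time.Duration, error) {
	loc, err := time.LoadLocation(zone)
	if err != nil {
		return 0, err
	}
	start, end := dayRange(time.Date(y, m, d, 12, 0, 0, 0, loc))
	return end.Sub(start), nil
}

func main() {
	fallBack, err := localDayLength("America/New_York", 2026, time.November, 1)
	if err != nil {
		panic(err)
	}
	fmt.Println(fallBack) // 25h0m0s: clocks fall back, so the day is an hour longer

	springForward, err := localDayLength("America/New_York", 2026, time.March, 8)
	if err != nil {
		panic(err)
	}
	fmt.Println(springForward) // 23h0m0s: clocks spring forward, so the day is an hour shorter
}
```

On the 25-hour day, an end bound of start plus 24 hours minus one nanosecond stops an hour early and misses that hour's rows; on the 23-hour day it runs into the next day.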

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode)
Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>


fix(ops): align constant declarations for gofmt compliance · bdbc8fa0
erio authored Mar 15, 2026


fix(ops): match "insufficient account balance" in error filter · 63f3af0f

erio authored Mar 15, 2026

The upstream Gemini API returns "Insufficient account balance" which
doesn't contain the substring "insufficient balance". Add explicit
match for the full phrase to ensure the filter works correctly.
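
The mismatch is easy to reproduce: "insufficient balance" is not a contiguous substring of "insufficient account balance". A small sketch, with a hypothetical function name and an assumed case-insensitive comparison:

```go
package main

import (
	"fmt"
	"strings"
)

// insufficientBalancePhrases lists the two phrases quoted in the commit
// message; any further entries would be assumptions.
var insufficientBalancePhrases = []string{
	"insufficient balance",
	"insufficient account balance",
}

// isInsufficientBalance reports whether msg contains any known phrase,
// ignoring letter case.
func isInsufficientBalance(msg string) bool {
	lower := strings.ToLower(msg)
	for _, p := range insufficientBalancePhrases {
		if strings.Contains(lower, p) {
			return true
		}
	}
	return false
}

func main() {
	msg := "Insufficient account balance"
	fmt.Println(strings.Contains(strings.ToLower(msg), "insufficient balance")) // the old check misses it
	fmt.Println(isInsufficientBalance(msg))                                     // the explicit phrase matches
}
```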


style: fix gofmt formatting issues · 686f890f
IanShaw027 authored Mar 15, 2026

fix: add UI configuration of the email-sending domain for the password reset feature · ae44a943
shaw authored Mar 15, 2026


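fix: tolerate partially empty quota limit fields #1021 · c31974c9

IanShaw027 authored Mar 15, 2026

Fix an error raised when quota limits are entered without filling in all three
values (daily, weekly, and monthly limits).

Changes:
- Backend: add an optionalLimitField type that handles null and empty strings, so that partially empty limit fields are accepted
- Frontend: add a normalizeOptionalLimit function that normalizes limit input, mapping null, empty strings, and invalid numbers to null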
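
One way a backend type can accept null, an empty string, or a number for the same field is a custom UnmarshalJSON. The sketch below is an assumption about how such a type might work; the type name, JSON field names, and the acceptance of numeric strings are illustrative and not taken from the project.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"strconv"
)

// optionalLimit holds a limit that may be absent. Value stays nil when the
// client sends null, "", or omits the field.
type optionalLimit struct {
	Value *float64
}

// UnmarshalJSON accepts null, an empty string, a number, or a numeric string.
func (o *optionalLimit) UnmarshalJSON(data []byte) error {
	data = bytes.TrimSpace(data)
	if bytes.Equal(data, []byte("null")) || bytes.Equal(data, []byte(`""`)) {
		o.Value = nil
		return nil
	}
	if len(data) > 0 && data[0] == '"' {
		var s string
		if err := json.Unmarshal(data, &s); err != nil {
			return err
		}
		f, err := strconv.ParseFloat(s, 64)
		if err != nil {
			return fmt.Errorf("invalid limit %q", s)
		}
		o.Value = &f
		return nil
	}
	var f float64
	if err := json.Unmarshal(data, &f); err != nil {
		return err
	}
	o.Value = &f
	return nil
}

// limits uses hypothetical JSON field names.
type limits struct {
	Daily   optionalLimit `json:"daily_limit"`
	Weekly  optionalLimit `json:"weekly_limit"`
	Monthly optionalLimit `json:"monthly_limit"`
}

// parseLimits decodes a request body into limits.
func parseLimits(body string) (limits, error) {
	var l limits
	err := json.Unmarshal([]byte(body), &l)
	return l, err
}

func main() {
	l, err := parseLimits(`{"daily_limit": 10, "weekly_limit": "", "monthly_limit": null}`)
	if err != nil {
		panic(err)
	}
	fmt.Println(*l.Daily.Value, l.Weekly.Value == nil, l.Monthly.Value == nil)
}
```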


feat(ops): add ignore insufficient balance errors toggle and extract error constants · cfe72159

erio authored Mar 15, 2026

- Add 5th error filter switch IgnoreInsufficientBalanceErrors to suppress
  upstream insufficient balance / insufficient_quota errors from ops log
- Extract hardcoded error strings into package-level constants for
  shouldSkipOpsErrorLog, normalizeOpsErrorType, classifyOpsPhase, and
  classifyOpsIsBusinessLimited
- Define ErrNoAvailableAccounts sentinel error and replace all
  errors.New("no available accounts") call sites
- Update tests to use require.ErrorIs with the sentinel error
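
The sentinel-error change follows a standard Go pattern, sketched below. ErrNoAvailableAccounts and its message come from the commit message; selectAccount and isNoAvailableAccounts are hypothetical helpers added for the example.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrNoAvailableAccounts is a package-level sentinel, so callers can test for
// it with errors.Is instead of comparing message strings.
var ErrNoAvailableAccounts = errors.New("no available accounts")

// selectAccount is a hypothetical picker that wraps the sentinel with context.
func selectAccount(candidates []string) (string, error) {
	if len(candidates) == 0 {
		return "", fmt.Errorf("select account: %w", ErrNoAvailableAccounts)
	}
	return candidates[0], nil
}

// isNoAvailableAccounts matches the sentinel even when it has been wrapped.
func isNoAvailableAccounts(err error) bool {
	return errors.Is(err, ErrNoAvailableAccounts)
}

func main() {
	_, err := selectAccount(nil)
	fmt.Println(err)
	fmt.Println(err == ErrNoAvailableAccounts) // direct comparison fails on a wrapped error
	fmt.Println(isNoAvailableAccounts(err))    // errors.Is unwraps and matches
}
```

Each errors.New call creates a distinct value, so the earlier per-call-site errors could only be matched by their text; the shared sentinel makes errors.Is (and require.ErrorIs in tests) possible.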


Add tests · 359e5675
Elysia authored Mar 15, 2026


fix: extract and log Claude output_config.effort in usage records · 1bff2292

YanzheL authored Mar 15, 2026

Claude's output_config.effort parameter (low/medium/high/max) was not
being extracted from requests or logged in the reasoning_effort column
of usage logs. Only the OpenAI path populated this field.

Changes:
- Extract output_config.effort in ParseGatewayRequest
- Add ReasoningEffort field to ForwardResult
- Populate reasoning_effort in both RecordUsage and RecordUsageWithLongContext
- Guard against overwriting service-set effort values in handler
- Update stale comments that described reasoning_effort as OpenAI-only
- Add unit tests for extraction, normalization, and persistence
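
A rough sketch of the extraction step, assuming the effort value sits at output_config.effort in the request body as the commit message describes. The struct, the function name, and the normalization rules (trim, lowercase, drop unknown values) are assumptions.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// gatewayRequest models only the part of the body this sketch needs.
type gatewayRequest struct {
	OutputConfig struct {
		Effort string `json:"effort"`
	} `json:"output_config"`
}

// extractEffort returns the normalized effort level, or "" when the field is
// missing, malformed, or not one of low/medium/high/max.
func extractEffort(body []byte) string {
	var req gatewayRequest
	if err := json.Unmarshal(body, &req); err != nil {
		return ""
	}
	effort := strings.ToLower(strings.TrimSpace(req.OutputConfig.Effort))
	switch effort {
	case "low", "medium", "high", "max":
		return effort
	}
	return ""
}

func main() {
	fmt.Println(extractEffort([]byte(`{"model":"m","output_config":{"effort":"High"}}`)))
	fmt.Println(extractEffort([]byte(`{"model":"m"}`)) == "")
}
```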
