Commits · 90b38381737405e50b8f127aa26bd04ccee613ed · 陈曦 / sub2api

15 Mar, 2026 1 commit
- fix: remove the patternProperties field that Gemini does not support #795 · 90b38381
  IanShaw027 authored Mar 15, 2026
  
  90b38381
03 Mar, 2026 1 commit

fix(gateway): group isolation, preventing ungrouped accounts from being scheduled across groups · 530a1629

QTom authored Mar 03, 2026

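When an API Key has no group, scheduling selects only from the pool of ungrouped accounts.
Fix the logic of isAccountInGroup when groupID==nil,
and add the missing SimpleMode guards in scheduler_snapshot_service and gemini_compat_service
so that group isolation takes effect on every scheduling path.

Add the ListSchedulableUngroupedByPlatform/s methods,
which use Ent's Not(HasAccountGroups()) predicate to isolate ungrouped accounts.
Add 17 unit and end-to-end isolation tests covering all branches and edge cases.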

530a1629
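
The isolation rule described in 530a1629 can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the `Account` type, its `GroupIDs` field, and the signature of `isAccountInGroup` are assumptions; only the function name and the `groupID == nil` case come from the commit message.

```go
package main

import "fmt"

// Account is a hypothetical stand-in for the scheduler's account model.
type Account struct {
	ID       int64
	GroupIDs []int64 // empty means the account is ungrouped
}

// isAccountInGroup reports whether the account may serve a key bound to groupID.
// A nil groupID (an API key without a group) matches ungrouped accounts only.
func isAccountInGroup(a Account, groupID *int64) bool {
	if groupID == nil {
		return len(a.GroupIDs) == 0
	}
	for _, id := range a.GroupIDs {
		if id == *groupID {
			return true
		}
	}
	return false
}

func main() {
	g := int64(7)
	grouped := Account{ID: 1, GroupIDs: []int64{7}}
	ungrouped := Account{ID: 2}
	fmt.Println(isAccountInGroup(grouped, nil))   // grouped account, key without group
	fmt.Println(isAccountInGroup(ungrouped, nil)) // ungrouped account, key without group
	fmt.Println(isAccountInGroup(grouped, &g))    // grouped account, matching group
}
```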

28 Feb, 2026 1 commit
- feat(sync): full code sync from release · bb664d9b
  yangjianbo authored Feb 28, 2026
  
  bb664d9b
14 Feb, 2026 1 commit
- feat(backend): commit backend audit fixes and the accompanying test changes · d04b47b3
  yangjianbo authored Feb 14, 2026
  
  d04b47b3
12 Feb, 2026 1 commit

chore(logging): complete the backend logging audit and structured-logging migration · 584cfc3d

yangjianbo authored Feb 12, 2026

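- Migrate high-volume service and handler logs to the new logging system (LegacyPrintf/structured logging)
- Add a stdlog bridge and compatibility tests, preserving the ability to capture legacy logs
- Change the OpenAI stream-interruption alert to a structured Warn and rework its tests to capture via a sink
- Add the missing logger references in backend files; the full go test run passes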

584cfc3d

11 Feb, 2026 1 commit

fix: include Gemini thoughtsTokenCount in output token billing · d21d70a5

sususu98 authored Feb 11, 2026

Gemini 2.5 Pro/Flash thinking models return thoughtsTokenCount separately
from candidatesTokenCount in usageMetadata, but this field was not parsed
or included in billing calculations, causing thinking tokens to be
unbilled.

- Add ThoughtsTokenCount field to GeminiUsageMetadata struct
- Include thoughtsTokenCount in OutputTokens across all 3 Gemini usage
  parsing paths (non-streaming, streaming, compat layer)
- Add tests covering thinking token scenarios

Closes #554

d21d70a5
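
The billing change in d21d70a5 amounts to adding the thinking tokens to the output total. A minimal sketch, assuming a standalone helper named `outputTokens` (hypothetical); the struct and field names follow the commit message, and the JSON keys are the ones the message quotes from `usageMetadata`:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// GeminiUsageMetadata mirrors the usageMetadata fields named in the commit.
type GeminiUsageMetadata struct {
	PromptTokenCount     int `json:"promptTokenCount"`
	CandidatesTokenCount int `json:"candidatesTokenCount"`
	ThoughtsTokenCount   int `json:"thoughtsTokenCount"`
}

// outputTokens (hypothetical helper) bills thinking tokens together with
// candidate tokens, so thinking models are no longer under-billed.
func outputTokens(raw []byte) (int, error) {
	var u GeminiUsageMetadata
	if err := json.Unmarshal(raw, &u); err != nil {
		return 0, err
	}
	return u.CandidatesTokenCount + u.ThoughtsTokenCount, nil
}

func main() {
	raw := []byte(`{"promptTokenCount":10,"candidatesTokenCount":20,"thoughtsTokenCount":30}`)
	n, err := outputTokens(raw)
	fmt.Println(n, err)
}
```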

10 Feb, 2026 1 commit

perf(backend): optimize hot-path JSON handling with gjson/sjson · 58912d4a

yangjianbo authored Feb 10, 2026



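Replace json.Unmarshal+json.Marshal on the API gateway hot path with gjson zero-copy queries and sjson targeted writes:
- unwrapV1InternalResponse is 22x faster (4009ns→182ns), with 28.5x fewer memory allocations
- unwrapGeminiResponse, extractGeminiUsage, estimateGeminiCountTokens, and ParseGeminiRateLimitResetTime now take []byte and extract fields with gjson
- ParseGatewayRequest now extracts model/stream/metadata/thinking/max_tokens with gjson's type-safe accessors
- The handler layer (sora/openai) now extracts fields with gjson and injects/modifies fields with sjson, removing the intermediate map[string]any variables
- Sora Client response parsing now iterates with gjson ForEach, reducing memory allocations
- Add about 100 unit test cases; coverage of every changed function is >85%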
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

58912d4a

09 Feb, 2026 5 commits

feat: same-account retry before failover for transient errors · d6c2921f

Edric Li authored Feb 10, 2026

For retryable transient errors (Google 400 "invalid project resource name"
and empty stream responses), retry on the same account up to 2 times
(with 500ms delay) before switching to another account.

- Add RetryableOnSameAccount field to UpstreamFailoverError
- Add same-account retry loop in both Gemini and Claude/OpenAI handler paths
- Move temp-unschedule from service layer to handler layer (only after
  all same-account retries exhausted)
- Reduce temp-unschedule cooldown from 30 minutes to 1 minute

d6c2921f
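
The retry-before-failover behavior described in d6c2921f could look roughly like the loop below. It is a sketch under assumptions: `forwardWithRetry` and `maxSameAccountRetries` are hypothetical names, and only the `RetryableOnSameAccount` field, the limit of 2 retries, and the delay between attempts come from the commit message.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// UpstreamFailoverError is modeled on the type named in the commit; only the
// RetryableOnSameAccount field is taken from the message.
type UpstreamFailoverError struct {
	StatusCode             int
	RetryableOnSameAccount bool
}

func (e *UpstreamFailoverError) Error() string {
	return fmt.Sprintf("upstream failover: status %d", e.StatusCode)
}

const maxSameAccountRetries = 2

// forwardWithRetry calls send once, then retries on the same account up to
// maxSameAccountRetries times while the error is marked retryable.
// A non-nil error tells the caller to fail over to another account.
func forwardWithRetry(send func() error, delay time.Duration) (attempts int, err error) {
	for {
		attempts++
		err = send()
		var fe *UpstreamFailoverError
		if err == nil || !errors.As(err, &fe) || !fe.RetryableOnSameAccount {
			return attempts, err
		}
		if attempts > maxSameAccountRetries {
			return attempts, err // retries exhausted; temp-unschedule would happen here
		}
		time.Sleep(delay)
	}
}

func main() {
	calls := 0
	send := func() error {
		calls++
		if calls < 3 {
			return &UpstreamFailoverError{StatusCode: 400, RetryableOnSameAccount: true}
		}
		return nil
	}
	// The commit uses a 500ms delay; a shorter one keeps the demo quick.
	attempts, err := forwardWithRetry(send, 10*time.Millisecond)
	fmt.Println(attempts, err)
}
```

Keeping the temp-unschedule decision at the point where retries are exhausted matches the commit's note that it moved from the service layer to the handler layer.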

feat: failover and temp-unschedule on Google "Invalid project resource name" 400 · 89905ec4

Edric Li authored Feb 09, 2026

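The Google backend intermittently returns a 400 "Invalid project resource name" error.
Previously this error was passed straight through to the client without triggering an account switch, so the request failed.

- On all forwarding paths of both the Antigravity and Gemini platforms,
  an exact match on this error message triggers failover, which retries on another account automatically
- On a match, the account is temporarily unscheduled for 1 hour to avoid repeatedly scheduling the same faulty account
- Extract the shared functions isGoogleProjectConfigError / tempUnscheduleGoogleConfigError
  to eliminate code duplicated across services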

89905ec4

fix: Gemini error policy check should precede retry logic · a70d37a6
erio authored Feb 09, 2026

a70d37a6

fix: skip rate limiting when custom error codes don't match upstream status · 6892e84a

erio authored Feb 09, 2026

Add ShouldHandleErrorCode guard at the entry of handleGeminiUpstreamError
and AntigravityGatewayService.handleUpstreamError so that accounts with
custom error codes (e.g. [599]) are not rate-limited when the upstream
returns a non-matching status (e.g. 429).

6892e84a
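
The guard added in 6892e84a can be pictured as an early return at the top of the error handler. The sketch below is illustrative only: the `Account` type, the `CustomErrorCodes` field, and the simplified `handleUpstreamError` are assumptions; `ShouldHandleErrorCode` is the name used in the commit message, though its real receiver and signature may differ.

```go
package main

import "fmt"

// Account is hypothetical; an empty CustomErrorCodes list means the
// custom-error-code feature is off for this account.
type Account struct {
	CustomErrorCodes []int
}

// ShouldHandleErrorCode reports whether an upstream status should trigger
// account-level handling such as rate limiting.
func (a Account) ShouldHandleErrorCode(status int) bool {
	if len(a.CustomErrorCodes) == 0 {
		return true // no custom list: handle every error as before
	}
	for _, c := range a.CustomErrorCodes {
		if c == status {
			return true
		}
	}
	return false
}

// handleUpstreamError returns true when the account would be rate-limited.
func handleUpstreamError(a Account, status int) bool {
	if !a.ShouldHandleErrorCode(status) {
		return false // guard at entry: skip rate limiting entirely
	}
	return status == 429
}

func main() {
	custom := Account{CustomErrorCodes: []int{599}}
	plain := Account{}
	fmt.Println(handleUpstreamError(custom, 429), handleUpstreamError(plain, 429))
}
```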

feat: ErrorPolicySkipped returns 500 instead of upstream status code · 73f45574

erio authored Feb 09, 2026

When custom error codes are enabled and the upstream error code is NOT
in the configured list, return HTTP 500 to the client instead of
transparently forwarding the original status code.

Also adds integration test TestCustomErrorCode599 verifying that 429,
500, 503, 401, 403 all return 500 without triggering SetRateLimited
or SetError.

73f45574
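
The status mapping in 73f45574 can be sketched as a small pure function. `clientStatus` is a hypothetical name, and the two branches that return the upstream status unchanged (feature disabled, or code present in the list) are assumptions; the commit only specifies the 500 case.

```go
package main

import (
	"fmt"
	"net/http"
)

// clientStatus (hypothetical) picks the HTTP status returned to the client.
func clientStatus(customCodes []int, upstream int) int {
	if len(customCodes) == 0 {
		return upstream // feature disabled (assumed: status is forwarded)
	}
	for _, c := range customCodes {
		if c == upstream {
			return upstream // in the list: normal error handling applies (assumed)
		}
	}
	// Custom codes are enabled and the upstream status is not listed.
	return http.StatusInternalServerError
}

func main() {
	for _, s := range []int{429, 500, 503, 401, 403} {
		fmt.Println(s, "->", clientStatus([]int{599}, s))
	}
}
```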

08 Feb, 2026 2 commits

feat: integrate CheckErrorPolicy into Gemini error handling paths · a67d9337
erio authored Feb 09, 2026

a67d9337

refactor(upstream): replace upstream account type with apikey, auto-append /antigravity · fb58560d

erio authored Feb 08, 2026

Upstream accounts now use the standard APIKey type instead of a dedicated
upstream type. GetBaseURL() and new GetGeminiBaseURL() automatically append
/antigravity for Antigravity platform APIKey accounts, eliminating the need
for separate upstream forwarding methods.

- Remove ForwardUpstream, ForwardUpstreamGemini, testUpstreamConnection
- Remove upstream branch guards in Forward/ForwardGemini/TestConnection
- Add migration 052 to convert existing upstream accounts to apikey
- Update frontend CreateAccountModal to create apikey type
- Add unit tests for GetBaseURL and GetGeminiBaseURL

fb58560d
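
A minimal sketch of the base-URL behavior described in fb58560d. The `Account` fields and the string values "antigravity" and "apikey" are assumptions, as is the check that avoids appending the suffix twice; only the `GetBaseURL` name and the `/antigravity` suffix come from the commit message.

```go
package main

import (
	"fmt"
	"strings"
)

// Account is a hypothetical, reduced account model.
type Account struct {
	Platform string // assumed values such as "antigravity" or "gemini"
	Type     string // assumed values such as "apikey" or "oauth"
	BaseURL  string
}

// GetBaseURL appends /antigravity for Antigravity APIKey accounts.
func (a Account) GetBaseURL() string {
	base := strings.TrimRight(a.BaseURL, "/")
	if a.Platform == "antigravity" && a.Type == "apikey" &&
		!strings.HasSuffix(base, "/antigravity") { // assumed idempotency check
		return base + "/antigravity"
	}
	return base
}

func main() {
	a := Account{Platform: "antigravity", Type: "apikey", BaseURL: "https://upstream.example.com/"}
	fmt.Println(a.GetBaseURL())
}
```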

07 Feb, 2026 3 commits

fix: restore non-failover error passthrough from 7b156489 · edb09370
erio authored Feb 07, 2026

edb09370

feat(antigravity): comprehensive enhancements - model mapping, rate limiting, scheduling & ops · 5e98445b

erio authored Feb 07, 2026

Key changes:
- Upgrade model mapping: Opus 4.5 → Opus 4.6-thinking with precise matching
- Unified rate limiting: scope-level → model-level with Redis snapshot sync
- Load-balanced scheduling by call count with smart retry mechanism
- Force cache billing support
- Model identity injection in prompts with leak prevention
- Thinking mode auto-handling (max_tokens/budget_tokens fix)
- Frontend: whitelist mode toggle, model mapping validation, status indicators
- Gemini session fallback with Redis Trie O(L) matching
- Ops: enhanced concurrency monitoring, account availability, retry logic
- Migration scripts: 049-051 for model mapping unification

5e98445b

fix: make error passthrough effective for non-failover upstream errors · 7b156489
shaw authored Feb 07, 2026

7b156489

05 Feb, 2026 1 commit

feat: add global error passthrough rules · 39e05a2d

shaw authored Feb 05, 2026

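Let administrators configure how upstream errors are returned to clients:
- Add the ErrorPassthroughRule data model and Ent schema
- Implement a CRUD API for rules (/admin/error-passthrough-rules)
- Support matching by error code and keyword, with any/all match modes
- Support filtering by platform (anthropic/openai/gemini/antigravity)
- Support passing through or customizing the response status code and error message
- Implement two-level caching (Redis + local memory) with multi-instance sync
- Integrate into the gateway_handler error handling flow
- Add frontend admin UI components
- Add unit tests covering the core matching logic

Optimizations:
- Remove the redundant sort in refreshLocalCache (the database already sorts)
- Add a non-empty match condition check to the backend Validate()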

39e05a2d
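
One plausible reading of the any/all match modes in 39e05a2d is sketched below. Everything here is an assumption made for illustration: the `Rule` type and its fields, the interpretation that "all" requires both the code and a keyword to match while "any" accepts either, and case-insensitive substring matching for keywords.

```go
package main

import (
	"fmt"
	"strings"
)

// Rule is a hypothetical, reduced form of an error passthrough rule.
type Rule struct {
	ErrorCodes []int
	Keywords   []string
	MatchMode  string // "any" or "all"
}

func containsInt(xs []int, v int) bool {
	for _, x := range xs {
		if x == v {
			return true
		}
	}
	return false
}

func containsKeyword(keywords []string, body string) bool {
	lower := strings.ToLower(body)
	for _, k := range keywords {
		if strings.Contains(lower, strings.ToLower(k)) {
			return true
		}
	}
	return false
}

// Matches applies the rule to an upstream status code and error body.
func (r Rule) Matches(status int, body string) bool {
	codeOK := containsInt(r.ErrorCodes, status)
	kwOK := containsKeyword(r.Keywords, body)
	switch {
	case len(r.ErrorCodes) == 0 && len(r.Keywords) == 0:
		return false // empty conditions never match
	case len(r.ErrorCodes) == 0:
		return kwOK
	case len(r.Keywords) == 0:
		return codeOK
	case r.MatchMode == "all":
		return codeOK && kwOK
	default:
		return codeOK || kwOK
	}
}

func main() {
	r := Rule{ErrorCodes: []int{429}, Keywords: []string{"quota"}, MatchMode: "all"}
	fmt.Println(r.Matches(429, "Quota exceeded"), r.Matches(429, "overloaded"))
}
```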

02 Feb, 2026 3 commits
- fix(gemini): add thoughtSignature to Gemini tool calls to avoid INVALID_ARGUMENT errors · 03e94f9f
  ianshaw authored Feb 03, 2026
  
  03e94f9f
- merge upstream main · 0170d19f
  song authored Feb 02, 2026
  
  0170d19f
- fix(billing): fix cached-token accounting for the Gemini API · 4bfeeecb
  liuxiongfeng authored Feb 02, 2026
```
extractGeminiUsage did not extract cachedContentTokenCount,
so cache-read tokens were always 0 at billing time.

Fix:
- Extract usageMetadata.cachedContentTokenCount
- Set the CacheReadInputTokens field
- Subtract cached tokens from InputTokens (consistent with the response_transformer logic)
```
  4bfeeecb
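
The arithmetic behind 4bfeeecb can be sketched as follows. The `Usage` struct, the function signature, and the clamp at zero are assumptions made for illustration; the function name, the `cachedContentTokenCount` key, and the `CacheReadInputTokens` field come from the commit message.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Usage is a hypothetical billing record.
type Usage struct {
	InputTokens          int
	OutputTokens         int
	CacheReadInputTokens int
}

// extractGeminiUsage reads usageMetadata and separates cached input tokens
// from regular input tokens so they can be billed at different rates.
func extractGeminiUsage(raw []byte) (Usage, error) {
	var resp struct {
		UsageMetadata struct {
			PromptTokenCount        int `json:"promptTokenCount"`
			CachedContentTokenCount int `json:"cachedContentTokenCount"`
			CandidatesTokenCount    int `json:"candidatesTokenCount"`
		} `json:"usageMetadata"`
	}
	if err := json.Unmarshal(raw, &resp); err != nil {
		return Usage{}, err
	}
	m := resp.UsageMetadata
	input := m.PromptTokenCount - m.CachedContentTokenCount
	if input < 0 {
		input = 0 // defensive clamp (assumption, not from the commit)
	}
	return Usage{
		InputTokens:          input,
		OutputTokens:         m.CandidatesTokenCount,
		CacheReadInputTokens: m.CachedContentTokenCount,
	}, nil
}

func main() {
	raw := []byte(`{"usageMetadata":{"promptTokenCount":100,"cachedContentTokenCount":40,"candidatesTokenCount":5}}`)
	u, err := extractGeminiUsage(raw)
	fmt.Printf("%+v %v\n", u, err)
}
```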
29 Jan, 2026 1 commit

fix(gateway): filter out messages with empty parts from Gemini requests · 7ade9baa

song authored Jan 29, 2026

The Gemini API does not accept messages with empty parts in the contents array and returns a 400 INVALID_ARGUMENT error.
Add filterEmptyPartsFromGeminiRequest to filter out such messages before forwarding.

Scope: ForwardGemini (antigravity) and ForwardNative (gemini)

7ade9baa
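
The filtering step from 7ade9baa might look like the sketch below. The request types and the function signature are assumptions (the real function may well operate on raw JSON); only the function name and the rule "drop messages whose parts are empty" come from the commit message.

```go
package main

import "fmt"

// Part, Content, and GeminiRequest are reduced, hypothetical request types.
type Part struct {
	Text string `json:"text,omitempty"`
}

type Content struct {
	Role  string `json:"role"`
	Parts []Part `json:"parts"`
}

type GeminiRequest struct {
	Contents []Content `json:"contents"`
}

// filterEmptyPartsFromGeminiRequest drops messages with no parts, which the
// Gemini API would otherwise reject with 400 INVALID_ARGUMENT.
func filterEmptyPartsFromGeminiRequest(req GeminiRequest) GeminiRequest {
	kept := make([]Content, 0, len(req.Contents))
	for _, c := range req.Contents {
		if len(c.Parts) > 0 {
			kept = append(kept, c)
		}
	}
	req.Contents = kept
	return req
}

func main() {
	req := GeminiRequest{Contents: []Content{
		{Role: "user", Parts: []Part{{Text: "hi"}}},
		{Role: "model"}, // empty parts: removed before forwarding
		{Role: "user", Parts: []Part{{Text: "still there?"}}},
	}}
	fmt.Println(len(filterEmptyPartsFromGeminiRequest(req).Contents))
}
```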

26 Jan, 2026 2 commits

feat(gemini): add image billing support for the native Gemini platform · 7cea6b6f

song authored Jan 26, 2026

Align with the Antigravity platform's image billing logic:
- Add extractImageSize() to extract the image size
- Forward() and ForwardNative() return ImageCount/ImageSize
- Support per-group custom image prices and multipliers

7cea6b6f

feat(gemini): add image billing support for the native Gemini platform · 0059a232

song authored Jan 26, 2026

Align with the Antigravity platform's image billing logic:
- Add extractImageSize() to extract the image size
- Forward() and ForwardNative() return ImageCount/ImageSize
- Support per-group custom image prices and multipliers

0059a232

23 Jan, 2026 1 commit

fix(gateway): aggregate all text chunks in non-streaming Gemini responses · 909b8a8f

lynoot authored Jan 23, 2026

Previously, collectGeminiSSE() only returned the last chunk received
from the upstream streaming response when converting to non-streaming.
This caused incomplete responses where only the final text fragment
was returned to clients.

For example, a request asking to "count from 1 to 10" would only
return "\n" (the last chunk) instead of "1\n2\n3\n...\n10\n".

This was especially problematic for JSON structured output where
the opening brace "{" from the first chunk was lost, resulting
in invalid JSON like: colors": ["red", "blue"]}

The fix:
- Collect all text parts from each SSE chunk into a slice
- Merge all collected text parts into the final response
- Reuse the same pattern as handleGeminiStreamToNonStreaming
  in antigravity_gateway_service.go

Fixes: non-streaming responses returning incomplete text
Fixes: structured output (JSON schema) returning invalid JSON

909b8a8f

20 Jan, 2026 2 commits

fix(scheduling): improve sticky-session cleanup and account scheduling refresh · 91f01309

yangjianbo authored Jan 20, 2026

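- Update/BulkUpdate trigger a cache refresh based on the fields that make an account unschedulable
- GatewayCache supports clearing session keys under multiple prefixes
- Model routing and hybrid scheduling handle sticky sessions better
- Add test coverage for scheduling and caching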

91f01309

fix(scheduling): improve sticky-session cleanup and account scheduling refresh · 7a83db61

yangjianbo authored Jan 20, 2026

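- Update/BulkUpdate trigger a cache refresh based on the fields that make an account unschedulable
- GatewayCache supports clearing session keys under multiple prefixes
- Model routing and hybrid scheduling handle sticky sessions better
- Add test coverage for scheduling and caching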

7a83db61

15 Jan, 2026 1 commit
- feat: merge dev · 90bce60b
  yangjianbo authored Jan 15, 2026
  
  90bce60b
14 Jan, 2026 2 commits
- refactor(ops): complete the gateway service's ops integration · 63711067
  IanShaw027 authored Jan 14, 2026
  
  63711067
- refactor(ops): update the gateway service to integrate ops features · 060699c3
  IanShaw027 authored Jan 14, 2026
  
  060699c3
12 Jan, 2026 1 commit

feat(scheduler): introduce a scheduling snapshot cache and outbox replay · 3141aa51

yangjianbo authored Jan 12, 2026

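- The scheduling hot path reads the Redis snapshot first and preserves group ordering semantics
- Outbox replay plus a full rebuild for correction; on failure, retry without advancing the watermark
- Automatically align the Atlas baseline and sync the scheduling config example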

3141aa51

11 Jan, 2026 1 commit

feat(ops): implement recording and querying of upstream error events · 7ebca553

IanShaw027 authored Jan 11, 2026

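**New features**:
- Create the ops_upstream_error_events table to store details of upstream service errors
- Support recording detailed context for upstream 429/529/5xx errors
- Provide an API for querying upstream error events by time range

**Backend changes**:
1. Model layer (ops_models.go, ops_port.go):
   - Add the UpstreamErrorEvent struct
   - Extend the Repository interface to support CRUD for upstream error events

2. Repository layer (ops_repo.go):
   - Implement InsertUpstreamErrorEvent to write upstream errors
   - Implement GetUpstreamErrorEvents to query by time range

3. Service layer (ops_service.go, ops_upstream_context.go):
   - ops_service: add the GetUpstreamErrorEvents query method
   - ops_upstream_context: encapsulate the logic for building upstream error context

4. Handler layer (ops_error_logger.go):
   - Add GetUpstreamErrorsHandler to handle upstream error query requests

5. Gateway layer integration:
   - antigravity_gateway_service.go: record an upstream event on 429/529 errors
   - gateway_service.go: record on OpenAI 429/5xx errors
   - gemini_messages_compat_service.go: record on Gemini 429/5xx errors
   - openai_gateway_service.go: record on OpenAI 429/5xx errors
   - ratelimit_service.go: record on 429 rate-limit errors

**Recorded fields**:
- request_id: links to the main ops_logs record
- platform/model: upstream service identifier
- status_code/error_message: error details
- request_headers/response_body: debugging information (optional)
- created_at: time the error occurred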

7ebca553

09 Jan, 2026 4 commits

fix(groups): prevent fallback cycles and validate the context group · 2597fe78

yangjianbo authored Jan 10, 2026

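- Add cycle detection on the fallback chain and reject configurations that contain a cycle

- Reuse only a valid group context, falling back to a query when necessary

- Document the lightweight semantics of GetByIDLite and add tests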

2597fe78
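
Cycle detection on a fallback chain, as mentioned in 2597fe78, can be done with a visited set while walking the chain. This is a generic sketch, not the repository's implementation: `hasFallbackCycle` and the map-based representation of fallback links are assumptions.

```go
package main

import "fmt"

// hasFallbackCycle walks the fallback chain that starts at start.
// fallback maps a group ID to its fallback group ID; a missing key means
// the group has no fallback.
func hasFallbackCycle(fallback map[int64]int64, start int64) bool {
	seen := map[int64]bool{start: true}
	cur := start
	for {
		next, ok := fallback[cur]
		if !ok {
			return false // chain ends without revisiting a group
		}
		if seen[next] {
			return true // next was already visited: the chain loops
		}
		seen[next] = true
		cur = next
	}
}

func main() {
	fmt.Println(hasFallbackCycle(map[int64]int64{1: 2, 2: 3}, 1)) // linear chain
	fmt.Println(hasFallbackCycle(map[int64]int64{1: 2, 2: 1}, 1)) // 1 -> 2 -> 1
}
```

A configuration change would be rejected when the check reports a cycle for the group being saved.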

perf(gateway): reuse the group context to reduce hot-path queries · 67554324

yangjianbo authored Jan 09, 2026

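Add GetByIDLite and reuse the context group in the gateway and Gemini selection flows to avoid triggering COUNT
Update the API key middleware to inject the group context, reducing repeated database lookups
Add regression tests for the gateway/gemini middleware and the repository layer

Test: make test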

67554324

feat: antigravity quota-domain rate limiting + SSE limit (#222) · 7d1fe818

Song Siyu authored Jan 09, 2026

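* fix: add gemini-3-flash prefix mapping to support gemini-3-flash-preview

* feat(antigravity): enhance request parameters and inject the Antigravity identity system prompt

* feat: antigravity quota-domain rate limiting

* chore: adjust the SSE per-line limit to 25MB

* chore: raise the SSE per-line limit to 40MB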

7d1fe818

feat: antigravity quota-domain rate limiting · da1f3d61
song authored Jan 09, 2026

da1f3d61

08 Jan, 2026 1 commit

feat(groups): add Claude Code client restriction and session isolation · a4210588

Edric Li authored Jan 08, 2026

- Add claude_code_only field to restrict groups to Claude Code clients only
- Add fallback_group_id for non-Claude Code requests to use alternate group
- Implement ClaudeCodeValidator for User-Agent detection
- Add group-level session binding isolation (groupID in Redis key)
- Prevent cross-group sticky session pollution
- Update frontend with Claude Code restriction controls

a4210588
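
The group restriction and session isolation in a4210588 might be sketched like this. The `Group` struct, `resolveGroupID`, `stickySessionKey`, and the `sticky_session:` key prefix are all hypothetical; the commit only states that a `claude_code_only` flag and a `fallback_group_id` exist and that the group ID becomes part of the Redis key.

```go
package main

import "fmt"

// Group is a hypothetical, reduced group model.
type Group struct {
	ID              int64
	ClaudeCodeOnly  bool   // mirrors claude_code_only
	FallbackGroupID *int64 // mirrors fallback_group_id
}

// resolveGroupID returns the group that should serve the request, or false
// when a restricted group has no fallback for non-Claude Code clients.
func resolveGroupID(g Group, isClaudeCode bool) (int64, bool) {
	if !g.ClaudeCodeOnly || isClaudeCode {
		return g.ID, true
	}
	if g.FallbackGroupID != nil {
		return *g.FallbackGroupID, true
	}
	return 0, false
}

// stickySessionKey includes the group ID so that identical session hashes
// in different groups never share a binding.
func stickySessionKey(groupID int64, sessionHash string) string {
	return fmt.Sprintf("sticky_session:%d:%s", groupID, sessionHash)
}

func main() {
	fb := int64(2)
	g := Group{ID: 1, ClaudeCodeOnly: true, FallbackGroupID: &fb}
	id, ok := resolveGroupID(g, false)
	fmt.Println(id, ok, stickySessionKey(id, "abc"))
}
```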

05 Jan, 2026 2 commits

fix(security): keep minimal validation and a default allowlist when the allowlist is disabled · 048ed061

yangjianbo authored Jan 05, 2026

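Implement allow_insecure_http and perform minimal format validation when validation is disabled
- With the allowlist disabled, require the URL to be parseable and its scheme to be valid
- With response-header filtering disabled, use the default allowlist policy
- Update the related docs, examples, and test coverage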

048ed061
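
The "minimal validation" in 048ed061 (URL must parse and use an acceptable scheme) could be approximated with the standard library as below. `validateUpstreamURL` is a hypothetical name, and the host check and the exact set of accepted schemes are assumptions; `allow_insecure_http` is the setting named in the commit.

```go
package main

import (
	"errors"
	"fmt"
	"net/url"
)

// validateUpstreamURL performs the minimal checks applied when the URL
// allowlist is disabled: the URL must parse, name a host, and use https
// (or http when allowInsecureHTTP is set).
func validateUpstreamURL(raw string, allowInsecureHTTP bool) error {
	u, err := url.Parse(raw)
	if err != nil {
		return fmt.Errorf("unparseable URL: %w", err)
	}
	if u.Host == "" {
		return errors.New("URL has no host")
	}
	switch u.Scheme {
	case "https":
		return nil
	case "http":
		if allowInsecureHTTP {
			return nil
		}
		return errors.New("http is rejected unless allow_insecure_http is enabled")
	default:
		return fmt.Errorf("unsupported scheme %q", u.Scheme)
	}
}

func main() {
	fmt.Println(validateUpstreamURL("https://api.example.com", false))
	fmt.Println(validateUpstreamURL("http://api.example.com", false))
	fmt.Println(validateUpstreamURL("http://api.example.com", true))
}
```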

feat(security): add security switches and improve the test workflow · 794a9f96

yangjianbo authored Jan 05, 2026

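Implement security switches that default to off, along with response-header passthrough logic
- URL validation and response-header filtering can be toggled, and the streaming path is covered
- Non-streaming Content-Type passthrough/default value follows the configuration
- Wire up go test, golangci-lint, and frontend lint/typecheck
- Add the related tests and config/documentation notes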

794a9f96

04 Jan, 2026 1 commit

fix(backend): improve thinking/tool block signature handling and retry strategy · 87426e5d

IanShaw027 authored Jan 04, 2026

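Main changes:
- request_transformer: when a thinking block lacks a signature, downgrade it to text instead of dropping it, preserving the content and disabling thinking mode at the upper layer
- antigravity_gateway_service: add a two-stage downgrade strategy: handle thinking blocks first and, if the request still fails with a tool signature error, downgrade tool blocks as well
- gateway_request: add FilterSignatureSensitiveBlocksForRetry, which can downgrade tool_use/tool_result to text
- gateway_request: improve FilterThinkingBlocksForRetry to disable the top-level thinking config and avoid structural constraint conflicts
- gateway_service: implement conservative two-stage retry logic that preserves content first and downgrades tool calls only when necessary
- Add antigravity_gateway_service_test.go to test the signature block stripping logic
- Update the related test cases to verify the downgrade behavior

This fix resolves request failures caused by invalidated signatures on historical messages when switching across platforms or accounts.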

87426e5d