sub2api

Author	SHA1	Message	Date
Pluviobyte	ed2aac25a6	fix(billing): apply long-context multiplier to cache_creation price Follow-up to #2816 (already merged): the same long-context pricing exemption that affected cache_read also applies to all three cache_creation price fields (standard, 5m ephemeral, 1h ephemeral). computeCacheCreationCost reads these prices directly from pricing and never sees the LongContextInputMultiplier that computeTokenBreakdown applies to inputPrice / outputPrice / cacheReadPrice. For GPT-5.4 / 5.5 above the 272k threshold, this causes the cache_write portion of long sessions to be billed at roughly half what it should be (default multiplier 2.0). Cache writes are conceptually input-side operations and should share the same long-context treatment as input / cache_read. This patch threads an explicit multiplier into computeCacheCreationCost so the function can be unit-tested in isolation and matches the existing pattern used for cache_read. computeTokenBreakdown captures the long context decision once and passes LongContextInputMultiplier when it applies, 1.0 otherwise. Adds three regression tests mirroring the #2816 cache_read tests: - positive: long-context triggered -> cache_creation scaled by 2.0x - negative: below threshold -> cache_creation stays at base price - breakdown: 5m + 1h ephemeral prices both scaled when applicable Refs #2816 Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-27 09:59:58 +00:00
Wesley Liddick	b0142146af	Merge pull request #2816 from Pluviobyte/fix/long-context-cache-read-multiplier fix(billing): apply long-context multiplier to cache_read price (#2293)	2026-05-27 15:59:11 +08:00
Wesley Liddick	2387cf9934	Merge pull request #2799 from siyuan-123/fix/ws-rate-limit-failover Fix OpenAI WS not switching accounts automatically when rate-limited	2026-05-27 15:14:28 +08:00
SlientRainyDay	b9509e823a	fix(billing): apply long-context multiplier to cache_read price When session long-context pricing is triggered in computeTokenBreakdown (e.g. GPT-5.4 / GPT-5.5 above the 272k token threshold), the multiplier was only being applied to InputPricePerToken and OutputPricePerToken. The cache_read price was left at its base value, so CacheReadCost was silently undercharged whenever a long-context session also had cache hits — which is essentially every long Codex / Claude Code session. Concretely for gpt-5.4 with 300k cache_read tokens, the bug under-billed the request by exactly 1x the LongContextInputMultiplier on the cache portion (e.g. 0.075 instead of 0.150 in the regression test). Cache reads are conceptually input-side replays, so they should scale with LongContextInputMultiplier, matching the treatment of InputPricePerToken. Adds two regression tests: - positive: long-context triggered -> cache_read scaled by 2.0x - negative: below threshold -> cache_read stays at base price Fixes #2293 Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-27 07:09:28 +00:00
shaw	f7ac5e5931	fix(openai): preserve chat responses usage billing	2026-05-26 21:33:28 +08:00
Wesley Liddick	4b9b63443f	Merge pull request #2790 from Arron196/from-arron-main Fix Ops SLA statistics for local limit errors	2026-05-26 20:21:11 +08:00
siyuan	08061717b8	fix: enable account failover for OpenAI WS rate limits	2026-05-26 20:07:00 +08:00
Wesley Liddick	4a5c5367cf	Merge pull request #2796 from DaydreamCoding/fix/account-reauth-keep-extra fix(account): re-authorization no longer clears the Extra config	2026-05-26 20:06:48 +08:00
Wesley Liddick	b9f421d647	Merge pull request #2751 from wucm667/fix/bedrock-strip-context-management-when-beta-removed fix(bedrock): v0.1.130 regression: also strip the context_management field when the beta token is removed	2026-05-26 20:05:43 +08:00
DaydreamCoding	11fe7de926	fix(account): re-authorization no longer clears the Extra config. When a Claude / OpenAI account is re-authorized through the generic PUT /accounts/:id, the backend UpdateAccount overwrites account.Extra in full (keeping only the 5 quota usage keys), so persisted settings such as base_rpm / window_cost_limit / window_cost_sticky_reserve / max_sessions / quota_* / privacy_mode are all lost. Adds a dedicated endpoint, POST /accounts/:id/apply-oauth-credentials, that follows the existing /refresh path pattern: credentials-only update + key-level merge of the Extra JSONB (UpdateAccountExtra) + ClearError + InvalidateToken. Scope: the three call sites for Claude OAuth / Claude Cookie auth / OpenAI OAuth. The existing Gemini / Antigravity paths never sent extra and are unchanged. Also fixes: the old re-authorization path did not call InvalidateToken, so the first request after re-authorization could still use the old cached token and immediately get a 401. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 19:46:08 +08:00
benjamin	03ae510c68	fix(ops): exclude count-tokens from metrics errors Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-26 17:21:56 +08:00
benjamin	9c56fe0b0b	fix(openai): mark fast-policy entrypoints business-limited Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-26 17:21:45 +08:00
benjamin	5d7df678b1	fix(openai): mark local gateway denials business-limited Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-26 17:19:50 +08:00
benjamin	47fe90eab4	fix(antigravity): mark whitelist denials business-limited Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-26 17:19:37 +08:00
benjamin	5c4101ac53	feat(ops): add local business limit reasons Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-26 17:18:27 +08:00
Wesley Liddick	bebc082306	Merge pull request #2766 from DaydreamCoding/feat/user-platform-quota feat(quota): user × platform USD quota	2026-05-26 14:13:18 +08:00
Wesley Liddick	83248478e2	Merge pull request #2777 from lyen1688/feat/content-moderation-risk-threshold feat: support a configurable risk threshold for content moderation	2026-05-26 14:12:54 +08:00
lyen1688	23f3d426c6	feat: support a configurable risk threshold for content moderation	2026-05-26 13:58:02 +08:00
mt21625457	33ac8eb27d	fix openai http2 response header timeout	2026-05-26 13:57:59 +08:00
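DaydreamCoding	6b39b344d8	feat(quota): user × platform USD quota. Provides USD quota control for users on the four platforms anthropic/openai/gemini/antigravity across three windows (daily/weekly/monthly). Quota semantics: unset = unlimited, 0 = disabled, >0 = USD cap. Two-layer model: - Config layer: a system default quota plus default quotas for the seven auth sources email/linuxdo/oidc/wechat/github/google/dingtalk, stored in settings and read and written as whole nested JSON values (1 key for the system + 1 key per source), with whole-value replacement semantics. - Runtime layer: the user_platform_quota table records each user's actual quota, decoupled from the config layer. Backend: adds the ent schema and the 140_user_platform_quotas.sql migration, repository and service ports, billing-pipeline integration, and admin-side and user-side read/write endpoints. Frontend: quota editing on the admin settings page, a user quota management modal, display on the user Dashboard, and Chinese and English copy. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 10:49:20 +08:00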
wucm667	a9c7a3a095	fix(bedrock): strip context_management when beta is removed	2026-05-25 14:15:39 +08:00
siyuan	fc66cd704a	fix: recognize codex tool outputs in ws continuation	2026-05-25 10:46:58 +08:00
Wesley Liddick	3c5a444802	Merge pull request #2698 from deqiying/fix/log-real-client-ip fix: fix inaccurate client IP in rejection logs under reverse-proxy deployments	2026-05-23 11:08:47 +08:00
shaw	1e406fed52	fix: optimize OpenAI account cooldown scheduling	2026-05-23 10:18:43 +08:00
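deqiying	0af44ce4c2	fix: fix inaccurate client IP in rejection logs under reverse-proxy deployments. Changes request_client_ip in the OpenAI codex_cli_only rejection diagnostic log to reuse ip.GetClientIP, consistent with the real-client-IP resolution used by usage records and the access log. Keeps request_remote_addr for troubleshooting the underlying Docker/reverse-proxy peer address, and adds unit tests covering the case where the proxy headers and remote addr differ.	2026-05-22 23:28:21 +08:00
Wesley Liddick	f59d9a5f8e	Merge pull request #2674 from wucm667/feat/moderation-per-model-toggle feat(risk-control): content moderation can be enabled per model	2026-05-22 20:10:38 +08:00
Wesley Liddick	301032dc72	Merge pull request #2672 from wucm667/feat/email-whitelist-wildcard-suffix feat(registration): email whitelist supports wildcard suffix matching (*.edu.cn)	2026-05-22 17:33:29 +08:00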
Wesley Liddick	9f91a8af17	Merge pull request #2662 from touwaeriol/feat/bedrock-cc-compat feat(bedrock): add Claude Code compatibility for AWS Bedrock	2026-05-22 17:32:11 +08:00
Wesley Liddick	a33a294970	Merge pull request #2658 from wucm667/feat/account-test-chat-completions-path feat(account): connection test supports the OpenAI-compatible Chat Completions path	2026-05-22 17:31:14 +08:00
wucm667	199a5bcc69	fix(risk-control): deduplicate repeated moderation of the same user message in Agent tool loops. Trailing-role check approach: when the last item of the messages / input / contents array is not a user message (but assistant, tool / function_call_output, etc.), skip content moderation entirely, so that the same user input is not repeatedly moderated, billed, and logged during an Agent tool loop. Fixes #2678	2026-05-22 14:54:06 +08:00
wucm667	0d5c6f7cc7	feat(risk-control): content moderation can be enabled per model	2026-05-21 21:18:43 +08:00
wucm667	a5b9b68b76	feat(registration): support wildcard suffixes in the email whitelist	2026-05-21 21:02:26 +08:00
wucm667	ca60cede14	feat(account): support the Chat Completions path in connection tests	2026-05-21 16:37:20 +08:00
Wesley Liddick	35901a174b	Merge pull request #2655 from ye4241/feat/oidc-trust-verified-email-fast-path feat(oidc): skip the choice page and log in / register directly when the upstream email is verified	2026-05-21 14:47:08 +08:00
shaw	a613a587ba	feat: add subscription expiry email toggle	2026-05-21 14:27:50 +08:00
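ye4241	39fe7aa0eb	feat(oidc): skip the choice page and log in / register directly when the upstream email is verified. Currently, a first OIDC login unconditionally creates a pending session with choose_account_action_required; even with force_email_on_third_party_signup disabled, the frontend still forces the "create account / bind existing account" choice screen and shows the internal synthetic email (oidc-xxx@oidc-connect.invalid), which is a poor user experience. This change reuses the existing LoginOrRegisterVerifiedEmailOAuth path (previously used only by github/google) and, when all of the following conditions hold, skips the choice page and trusts the upstream identity to complete registration/login: - force_email_on_third_party_signup = false - invitation-code mode is not enabled - upstream declares email_verified = true and compat_email is non-empty - no local account with the same email exists. On failure (e.g. the email suffix is not whitelisted, or registration is closed) it automatically falls back to the existing choice flow, so behavior is fully backward compatible. Test coverage: - TestTryOIDCVerifiedEmailFastPathCreatesUserAndIdentity - TestTryOIDCVerifiedEmailFastPathSkippedWhenInvitationCodeRequired - TestTryOIDCVerifiedEmailFastPathSkippedWhenForceEmailEnabled	2026-05-21 13:32:20 +08:00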
erio	fe1c6c958b	feat(bedrock): add Claude Code compatibility for AWS Bedrock - Export ApplyBedrockCCCompat() in GatewayService, called after channel model mapping to ensure mapped model ID is used for Opus 4.7+ detection - Add sanitizeBedrockCCFields(): remove service_tier/interface_geo/ context_management, inject max_tokens/anthropic_version defaults - Add sanitizeBedrockCCBetaTokens(): filter anthropic_beta to keep only Bedrock-supported tokens, reusing autoInjectBedrockBetaTokens and filterBedrockBetaTokens for consistent rules - Remove unsupported beta tokens (interleaved-thinking, context-management) from whitelist based on AWS official docs - Simplify IsBedrockCCCompatEnabled() to check boolean toggle directly, applying CC compat to all accounts regardless of platform - Add unit tests for IsBedrockCCCompatEnabled (8 cases), sanitizeBedrockCCFields (8 cases), sanitizeBedrockCCBetaTokens (7 cases) - Update bedrock beta policy tests for removed auto-injection	2026-05-21 11:46:24 +08:00
Wesley Liddick	bd3d4d9a24	Merge pull request #2399 from gaoren002/fix/openai-image-upstream-errors fix(openai): surface image moderation errors	2026-05-21 11:31:22 +08:00
Wesley Liddick	131d4b3050	Merge pull request #2374 from gaoren002/fix/openai-refresh-token-reused fix: mark reused refresh tokens non-retryable and unschedule errored accounts	2026-05-21 11:30:52 +08:00
Wesley Liddick	eda04c6129	Merge pull request #2615 from wucm667/feat/redeem-code-batch-update feat(redeem): redeem codes support batch editing	2026-05-21 10:39:46 +08:00
Wesley Liddick	d3c4e50753	Merge pull request #2645 from lyen1688/fix/trusted-forwarded-ip-acl PR: add configurable reverse-proxy real-IP detection for API Key IP allow/deny lists	2026-05-21 10:34:28 +08:00
lyen1688	1d2445ff52	Fix the CI check for the API Key ACL toggle	2026-05-20 23:51:39 +08:00
lyen1688	08c8c67df7	Add a reverse-proxy real-IP toggle for API Key ACL	2026-05-20 22:51:46 +08:00
Wesley Liddick	e5d6f1727f	Merge pull request #2641 from Arron196/fix/channel-monitor-responses-reasoning fix(channel-monitor): support Responses reasoning output	2026-05-20 22:36:46 +08:00
erio	4fd21994c5	feat(bedrock): add Claude Code compatibility transformations for Bedrock accounts Add channel-level Bedrock CC compatibility toggle (similar to web_search_emulation) that fixes 4 types of Bedrock 400 errors seen with Claude Code: 1. thinking.type "enabled" → "adaptive" for Opus 4.7+ (only supports adaptive) 2. Add default budget_tokens when missing for older models 3. Replace illegal characters in tool_use IDs to match Bedrock's ^[a-zA-Z0-9_-]+$ pattern 4. anthropic_version / invalid beta flag (already handled elsewhere) Transformations run in Forward() before any forwarding path, so both native Bedrock accounts and apikey passthrough accounts pointing to Bedrock relays benefit. Includes channel-level toggle UI and unit tests.	2026-05-20 21:47:38 +08:00
benjamin	d3d5843b9d	fix(channel-monitor): support Responses reasoning output Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-20 21:19:06 +08:00
name	8211aa7066	fix: retry on "thinking block must contain thinking" upstream error Some clients reuse assistant history from other models when switching to claude with extended thinking enabled. If a prior thinking block lacks the thinking text field, upstream returns: messages.X.content.Y.thinking: each thinking block must contain thinking Add this pattern to isThinkingBlockSignatureError so the existing FilterThinkingBlocksForRetry retry path triggers and rewrites/drops the offending blocks.	2026-05-20 18:46:50 +08:00
gaoren002	49b415e333	fix: mark reused refresh tokens non-retryable	2026-05-20 09:24:51 +00:00
gaoren002	888cd8092d	fix(openai): surface image moderation errors	2026-05-20 09:19:20 +00:00
Wesley Liddick	51f72186a5	Merge pull request #2613 from wucm667/feat/api-key-usage-daily-detail feat(usage): user API Key usage page supports a per-day breakdown	2026-05-20 16:55:42 +08:00
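
The long-context billing fixes in b9509e823a and ed2aac25a6 both come down to scaling cache prices by the same multiplier used for input tokens. The following is a minimal sketch of that idea, not the repository's code: the `Pricing` struct, its field names, and the helper functions are hypothetical. The base cache_read price of 0.25e-6 USD per token is an assumption, back-derived from the 0.075 vs 0.150 figures quoted for 300k tokens. The 272k threshold and the 2.0 multiplier come from the commit messages.

```go
package main

import "fmt"

// Pricing is a hypothetical stand-in for the project's pricing record.
type Pricing struct {
	CacheReadPricePerToken     float64 // assumed base price, USD per token
	LongContextThreshold       int     // tokens; 272k per the commit messages
	LongContextInputMultiplier float64 // default 2.0 per the commit messages
}

// inputSideMultiplier makes the long-context decision once: it returns the
// long-context multiplier above the threshold and 1.0 otherwise.
func inputSideMultiplier(p Pricing, sessionTokens int) float64 {
	if p.LongContextThreshold > 0 && sessionTokens > p.LongContextThreshold {
		return p.LongContextInputMultiplier
	}
	return 1.0
}

// cacheCost takes the multiplier as an explicit argument so that it can be
// tested in isolation, mirroring the approach the commit describes.
func cacheCost(tokens int, pricePerToken, multiplier float64) float64 {
	return float64(tokens) * pricePerToken * multiplier
}

func main() {
	p := Pricing{
		CacheReadPricePerToken:     0.25e-6,
		LongContextThreshold:       272_000,
		LongContextInputMultiplier: 2.0,
	}
	above := inputSideMultiplier(p, 300_000)
	below := inputSideMultiplier(p, 100_000)
	fmt.Printf("above threshold: %.3f\n", cacheCost(300_000, p.CacheReadPricePerToken, above))
	fmt.Printf("below threshold: %.3f\n", cacheCost(300_000, p.CacheReadPricePerToken, below))
}
```

Passing the multiplier in, rather than reading it inside the cost function, keeps the threshold decision in one place and lets the cache_read and cache_creation paths share it.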

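
The quota semantics stated in 6b39b344d8 (unset = unlimited, 0 = disabled, >0 = USD cap) map naturally onto a nullable value. Below is a small sketch under that assumption; `quotaAllows` is a hypothetical helper, not a function from the repository. Whether usage exactly equal to the cap is rejected, and how a negative limit is treated, are not specified in the commit message and are choices made here for illustration.

```go
package main

import "fmt"

// quotaAllows reports whether another request may proceed in the current
// window. A nil limit means no quota is configured (unlimited), a zero limit
// disables the platform, and a positive limit is a USD cap.
func quotaAllows(limitUSD *float64, usedUSD float64) bool {
	if limitUSD == nil {
		return true // unset: unlimited
	}
	if *limitUSD <= 0 {
		return false // 0: disabled (negative values treated the same; an assumption)
	}
	return usedUSD < *limitUSD // assumption: reaching the cap blocks further use
}

func main() {
	limit := 10.0
	disabled := 0.0
	fmt.Println(quotaAllows(nil, 123.4))     // true
	fmt.Println(quotaAllows(&disabled, 0))   // false
	fmt.Println(quotaAllows(&limit, 9.99))   // true
	fmt.Println(quotaAllows(&limit, 10.0))   // false
}
```

Using a pointer keeps "unset" distinct from "0", which a plain float64 cannot express.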

1875 Commits
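
Commit 4fd21994c5 mentions replacing illegal characters in tool_use IDs so they match Bedrock's `^[a-zA-Z0-9_-]+$` pattern. One way to sketch that transformation is shown below. The function name `sanitizeToolUseID` and the choice of `_` as the replacement character are assumptions, and the sketch does not handle an empty ID.

```go
package main

import (
	"fmt"
	"regexp"
)

// illegalToolIDChars matches any character outside the set allowed by the
// ^[a-zA-Z0-9_-]+$ pattern quoted in the commit message.
var illegalToolIDChars = regexp.MustCompile(`[^a-zA-Z0-9_-]`)

// sanitizeToolUseID replaces each disallowed character with an underscore.
// The replacement character is an illustrative choice, not taken from the source.
func sanitizeToolUseID(id string) string {
	return illegalToolIDChars.ReplaceAllString(id, "_")
}

func main() {
	fmt.Println(sanitizeToolUseID("toolu.01:abc/def")) // toolu_01_abc_def
	fmt.Println(sanitizeToolUseID("call_ok-123"))      // call_ok-123
}
```

Because the same ID appears in both a tool_use block and its matching tool_result, any such rewrite has to be applied consistently to both sides; a deterministic character replacement like this one keeps the pair in sync.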