sub2api

Author	SHA1	Message	Date
KnowSky404	a161f9d045	feat: align OpenAI bulk edit compact settings	2026-04-27 18:15:23 +08:00
KnowSky404	c5a1a82223	test: cover missing OpenAI bulk edit fields	2026-04-27 18:13:14 +08:00
KnowSky404	2ab6b34fd1	feat: add filtered-result account bulk edit	2026-04-27 18:12:24 +08:00
KnowSky404	764afbe37a	test: cover account bulk edit target scopes	2026-04-27 18:08:22 +08:00
KnowSky404	25c7b0d9f4	feat: support filter-target account bulk update	2026-04-27 17:59:49 +08:00
KnowSky404	f422ac6dcc	test: cover filter-target account bulk update	2026-04-27 17:32:34 +08:00
KnowSky404	54de4e008c	docs: add account bulk edit implementation plan	2026-04-27 17:26:57 +08:00
KnowSky404	65c27d2c69	docs: add account bulk edit scope design	2026-04-27 17:21:11 +08:00
hansnow	53f919f8f0	fix(api-key): reset rate limit usage cache	2026-04-27 16:47:44 +08:00
Wesley Liddick	c92b88e34a	Merge pull request #1996 from Cloud370/fix/claude-code-read-empty-pages fix(anthropic): drop empty Read.pages in responses-to-anthropic tool input	2026-04-27 08:47:13 +08:00
Wesley Liddick	ed0c85a17e	Merge pull request #2006 from gaoren002/pr/openai-images-explicit-session fix(openai): avoid implicit image sticky sessions	2026-04-27 08:43:40 +08:00
gaoren002	9fe02bba7e	fix(openai): strip unsupported passthrough fields	2026-04-27 00:39:06 +00:00
gaoren002	615557ec20	fix(openai): avoid implicit image sticky sessions	2026-04-26 17:09:41 +00:00
Oliver Li	3f05ef2ae3	Merge branch 'Wei-Shaw:main' into vertex	2026-04-26 08:39:41 -04:00
Cloud370	3022090365	fix(anthropic): drop empty Read.pages in responses-to-anthropic tool input	2026-04-26 20:21:38 +08:00
Hai Chang	798fd673e9	feat(httputil): decode compressed request bodies (zstd/gzip/deflate) Codex CLI 0.125+ defaults to sending request bodies with Content-Encoding: zstd. Without server-side decompression the gateway returns 'Failed to parse request body' on /v1/responses (and any other JSON endpoint) because gjson sees raw zstd bytes. ReadRequestBodyWithPrealloc now inspects Content-Encoding and transparently decodes zstd, gzip/x-gzip, and deflate bodies before returning them, then strips the encoding headers and updates ContentLength so downstream code can reuse the bytes safely. Unsupported encodings produce a clear error. Adds unit tests covering identity, zstd, gzip, deflate, unsupported encoding, corrupt zstd payloads, nil bodies, and explicit identity.	2026-04-26 20:52:45 +10:00
github-actions[bot]	c056db740d	chore: sync VERSION to 0.1.119 [skip ci]	2026-04-26 05:24:11 +00:00
Wesley Liddick	a0b5e5bfa0	Merge pull request #1973 from Nobody-Zhang/main fix(payment): 修复 Zpay 退款接口调用	2026-04-26 13:11:42 +08:00
Wesley Liddick	41d0657330	Merge pull request #1970 from deqiying/fix-1754-claude-openai-cache-usage fix(anthropic): 修正缓存 token 的 Anthropic 用量语义	2026-04-26 13:08:18 +08:00
Nobody-Zhang	1a0cabbfd6	Fix Zpay refund endpoint handling	2026-04-26 04:57:34 +00:00
shaw	9b6dcc57bd	feat(affiliate): 完善邀请返利系统 - 修复返利不到账的根因：tryClaimAffiliateRebateAudit 中 PostgreSQL 参数类型推断冲突 - 补全 OAuth 注册路径（LinuxDo/OIDC/WeChat/Pending Flow）的邀请码绑定 - 前端 OAuth 注册页面传递 aff_code 参数 - 新增返利冻结期机制：可配置冻结时间，到期后自动解冻（懒解冻） - 新增返利有效期：绑定后 N 天内有效，过期不再产生返利 - 新增单人返利上限：超出上限部分精确截断 - 增强返利流程 slog 结构化日志，便于排查问题 - 已邀请用户列表增加返利明细列	2026-04-26 12:42:35 +08:00
Oliver	6d11f9ed77	Add Vertex service account support	2026-04-25 20:39:58 -04:00
Oliver	489a4d934e	Show today stats for Vertex usage window	2026-04-25 19:46:32 -04:00
deqiying	b17704d6ef	fix(anthropic): 修正缓存 token 的 Anthropic 用量语义	2026-04-26 01:14:59 +08:00
shaw	496469ac4e	fix(gateway): skip body mimicry for real Claude Code clients to restore prompt caching PR #1914 unconditionally applied the full mimicry pipeline to all OAuth accounts, including real Claude Code CLI clients. This replaced the client's long system prompt (~10K+ tokens with stable cache_control breakpoints) with a short ~45 token [billing, CC prompt] pair, which falls below Anthropic's 1024-token minimum cacheable prefix threshold. The result: every request created a new cache but never hit an existing one. Fix: restore the Claude Code client detection gate so that real CC clients bypass body-level mimicry (system rewrite, message cache management, tool name obfuscation). Non-CC third-party clients (opencode, etc.) continue to receive full mimicry. Also harden the detection logic: - Make UA regex case-insensitive (align with claude_code_validator.go) - Validate metadata.user_id format via ParseMetadataUserID() instead of just checking non-empty, preventing third-party tools from spoofing a claude-cli/* UA with an arbitrary user_id string to bypass mimicry	2026-04-25 22:50:35 +08:00
shaw	c1b52615be	fix(payment): allow Stripe payment pages to bypass router auth guard Stripe payment routes (/payment/stripe, /payment/stripe-popup) are reached via hard navigation (window.location.href), which caused the router guard to block access before the page could load. Set requiresAuth and requiresPayment to false, consistent with /payment/result. Backend API still enforces authentication.	2026-04-25 21:38:40 +08:00
shaw	3af9940b85	style: fix gofmt and ineffassign lint errors - gofmt: realign AffiliateDetail struct tags in affiliate_service.go - ineffassign: remove dead seenCompleted assignment before return in account_test_service.go	2026-04-25 20:37:42 +08:00
Wesley Liddick	22b1277572	Merge pull request #1948 from hungryboy1025/fix/openai-account-test-responses-stream fix(openai): tighten responses stream account tests	2026-04-25 20:31:07 +08:00
Wesley Liddick	aff98d5ae1	Merge pull request #1960 from gaoren002/fix/openai-stream-keepalive-downstream-idle fix(openai): keep responses stream alive during pre-output failover	2026-04-25 20:24:25 +08:00
shaw	4e1bb2b445	feat(affiliate): add feature toggle and per-user custom invite settings - 在系统设置「功能开关」中新增邀请返利总开关，默认关闭；关闭态：菜单隐藏、注册忽略 aff、新充值不返利，但已有 quota 仍可转余额 - 支持管理员为指定用户设置专属邀请码（覆盖随机码，全局唯一） - 支持管理员为指定用户设置专属返利比例（覆盖全局比例，可单条/批量调整） - 在系统设置邀请返利卡片内嵌入专属用户管理表格（搜索/编辑/批量/删除），删除采用项目通用 ConfirmDialog，会同时清除专属比例并把邀请码重置为系统随机码 - /affiliate 用户页新增「我的返利比例」卡片与动态使用说明，让用户直观看到分享后能拿到多少（同源 resolveRebateRatePercent 计算，与实际充值一致） - 新增数据库迁移 132 添加 aff_rebate_rate_percent 与 aff_code_custom 列 - 新增 admin 路由组 /api/v1/admin/affiliates/users/* 共 5 个端点 - AffiliateService 改为只依赖 *SettingService，去除冗余的 SettingRepository - 邀请码格式校验放宽到 [A-Z0-9_-]{4,32}，兼容旧 12 位系统码与新自定义码 - 补充单元测试与集成测试覆盖新方法、冲突路径与边界值	2026-04-25 20:22:07 +08:00
gaoren002	dac6e52091	fix(openai): keep responses stream alive during pre-output failover	2026-04-25 12:11:27 +00:00
hungryboy1025	8987e0ba67	fix(openai): tighten responses stream account tests	2026-04-25 16:56:50 +08:00
github-actions[bot]	9d1751ec57	chore: sync VERSION to 0.1.118 [skip ci]	2026-04-25 08:06:21 +00:00
Wesley Liddick	5d1c12e60e	Merge pull request #1943 from AyeSt0/fix/openai-responses-preoutput-failover fix(openai): 修复 Responses 流式失败前置事件导致无法 failover	2026-04-25 15:43:00 +08:00
AyeSt0	5b63a9b02d	fix(openai): fail over before responses stream output	2026-04-25 15:09:40 +08:00
Wesley Liddick	641e61073f	Merge pull request #1940 from 4fuu/fix/bump-codex-cli-version-to-0.125.0 fix(openai): bump codex CLI version from 0.104.0 to 0.125.0	2026-04-25 14:57:51 +08:00
shaw	095f457c57	feat(openai): port /responses/compact account support flow (PR #1555 ) 将 vansour/sub2api#1555 的 OpenAI compact 能力建模手工移植到当前 main：账号级 compact 状态/auto-force_on-force_off 模式、compact-only 模型映射、调度器 tier 分层（已支持 > 未知 > 已知不支持）、管理后台 compact 主动探测，以及对应 i18n/状态徽章。普通 /responses 流量行为不变，无数据库迁移。	2026-04-25 14:52:58 +08:00
4fuu	1e57e88e43	fix(openai): bump codex CLI version from 0.104.0 to 0.125.0 The hardcoded codex CLI version (0.104.0) causes upstream rejection when using gpt-5.5 with compact, as the server treats the request as an outdated client and returns 400/502. Update codexCLIVersion, codexCLIUserAgent, and openAICodexProbeVersion to 0.125.0 to match the current Codex CLI release. Fixes #1933, #1887, #1865 Related: #1609, #1298, #849	2026-04-25 05:26:33 +00:00
Wesley Liddick	b95ffce244	Merge pull request #1772 from KnowSky404/fix/openai-test-state-reconciliation [codex] reconcile OpenAI admin test rate-limit state	2026-04-25 10:02:21 +08:00
shaw	8f28a834f8	fix(payment): 同时启用易支付和 Stripe 时显示 Stripe 按钮 VISIBLE_METHOD_ALIASES 漏了 stripe，导致 getVisibleMethods 把后端返回的 stripe 过滤掉。点 Stripe 按钮时省略 method 查询参数，让落地页渲染完整的 Payment Element。	2026-04-25 09:46:27 +08:00
shaw	7424c73b05	chore: remove unused model IDs	2026-04-25 09:04:34 +08:00
Wesley Liddick	1afd81b019	Merge pull request #1920 from Wuxie233/fix/responses-web-search-tool-types fix(apicompat): recognize web_search_20250305 / google_search in Responses→Anthropic tool conversion	2026-04-25 09:00:37 +08:00
shaw	732d6495ea	chore(gateway): fix lint issues from cc-mimicry-parity merge - staticcheck QF1001: apply De Morgan's law to the OAuth-mimic header passthrough guard (`!(a && b)` → `a != ... \|\| !b`). - unused: drop `isClaudeCodeRequest`, which became dead after PR #1914 switched both `/v1/messages` and `/count_tokens` paths to unconditional `account.IsOAuth()` mimicry. The lowercase helper `isClaudeCodeClient` is kept (still referenced by `TestIsClaudeCodeClient`).	2026-04-25 08:58:57 +08:00
Wesley Liddick	6d20ab8082	Merge pull request #1914 from keh4l/feat/cc-mimicry-parity fix(claude): align Claude Code OAuth mimicry with real CLI traffic	2026-04-25 08:54:04 +08:00
shaw	aa8ee33b0a	refactor(affiliate): tighten DI and harden inviter code validation - Drop SetAffiliateService setters and ProvideAuthService / ProvidePaymentService / ProvideUserHandler wrappers in favor of direct Wire constructor injection. AffiliateService has no back-edge to Auth/Payment/User, so the indirection was never required. - Change RegisterWithVerification's variadic affiliateCode to a fixed parameter; adjust all call sites. - Validate aff_code length and charset in BindInviterByCode before any DB lookup, eliminating timing-side-channel and useless DB roundtrips on malformed input. - Make affiliate cache invalidation synchronous; surface Redis errors via the project logger instead of swallowing them in a detached goroutine. - Add an integration test guarding cross-layer tx propagation in AccrueQuota and a unit test pinning the aff_code format rules.	2026-04-25 08:44:18 +08:00
Wuxie233	5f630fbb19	fix(apicompat): recognize web_search_20250305 / google_search in Responses to Anthropic tool conversion	2026-04-25 01:09:51 +08:00
keh4l	bdbd2916f5	fix(gateway): skip client header passthrough on OAuth mimicry path Root cause of persistent third-party detection: sub2api's buildUpstreamRequest transparently forwards client headers via allowedHeaders whitelist (addHeaderRaw) before applying mimicry overrides. When third-party clients (opencode, etc.) send their own anthropic-beta / user-agent / x-stainless-* / x-claude-code-session-id values, these get appended to the request alongside our injected headers, creating an inconsistent header set that Anthropic detects. Parrot's build_upstream_headers constructs exactly 9 headers from scratch and never forwards anything from the client. This is why 'same opencode version, some users work some don't' — different opencode configs/versions send different header combinations. Fix: when tokenType=oauth and mimicClaudeCode=true, skip the client header passthrough loop entirely. The subsequent applyClaudeCodeMimicHeaders + ApplyFingerprint + beta merge pipeline constructs all necessary headers from our controlled values. Also: remove systemIncludesClaudeCodePrompt gate — OAuth accounts now unconditionally rewrite system (even if client already sent a Claude Code-style prompt), ensuring billing attribution block is always present.	2026-04-25 00:43:38 +08:00
keh4l	6dc89765fd	fix(gateway): always apply full mimicry for OAuth accounts regardless of client identity Before: isClaudeCodeRequest() checked whether the client looks like a real Claude Code CLI (UA, system prompt, X-App header, metadata format). If it looked like Claude Code, all mimicry was skipped — the assumption being that a real CLI needs no help. Problem: third-party tools like opencode partially impersonate Claude Code (sending claude-cli UA + claude-code beta + CC system prompt) but miss critical details (billing attribution block, tool-name obfuscation, cache breakpoints, full beta set). Some users' opencode instances pass the isClaudeCodeRequest check, causing sub2api to skip mimicry entirely, while Anthropic still detects the request as third-party. This explains why 'same opencode version, some users work, some don't' — it depends on which opencode features/config trigger the validator. Fix: OAuth accounts now unconditionally run the full mimicry pipeline, matching Parrot's behavior (Parrot never checks client identity). This is safe because our mimicry is strictly more complete than any third-party client's partial impersonation. Changed: - /v1/messages path: remove isClaudeCode gate - /v1/messages/count_tokens path: same	2026-04-25 00:26:37 +08:00
keh4l	f3233db01f	fix(gateway): apply D/E/F mimicry to native /v1/messages and count_tokens paths The previous commit only wired stripMessageCacheControl, addMessageCacheBreakpoints, and tool-name obfuscation into applyClaudeCodeOAuthMimicryToBody (used by /chat/completions and /responses). The native /v1/messages path and count_tokens path have their own independent mimicry code blocks and were missed. Now all three entry points share the same D/E/F pipeline: - /v1/messages (gateway_service.go forwardAnthropic) - /v1/messages/count_tokens (gateway_service.go countTokens) - OpenAI compat (applyClaudeCodeOAuthMimicryToBody)	2026-04-24 23:16:32 +08:00
keh4l	6e12578bc5	feat(gateway): port Parrot tool-name obfuscation + message cache breakpoints Implements the remaining three parity items with Parrot cc_mimicry: D) Tool-name obfuscation - Dynamic mapping when tools.length > 5 (matches Parrot threshold). Fake names follow {prefix}{name[:3]}{i:02d} (e.g. 'manage_bas00'). Go port of random.Random(hash(tuple(names))) uses fnv64a seed + math/rand; byte-exact reproduction is impossible (Python hash vs Go hash), but the two invariants that matter are preserved: * same input tool_names yield identical mapping (cache hit) * prefix pool is shuffled (names look distributed) - Static prefix map (sessions_ -> cc_sess_, session_ -> cc_ses_) applied as fallback, matching Parrot TOOL_NAME_REWRITES verbatim. - Server tools (web_search_20250305, computer_, etc.) are NOT renamed; only type=='function' and type=='custom' tools are. - tool_choice.name is rewritten in sync (only when type=='tool'). - Response side: bytes-level replace on every SSE chunk / JSON body at 6 injection points (standard stream/non-stream, passthrough stream/non-stream, chat_completions stream + non-stream, responses stream + non-stream). Reverse mapping applied longest-fake-name-first to prevent substring conflicts (parity with Parrot _restore_tool_names_in_chunk). - tool_choice is no longer unconditionally deleted in normalizeClaudeOAuthRequestBody — Parrot passes it through. E) tools[-1] cache_control breakpoint - Injected as {type:ephemeral, ttl:<DefaultCacheControlTTL>} when the last tool has no cache_control. Client-provided ttl is passed through unchanged (repo-wide policy). F) messages cache_control strategy - stripMessageCacheControl removes every client-provided messages[].content[].cache_control (multi-turn stability). - addMessageCacheBreakpoints then injects two stable breakpoints: (1) last message, and (2) second-to-last user turn when messages.length >= 4. - Combined with the system block breakpoint and tools[-1] breakpoint, this gives exactly the 4 breakpoints Anthropic allows per request. Non-trivial implementation details to be aware of when rebasing: Two new files, no upstream collision: gateway_tool_rewrite.go (D + E algorithms) gateway_messages_cache.go (F strip + breakpoints) * Two new feature calls bolted onto the tail of applyClaudeCodeOAuthMimicryToBody in gateway_service.go — rebase conflicts will be ~10 lines maximum. * Response-side injection points all wrap their existing write with reverseToolNamesIfPresent(c, ...), preserving original behavior when no mapping is stored (static prefix rollback still runs). * Non-stream chat/responses switched from c.JSON to json.Marshal + c.Data so bytes-level replace is possible. * Retry bodies (FilterThinkingBlocksForRetry, FilterSignatureSensitiveBlocksForRetry, RectifyThinkingBudget) only prune blocks — they preserve the already-obfuscated tool names, so no extra mapping re-application is needed. Manual QA: end-to-end scenario verified with 6 tools (above threshold) and tool_choice.type=='tool'. Obfuscation + restore roundtrip shown in test logs; then removed the temp test file. Tests (16 new): - buildDynamicToolMap stability + below-threshold guard - sanitizeToolName precedence (dynamic > static) - restoreToolNamesInBytes longest-first + static rollback - applyToolNameRewriteToBody skips server tools + syncs tool_choice - applyToolsLastCacheBreakpoint defaults to 5m + passes client ttl - stripMessageCacheControl + addMessageCacheBreakpoints in the 1/4/string-content cases + second-to-last user turn selection - buildToolNameRewriteFromBody ReverseOrdered is desc-by-fake-length - fake name shape follows Parrot {prefix}{head3}{i:02d}	2026-04-24 23:16:32 +08:00

1 2 3 4 5 ...

3256 Commits