9router

Author	SHA1	Message	Date
decolua	573b0f0241	- Refines the overall structure of the CLI tools and MITM server functionalities. - Add buildQwenBaseUrl function to construct URLs for Qwen resources. - Update buildProviderUrl to support Qwen model requests. - Enhance token refresh logic to include provider-specific data for Qwen. - Refactor CLI Tools page to exclude MITM tools and streamline model retrieval. - Introduce new components for MITM server management. - Update API routes to handle Qwen-specific resource URLs and improve error handling.	2026-03-05 11:25:03 +07:00
decolua	50990e84b4	Fix AG MITM	2026-03-01 18:40:55 +07:00
decolua	a7365c5a4e	Fix : Codex on cursor	2026-03-01 15:35:41 +07:00
gen	5a015e5b4d	feat(proxy): add outbound HTTP proxy support for OAuth + provider requests - Patch Node fetch via undici ProxyAgent when HTTP_PROXY/HTTPS_PROXY/ALL_PROXY is set - Ensure proxy patch is loaded for both chat pipeline and OAuth token exchange - Add Dashboard Settings → Network to edit outbound proxy and apply immediately - Persist outbound proxy settings in local db and initialize on server startup - Move proxy helpers to src/lib/network/ for better structure - Rename src/proxy.js → src/dashboardGuard.js to avoid naming confusion - Re-apply proxy env after DB import - Fix: close old dispatcher on proxy URL change to prevent connection pool leak - Fix: idempotency guard to avoid patching globalThis.fetch multiple times Made-with: Cursor	2026-02-28 10:11:53 +07:00
decolua	833069caac	Fix MITM on window	2026-02-28 10:04:57 +07:00
decolua	25c2ad7360	feat: implement model lock functionality for connection management	2026-02-27 10:29:11 +07:00
Quan	07717bad60	feat: cherry-pick PR #183 — multi-provider support, PWA, dynamic models, UI improvements Cherry-picked from decolua/9router PR #183. Note: open-sse changes included but need further review due to extensive modifications. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-25 11:40:50 +07:00
zx07	ea67742f2a	feat: implement real project ID fetching for Antigravity (#170 ) * feat: implement Project ID service to fetch and cache real Project IDs from Google Cloud Code API * fix: implement caching and cleanup for Project ID retrieval * feat: add project ID invalidation and refresh logic after token updates * refactor: remove unnecessary format changes * feat: add on-demand project ID retrieval for antigravity requests	2026-02-21 23:15:18 +07:00
decolua	3debf84b9a	Add Providers	2026-02-20 17:05:46 +07:00
decolua	4cf25dc53d	feat: implement API key requirement toggle	2026-02-18 13:46:14 +07:00
HXD.VN	e1b836168a	feat: add /v1/embeddings endpoint (OpenAI-compatible) (#146 ) * feat: implement /v1/embeddings endpoint (#117) Add OpenAI-compatible POST /v1/embeddings endpoint that routes through the existing provider credential + fallback infrastructure. Changes: - open-sse/handlers/embeddingsCore.js: core handler (handleEmbeddingsCore) * Validates input (string or array), encoding_format * Builds provider-specific URL and headers for openai, openrouter, and openai-compatible providers * Handles 401/403 token refresh via executor.refreshCredentials * Returns normalized OpenAI-format response { object: 'list', data, model, usage } - cloud/src/handlers/embeddings.js: cloud Worker handler (handleEmbeddings) * Auth + machineId resolution identical to handleChat * Provider credential fallback loop with rate-limit tracking - cloud/src/index.js: wire new routes * POST /v1/embeddings (new format — machineId from API key) * POST /{machineId}/v1/embeddings (old format — machineId from URL) * test: add unit tests for /v1/embeddings endpoint - Setup vitest as test framework (tests/ directory) - embeddingsCore.test.js (36 tests): - buildEmbeddingsBody: single string, array, encoding_format, default float - buildEmbeddingsUrl: openai, openrouter, openai-compatible-, unsupported - buildEmbeddingsHeaders: per-provider headers, accessToken fallback - handleEmbeddingsCore: input validation, success path, provider errors, network errors, invalid JSON, token refresh 401 handling - embeddings.cloud.test.js (23 tests): - CORS OPTIONS preflight - Auth: missing/invalid/old-format/wrong key → 401/400 - Body validation: bad JSON, missing model, missing input, bad model → 400 - Happy path: single string, array, delegation, CORS header, machineId override - Rate limiting: all-rate-limited → 429 + Retry-After, no credentials → 400 - Error propagation: non-fallback errors, 429 exhausts accounts Total: 59/59 tests passing Framework: vitest v4.0.18, Node v22.22.0 feat: add Next.js API route for /v1/embeddings endpoint Wire the embeddings handler into Next.js App Router. - src/app/api/v1/embeddings/route.js: Next.js API route (POST + OPTIONS) - src/sse/handlers/embeddings.js: SSE-layer handler mirroring chat.js pattern Uses handleEmbeddingsCore from open-sse/handlers/embeddingsCore.js with the same auth, credential fallback, and token refresh logic as the chat handler. Supports REQUIRE_API_KEY env var, provider fallback loop, and consistent logging.	2026-02-18 13:24:02 +07:00
Nick Roth	202fee714b	feat(auth): add model-level rate limit locking for multi-bucket providers (#120 ) Providers like Antigravity maintain separate quota buckets per model family (e.g. Claude vs Gemini). A 429 on claude-opus previously locked the entire account, preventing gemini-pro requests even though its quota was full. This adds in-memory per-model locking so that only the specific model is skipped during account selection while other models remain accessible. Changes: - Add model-aware lock tracking in auth.js (Map<connectionId:model, expiry>) - Pass model context from chat handler to auth service - Multi-bucket behavior gated to known providers (MULTI_BUCKET_PROVIDERS set) - No database schema changes — locks are in-memory and clear on restart Closes #110	2026-02-15 11:35:13 +07:00
Blade096	1ae4e311b7	feat: add GLM Coding (China) provider and Usage by API Keys statistics Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-11 15:44:08 +07:00
Diego Souza	3d439839d9	feat(cloud): harden sync/auth flow, SSE fallback, and update changelog Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-08 16:45:31 +07:00
decolua	bdbe8162e7	feat(provider): add free providers and enhance error handling	2026-02-07 11:17:06 +07:00
Hellodebasishsahu	127475df84	feat(codex): add GPT 5.3, fix API translation, add thinking levels - Add GPT 5.3 Codex model with thinking level variants (none/low/medium/high/xhigh) - Extract thinking level from model name suffix (e.g., gpt-5.3-codex-high) - Fix Codex translation: preserve openai-responses format for Droid CLI - Add effort level logging in request logs Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-06 09:46:11 +07:00
ramhaidar	da5bdef4cb	feat: Add Anthropic Compatible provider support - Added support for 'anthropic-compatible' provider nodes in backend. - Implemented isAnthropicCompatible logic in open-sse for /messages URL construction and headers. - Added UI for creating and managing Anthropic Compatible providers in the dashboard. - Updated validation logic for Anthropic-compatible endpoints. - Sanitize base URL input (strip trailing /messages) to prevent 404s and improve UX. - Improve validation: use GET /models (2xx success), and support x-api-key / Authorization Bearer hybrid proxies. - Enable model import via /models for Anthropic Compatible providers. - Ensure Authorization is omitted when x-api-key is present to avoid strict proxy conflicts. - Resolve Anthropic-compatible credentials by prefix during model resolution (e.g., acx/model). - Update default executor to match provider header/url behavior for Anthropic-compatible providers.	2026-02-03 15:11:41 +07:00
decolua	0a28f9f924	feat: Add OpenAI-compatible provider nodes - Support multiple OpenAI-compatible providers with custom prefix/baseUrl - Add provider nodes CRUD (create/read/update/delete) - URL building: baseUrl + /chat/completions or /responses - Model import from /models endpoint - API key validation via /models - Usage type safety across all translators - OAuth token auto-refresh for expired tokens	2026-02-02 19:45:12 +07:00
decolua	c208f244ee	Enhance chat handling.	2026-01-14 15:42:38 +07:00
Catalin Stanciu	3ad2f8dc58	fix: prevent race conditions in sticky round-robin Adds a mutex to serialize account selection and updates in the proxy engine. This ensures that concurrent requests respect the sticky limit and don't distribute to the same account simultaneously. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-09 17:46:52 +07:00
Catalin Stanciu	4f292aae63	feat: add sticky round-robin routing strategy Implements a "sticky" round-robin strategy that uses the same provider account for a configurable number of consecutive calls (default 3) before switching to the next one. This optimizes for prompt caching by reducing organization/account rotation. Adds a configuration input to the Profile settings page. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-09 17:45:32 +07:00
Catalin Stanciu	9ebd7d3062	feat: add round-robin routing strategy Implements a round-robin (least recently used) account selection strategy alongside the existing fill-first priority system. Adds a toggle in the Profile dashboard to switch between strategies. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-09 17:25:28 +07:00
Catalin Stanciu	9c3d6f4ad8	feat: implement usage tracking for AI requests Adds local token usage tracking for all AI providers. Usage data is captured during stream processing and stored in a local database. Includes a new Usage tab in the Providers dashboard to visualize historical token consumption. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-09 17:23:28 +07:00
decolua	18533505ef	refactor: restructure translator from from-openai/to-openai to request/response folders	2026-01-09 17:14:51 +07:00
decolua	3857598de4	Initial commit	2026-01-05 09:58:59 +07:00

25 commits