9router

Author	SHA1	Message	Date
decolua	875a1282ea	Fix bug	2026-04-11 11:36:33 +07:00
Payne	32a746181a	fix: update Cursor client version to 3.1.0 for Composer 2 compatibility (#525 ) Cursor's API now rejects requests with outdated client versions, returning [400]: Update Required for Composer 2. Bump x-cursor-client-version from 2.3.41 to 3.1.0 across all three locations where it is set. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 15:37:51 +07:00
decolua	67e0db77da	Fix : Updated Anthropic-Beta header.	2026-04-05 07:46:26 +07:00
kwanLeeFrmVi	666aecfc7c	feat(translator): lossless passthrough via CLI tool + provider pairing Add clientDetector utility to identify CLI tools (Claude Code, Gemini CLI, Antigravity, Codex) from request headers. When the CLI tool and provider are a native pair, skip all translation — only swap model and Bearer token. Made-with: Cursor	2026-04-04 23:48:58 +07:00
kwanLeeFrmVi	1c160cc8d9	feat(claude-code): spoof TLS fingerprint and stabilize headers for Anthropic - Add claudeHeaderCache.js to intercept and cache live Claude Code client headers - Forward cached headers dynamically to api.anthropic.com via default.js - Strip first-party identity headers (x-app, claude-code-* beta) for non-Anthropic upstreams - Validate and sanitize tool call IDs to match Anthropic pattern (^[a-zA-Z0-9_-]+$) - Skip thinking blocks when applying cache_control; fix max_tokens buffer (+1024) - Strip cache_control from thinking blocks in openai-to-claude translator - Comment out thoughtSignature in Gemini translator (kept for reference) - Expand .gitignore to match all deploy*.sh variants Co-authored-by: kwanLeeFrmVi <quanle96@outlook.com> Closes #433 Made-with: Cursor	2026-03-30 16:27:28 +07:00
decolua	f1c53a319e	refactor: update MITM bypass logic and enhance combo name validation	2026-03-19 22:47:32 +07:00
decolua	f264bb9a23	Refactor error logging to provide clearer context on provider failures	2026-03-14 17:08:11 +07:00
decolua	877eea8ebe	chore: Update package version to 0.3.51 and improve connection handling in API route	2026-03-14 11:56:29 +07:00
decolua	6b624af4d0	fix: Update abort method in pipeWithDisconnect to return a promise for better error handling	2026-03-14 11:38:33 +07:00
decolua	adae2605bf	Feat : Auto restart after crash	2026-03-14 09:37:29 +07:00
decolua	373b10ebb5	feat(chat): Enhance bypass handling and introduce CC filter naming feature Fix : Ollam Provider response	2026-03-13 09:41:40 +07:00
decolua	b0c6b61398	Refactor config	2026-03-12 16:20:46 +07:00
decolua	8223c87988	feat(memory-management): Introduce MEMORY_CONFIG for session and DNS management, including session TTL, cleanup intervals, and proxy dispatcher limits.	2026-03-12 15:57:21 +07:00
decolua	83d94daa82	feat(ollama): Enhance Ollama support by adding new models, updating API format handling, and integrating translation functionality.	2026-03-12 15:24:10 +07:00
decolua	d9dad5bcf3	Fix : Add custom to model selector	2026-03-11 11:59:07 +07:00
decolua	880f4eca91	feat(proxy): add proxy pool and per-connection binding + strictProxy support - Centralize proxy management with reusable proxy pools - Per-connection proxy binding with legacy fallback - Add strictProxy option: fail hard instead of silently falling back to direct - Resolve alicode-intl conflict: keep alicode-intl support + proxy support Made-with: Cursor	2026-03-09 15:46:06 +07:00
decolua	07d4cdfa7e	Fix : Claude OAuth	2026-03-03 14:46:05 +07:00
decolua	4903a9b2cb	Feat : console log	2026-03-02 09:31:16 +07:00
decolua	50990e84b4	Fix AG MITM	2026-03-01 18:40:55 +07:00
decolua	2f4b813c5b	feat(usage): implement timeout and error handling for antigravity usage and subscription requests - Add a 10-second timeout for fetch requests in getAntigravityUsage and getAntigravitySubscriptionInfo functions. - Include error logging for fetch failures in both functions. - Update headers to include "x-request-source" for MITM bypass. - Enhance proxyFetch with DNS resolution and MITM bypass capabilities. - Ensure proxyFetch is loaded in the API route for proper fetch patching.	2026-02-28 12:12:49 +07:00
gen	5a015e5b4d	feat(proxy): add outbound HTTP proxy support for OAuth + provider requests - Patch Node fetch via undici ProxyAgent when HTTP_PROXY/HTTPS_PROXY/ALL_PROXY is set - Ensure proxy patch is loaded for both chat pipeline and OAuth token exchange - Add Dashboard Settings → Network to edit outbound proxy and apply immediately - Persist outbound proxy settings in local db and initialize on server startup - Move proxy helpers to src/lib/network/ for better structure - Rename src/proxy.js → src/dashboardGuard.js to avoid naming confusion - Re-apply proxy env after DB import - Fix: close old dispatcher on proxy URL change to prevent connection pool leak - Fix: idempotency guard to avoid patching globalThis.fetch multiple times Made-with: Cursor	2026-02-28 10:11:53 +07:00
decolua	a5eb5a864e	chore: add Gemini 3.1 Pro models to provider configurations	2026-02-22 15:20:24 +07:00
Hồ Xuân Dũng	a57a8ce206	feat: add Gemini embeddings support + Letta compatibility fixes Cherry-picked from decolua/9router#148 (author: xuandung38 / Hồ Xuân Dũng <me@hxd.vn>) - Add Google AI (Gemini) embeddings support for /v1/embeddings endpoint - Add Gemini embedding models: gemini-embedding-001, text-embedding-005, text-embedding-004 - Inject missing object/created fields for Letta and strict OpenAI clients - Strip Azure-specific fields (prompt_filter_results, content_filter_results) from responses - Fix Dockerfile: copy open-sse directory into Docker runner stage Skipped: whitelist message field stripping (commit 3/7/8) — too aggressive for all providers Skipped: default stream=false change (commit 9) — behavior change needs further review Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-20 15:01:10 +07:00
zx	a229d79158	feat(antigravity): initial steps for Antigravity anti-ban alignment Cherry-picked from decolua/9router#141 (author: LinearSakana / zx <me@char.moe>) - Implement client identity spoofing with numeric enums (ideType: 9, pluginType: 2) - Add runtime platform detection for User-Agent and metadata - Implement per-connection session ID caching (binary-compatible format) - Add ANTIGRAVITY_HEADERS (X-Client-Name, X-Client-Version, x-goog-api-client) - Add X-Machine-Session-Id header injection - Align metadata/mode parameters across all Antigravity API calls - Implement double injection for system prompt (raw + [ignore] wrapped) - Rename internal anti-loop header to x-request-source for anonymity Skipped: commit 6 (signature side-channel caching) — kept DEFAULT_THINKING_GEMINI_SIGNATURE Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-20 14:44:29 +07:00
misterdas	b9a697925e	fix(open-sse): emit [DONE] in passthrough SSE mode (#142 )	2026-02-18 13:26:23 +07:00
zx07	3d29b86d44	feat: enhance disconnect handling and request tracking in chatCore.js (#126 ) Co-authored-by: zx <me@char.moe>	2026-02-15 11:51:37 +07:00
apple-techie	d7d5dc90bc	fix: update Codex executor for gpt-5.3-codex support Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-12 18:12:38 +07:00
Blade096	1ae4e311b7	feat: add GLM Coding (China) provider and Usage by API Keys statistics Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-11 15:44:08 +07:00
decolua	d3c3a4ae0a	Remove Docker publish workflow and update error handling in various modules - Added handling for HTTP_STATUS.NOT_ACCEPTABLE in error types and messages. - Enhanced the `prepareClaudeRequest` function to filter built-in tools for non-Anthropic providers and clean up empty tool arrays. - Updated the `openaiToClaudeRequest` function to handle built-in tools more effectively and ensure proper tool conversion. - Improved the `claudeToOpenAIResponse` function to skip processing for built-in server tool blocks. - Refined error message handling in the `parseUpstreamError` function to ensure meaningful output. - Adjusted command checks for tool installations across various settings routes to use `command -v` for better compatibility.	2026-02-10 19:18:40 +07:00
Blade	85b7a0b136	Feature/ai observability dashboard (#79 ) * feat: add AI request details feature with latency tracking Add comprehensive request history and debugging capability to the Usage dashboard: Storage Layer (usageDb.js): - Add saveRequestDetail() for storing full request/response details - Implement FIFO queue with 1000-record limit in request-details.json - Auto-sanitize sensitive headers (authorization, api-key, cookie, token) - Add getRequestDetails() with pagination and filtering support - Add getRequestDetailById() for single record lookup Pipeline Integration (chatCore.js): - Track request start time and calculate total latency - Record TTFT (Time To First Token) and total latency for all requests - Capture full request details (messages, model, parameters) - Save response content for non-streaming, mark streaming responses - Handle error cases with detailed error information - Async non-blocking saves to avoid impacting request performance API Layer (/api/usage/request-details): - GET endpoint with pagination (page, pageSize: 1-100) - Filter by provider, model, connectionId, status, date range - Returns { details: [...], pagination: {...} } format UI Components: - Drawer.js: Right slide-out panel with backdrop blur and ESC close - Pagination.js: Full pagination with page size selector (10/20/50) - RequestDetailsTab.js: Complete table view with filters and detail drawer Dashboard Integration: - Add "Details" tab to Usage page (4th tab after Overview/Logger/Limits) - Table columns: Timestamp, Model, Provider, Input Tokens, Output Tokens, Latency (TTFT/Total), Action - Provider filter dropdown (9 providers supported) - Date range filters (start/end datetime) - Click "Detail" button to view full request/response JSON in slide-out drawer Features: - Real-time latency monitoring (TTFT & Total) - Complete request/response inspection for debugging - Filterable and searchable request history - Responsive design with mobile-friendly filters - Data security with automatic header sanitization - Performance: async saves don't block request pipeline Files Created/Modified: - src/lib/usageDb.js (modified) - open-sse/handlers/chatCore.js (modified) - src/app/api/usage/request-details/route.js (new) - src/shared/components/Drawer.js (new) - src/shared/components/Pagination.js (new) - src/app/(dashboard)/dashboard/usage/components/RequestDetailsTab.js (new) - src/app/(dashboard)/dashboard/usage/page.js (modified) Closes: AI Observability Dashboard feature * feat: enhance request details with full config and streaming content capture Improve Request Details feature to capture comprehensive request parameters and actual streaming response content: Request Configuration Enhancement (chatCore.js): - Add extractRequestConfig() helper function to capture all request parameters - Include temperature controls: temperature, top_p, top_k - Include token limits: max_tokens, max_completion_tokens - Include thinking/reasoning modes: thinking, reasoning, enable_thinking - Include OpenAI parameters: presence_penalty, frequency_penalty, seed, stop, tools, tool_choice, response_format, n, logprobs, top_logprobs, logit_bias, user, parallel_tool_calls, prediction, store, metadata - Apply to all request types: non-streaming, streaming, and error cases Streaming Content Capture (chatCore.js & stream.js): - Add onStreamComplete callback mechanism to stream processors - Accumulate content from all formats: OpenAI, Claude, Gemini - Track content from delta.content, delta.reasoning_content, delta.text, delta.thinking, and Gemini content.parts - Save initial record with "[Streaming in progress...]" marker - Update record with actual content when stream completes - Include usage tokens when available from stream Files Modified: - open-sse/handlers/chatCore.js - extractRequestConfig() + streaming capture - open-sse/utils/stream.js - onStreamComplete callback + content accumulation Benefits: - View complete request configuration in Request Details (thinking mode, etc.) - See actual streaming response content instead of placeholder - Better debugging and observability for AI requests Refs: #request-details-enhancement * feat: separate thinking/reasoning content from response content Improve Request Details to display thinking process separately from final response: Backend Changes: - stream.js: Capture content and thinking separately in streaming mode - Add accumulatedThinking variable alongside accumulatedContent - Route delta.content to content, delta.reasoning_content to thinking - Support OpenAI (reasoning_content), Claude (thinking), Gemini (part.thought) - Update onStreamComplete callback to return { content, thinking } object - chatCore.js: Update response structure to include thinking field - Non-streaming: Extract thinking from reasoning_content field - Streaming: Receive { content, thinking } from stream callback - Error responses: Include thinking: null - Initial streaming save: Include thinking: null Frontend Changes: - RequestDetailsTab.js: Display thinking and content in separate sections - Add amber/yellow themed "Thinking Process" section with psychology icon - Show "Final Response" label when thinking is present - Use distinct visual styling for thinking (amber bg) vs content (gray bg) - Only show thinking section when thinking content exists Benefits: - Users can clearly see model's reasoning process vs final answer - Better debugging for models with thinking capabilities (Claude, o1, etc.) - Visual distinction makes it easy to identify thinking vs response Refs: #thinking-content-separation * fix: map Claude thinking to reasoning_content field Fix Claude thinking content to be properly captured as reasoning_content instead of regular content, enabling separate display in Request Details: Changes: - claude-to-openai.js: Use reasoning_content field for thinking blocks - thinking start: send { reasoning_content: "" } instead of { content: "```\n```" } - thinking delta: map to reasoning_content instead of content - thinking stop: send { reasoning_content: "" } instead of { content: "```\n```" } Why This Matters: - Previously Claude thinking was sent as `content` field, mixed with actual response - Now thinking uses `reasoning_content` field, matching OpenAI's o1 format - stream.js can now properly route thinking to accumulatedThinking variable - Request Details UI will show Claude thinking in separate "Thinking Process" section Supported Thinking Formats: - OpenAI: delta.reasoning_content → thinking - Claude: delta.thinking → reasoning_content (now fixed) - Gemini: part.thought === true → thinking Refs: #claude-thinking-fix * feat(observability): capture and display full 4-layer request chain Capture complete request/response chain in AI Request Details: - Add providerRequest field (translated request sent to provider) - Add providerResponse field (raw provider response, streaming indicator) - Update chatCore.js at all 5 saveRequestDetail() call sites - Reorganize UI into 4 collapsible sections with Material icons - Preserve backward compatibility for old records - Add distinct styling for streaming indicator * fix(observability): resolve React duplicate key warning in request details table - Use composite key (detail.id + index) to ensure unique keys - Prevents React warnings when database contains duplicate IDs from old ID generation * fix(observability): display actual content in streaming request details Change providerResponse field for streaming requests from placeholder "[Streaming - raw response not captured]" to actual final content. This improves debugging experience by showing the real AI response in the "Provider Response (Raw)" section instead of a confusing placeholder message. Files changed: - open-sse/handlers/chatCore.js: Save contentObj.content to providerResponse - src/app/.../RequestDetailsTab.js: Remove special handling for placeholder * refactor(observability): migrate request details to SQLite for improved concurrency - Replace LowDB JSON storage with better-sqlite3 - Enable WAL mode for true concurrent read/write support - Add 5 indexes to accelerate queries (timestamp, provider, model, connection_id, status) - Perform pagination at the database level to reduce memory footprint - Maintain 1000 record limit with automatic cleanup of old data - Ensure API compatibility via re-exports, requiring no caller changes Performance improvements: - Concurrent Writes: Lock-free WAL mode prevents data contention - Query Efficiency: Index-based searches replace full dataset loading - Data Integrity: Atomic operations prevent file corruption * fix(observability): resolve pagination statistics display issues - Fix issue where totalItems=0 showed 'Showing 1 to 0 of 0 results' - Hide pagination controls when totalItems=0 or totalPages<=1 - Standardize API response fields: pagination.total -> pagination.totalItems Before: Incorrect stats shown for empty data, and pager visible even for single-page results After: Stats hidden for empty data, pager hidden when navigation is unnecessary * feat(observability): display friendly provider names in request details - Add /api/usage/providers endpoint to dynamically fetch provider list with names - Replace hardcoded provider options with dynamic loading from database - Display friendly provider names instead of IDs in both table and detail drawer - Support custom provider nodes (e.g., OpenAI-compatible) with user-defined names - Add provider name caching to optimize performance * fix(observability): use INSERT OR REPLACE for request details to handle streaming updates * fix(observability): resolve zero-token display issue by ensuring streaming usage capture and fixing key mismatch * fix(observability): separate TTFT and total latency calculation for streaming requests * feat(observability): implement SQLite write queue and JSON size limits - Added in-memory buffer and batch writing for SQLite to prevent lock contention - Implemented with configurable 1MB limit to prevent DB bloat - Added dashboard UI for observability performance and data management settings - Integrated graceful shutdown handlers to prevent data loss * fix(observability): resolve ReferenceError by declaring dbInstance	2026-02-09 10:30:42 +07:00
decolua	388389c972	Revert "feat(request-details): implement observability settings and enhance request detail tracking" This reverts commit `cbabf5547c`.	2026-02-09 10:29:38 +07:00
decolua	cbabf5547c	feat(request-details): implement observability settings and enhance request detail tracking - Added new observability settings in the dashboard for max records, batch size, flush interval, and max JSON size. - Introduced `extractRequestConfig` function to capture full request configurations. - Enhanced error handling by saving detailed request information on failures. - Updated usage tracking to include new token metrics. - Modified streaming functions to support detailed content and reasoning tracking.	2026-02-09 10:20:24 +07:00
decolua	bdbe8162e7	feat(provider): add free providers and enhance error handling	2026-02-07 11:17:06 +07:00
decolua	c6127412a6	Fix Antigravity	2026-02-06 09:18:01 +07:00
decolua	32aefe5a76	feat(executors): Improved UI components for displaying provider limits and usage statistics in the dashboard.	2026-02-05 18:38:50 +07:00
decolua	0a026c7af6	feat(cursor): Add cursor Provider	2026-02-05 11:06:20 +07:00
decolua	abaeb22863	Merge branch 'pr-50'	2026-02-05 11:02:50 +07:00
triadmoko	137f315bec	feat(cursor): Integrate Cursor IDE support with OAuth import token flow - Add CursorExecutor for handling requests to the Cursor API using protobuf over HTTP/2. - Implement CursorAuthModal for user token import from local SQLite database. - Update provider models and constants to include Cursor as a supported provider. - Enhance API service with token validation and user info extraction from Cursor tokens. - Introduce utility functions for checksum generation and protobuf encoding/decoding for Cursor API interactions.	2026-02-04 12:07:29 +07:00
decolua	e6e44ac364	Fix : usage convert	2026-02-04 09:54:11 +07:00
decolua	7881db81ec	feat: Implement buffer addition to usage tracking for improved context handling	2026-02-03 10:39:20 +07:00
decolua	df0e1d6485	feat: Update response handling and logging for improved usage tracking	2026-02-03 10:22:43 +07:00
decolua	a33924b336	feat: Enhance usage tracking across response handlers	2026-02-03 00:29:22 +07:00
decolua	0a28f9f924	feat: Add OpenAI-compatible provider nodes - Support multiple OpenAI-compatible providers with custom prefix/baseUrl - Add provider nodes CRUD (create/read/update/delete) - URL building: baseUrl + /chat/completions or /responses - Model import from /models endpoint - API key validation via /models - Usage type safety across all translators - OAuth token auto-refresh for expired tokens	2026-02-02 19:45:12 +07:00
decolua	c7219d0ac9	Fix cloud	2026-01-29 18:07:28 +07:00
decolua	79342c0c3e	ok	2026-01-27 10:49:16 +07:00
decolua	d9b8e48725	feat: OpenAI compatibility improvements & build fixes - Fix hydration mismatches and initialization errors - Add /v1/models endpoint for OpenAI clients - Add Codex response translator (Responses → OpenAI) - Fix circular dependencies and PropTypes - Add Material Symbols font and CSS fixes - Update README with deployment guide Co-merged from PR #18 (14/15 commits, skipped debug)	2026-01-20 13:16:34 +07:00
decolua	0943387d24	Integrated proxy support	2026-01-20 12:09:14 +07:00
decolua	26b61e5fbb	Feat Kiro OAuth, Fix Codex	2026-01-15 18:29:47 +07:00
decolua	c208f244ee	Enhance chat handling.	2026-01-14 15:42:38 +07:00
decolua	f9ef718fc6	Fix Antigravity	2026-01-13 17:19:41 +07:00

1 2

57 commits