marketing-shibata50/9router

Fork 0

decolua fd92af77a0 Feat : Gitbook

2026-05-11 11:50:24 +07:00

8.9 KiB

Raw Blame History

Cheap Providers - Ultra-Cheap Backup

When subscription quota runs out, pay pennies instead of dollars. ~90% cheaper than ChatGPT API!

Overview

Cheap tier providers are your backup when subscription quota exhausted:

💰 GLM-4.7 - $0.6/$2.2 per 1M tokens (daily reset)
💰 MiniMax M2.1 - $0.2/$1.0 per 1M tokens (5h reset)
💰 Kimi K2 - $9/month flat (10M tokens)

Strategy: Use after subscription quota out, before free tier. Massive cost savings vs ChatGPT API ($20/1M).

GLM-4.7 (Daily Reset)

Pricing

Tier	Input	Output	Reset
Standard	$0.60/1M	$2.20/1M	Daily 10:00 AM
Coding Plan	$0.60/1M	$2.20/1M	Daily 10:00 AM (3× quota)

Cost Example (10M tokens):

Input: 10M × $0.60 = $6
Output: 10M × $2.20 = $22
Total: $6-22 vs $200 on ChatGPT API!

Setup

Step 1: Sign Up

Visit Zhipu AI
Create account (phone verification)
Choose Coding Plan for 3× quota at same price

Step 2: Get API Key

Dashboard → API Keys → Create New
→ Copy API key (starts with "zhipu-")

Step 3: Add to 9Router

9router
# Dashboard → Providers → Add API Key

Provider: glm
API Key: zhipu-your-api-key-here

Step 4: Use in CLI

Model: glm/glm-4.7
       glm/glm-4.6v (vision)

Available Models

Model ID	Description	Context	Best For
`glm/glm-4.7`	GLM 4.7	128K	Coding, general tasks
`glm/glm-4.6v`	GLM 4.6V Vision	128K	Image analysis

Pro Tips

Coding Plan - 3× quota at same price ($0.6/$2.2)
Daily reset - Fresh quota at 10:00 AM Beijing time
Best for coding - Optimized for code generation
128K context - Handle large files

Quota Reset

Daily reset: 10:00 AM Beijing Time (UTC+8)
→ 2:00 AM UTC
→ 6:00 PM PST (previous day)
→ 9:00 PM EST (previous day)

Plan your heavy tasks around reset time!

MiniMax M2.1 (5-Hour Reset)

Pricing

Tier	Input	Output	Reset
Standard	$0.20/1M	$1.00/1M	5-hour rolling

Cost Example (10M tokens):

Input: 10M × $0.20 = $2
Output: 10M × $1.00 = $10
Total: $2-10 - Cheapest option!

Setup

Step 1: Sign Up

Visit MiniMax
Create account
Verify email/phone

Step 2: Get API Key

Dashboard → API Management → Create Key
→ Copy API key

Step 3: Add to 9Router

9router
# Dashboard → Providers → Add API Key

Provider: minimax
API Key: your-minimax-api-key

Step 4: Use in CLI

Model: minimax/MiniMax-M2.1

Available Models

Model ID	Description	Context	Best For
`minimax/MiniMax-M2.1`	MiniMax M2.1	1M tokens	Long context, coding

Pro Tips

Cheapest option - $0.20/1M input (90% cheaper than ChatGPT)
5-hour rolling - Quota resets every 5 hours
1M context - Massive context window
Best for long files - Handle entire codebases

Quota Reset

5-hour rolling window:
→ Use quota → Wait 5 hours → Fresh quota

Example:
10:00 AM - Use 5M tokens
3:00 PM - Fresh quota available
8:00 PM - Fresh quota available

Code 24/7 with minimal cost!

Kimi K2 (Flat $9/month)

Pricing

Plan	Monthly Cost	Included Tokens	Effective Cost
Subscription	$9	10M tokens	$0.90/1M

Cost Example:

$9/month flat
10M tokens included
Effective: $0.90/1M - Best value for consistent usage!

Setup

Step 1: Subscribe

Visit Moonshot AI
Create account
Subscribe to $9/month plan

Step 2: Get API Key

Dashboard → API Keys → Create New
→ Copy API key

Step 3: Add to 9Router

9router
# Dashboard → Providers → Add API Key

Provider: kimi
API Key: your-kimi-api-key

Step 4: Use in CLI

Model: kimi/kimi-latest

Available Models

Model ID	Description	Context	Best For
`kimi/kimi-latest`	Kimi Latest	200K	General coding

Pro Tips

Fixed cost - $9/month regardless of usage (up to 10M)
Best for consistent usage - If you use 10M/month, only $0.90/1M
Monthly reset - 10M tokens reset monthly
Predictable billing - No surprise costs

Quota Reset

Monthly reset: 1st of each month
→ 10M tokens refresh

Example monthly usage:
Week 1: 3M tokens
Week 2: 2M tokens
Week 3: 3M tokens
Week 4: 2M tokens
Total: 10M tokens = $9 flat

Pricing Comparison

Provider	Input/1M	Output/1M	Reset	10M Cost	Best For
GLM-4.7	$0.60	$2.20	Daily 10AM	$6-22	Daily quota users
MiniMax M2.1	$0.20	$1.00	5-hour	$2-10	Cheapest!
Kimi K2	$0.90	$0.90	Monthly	$9 flat	Consistent usage
ChatGPT API	$20.00	$20.00	None	$200	❌ Expensive

Savings: 90-95% cheaper than ChatGPT API!

Usage Example

Cursor IDE Setup

Settings → Models → Advanced:
  OpenAI API Base URL: http://localhost:20128/v1
  OpenAI API Key: [from 9router dashboard]
  Model: glm/glm-4.7

Create Combo (Recommended)

Dashboard → Combos → Create New

Name: cheap-backup
Models:
  1. cc/claude-opus-4-5 (Subscription primary)
  2. glm/glm-4.7 (Cheap backup, daily reset)
  3. minimax/MiniMax-M2.1 (Cheapest fallback)
  4. if/kimi-k2-thinking (FREE emergency)

Use in CLI: cheap-backup

Result: Subscription → Cheap → Cheapest → Free

Cost Optimization

Strategy 1: Daily Reset Routine

Morning (10AM): Fresh GLM quota
→ Use GLM for heavy tasks
→ Save subscription quota

Afternoon: Subscription quota
→ Use Claude/Codex for complex tasks

Evening: MiniMax (5h reset)
→ Cheap fallback for late work

Night: Free tier (iFlow)
→ Zero cost emergency backup

Strategy 2: Budget-First

Set monthly budget: $20

Allocation:
- $9 Kimi K2 (10M tokens flat)
- $6 GLM daily quota (10M tokens)
- $5 MiniMax overflow (25M tokens)

Total: 45M tokens for $20
vs 1M tokens for $20 on ChatGPT API!

Strategy 3: Maximize Subscriptions First

Priority:
1. Gemini CLI (180K/month FREE)
2. Claude Code (subscription you already pay)
3. GLM-4.7 (cheap backup, $0.6/1M)
4. MiniMax M2.1 (cheapest, $0.2/1M)
5. iFlow (FREE emergency)

Monthly cost example (100M tokens):
- 60M via Gemini CLI: $0 (free)
- 30M via Claude Code: $0 (subscription)
- 8M via GLM: $4.80
- 2M via MiniMax: $0.40
Total: $5.20/month!

Real-World Examples

Example 1: Heavy Coding Month (100M tokens)

Breakdown:
- 60M via subscription (Claude/Codex): $0 extra
- 30M via GLM-4.7: $18
- 10M via MiniMax M2.1: $2

Total: $20/month
vs $2000 on ChatGPT API!

Savings: 99% cheaper!

Example 2: Budget Coder ($10/month)

Strategy:
- $9 Kimi K2 (10M tokens)
- $1 MiniMax overflow (5M tokens)

Total: 15M tokens for $10
vs 0.5M tokens for $10 on ChatGPT API!

30× more tokens!

Example 3: Freelancer (Variable Usage)

Light month (20M tokens):
- 15M via subscription: $0
- 5M via GLM: $3
Total: $3

Heavy month (150M tokens):
- 60M via subscription: $0
- 60M via GLM: $36
- 30M via MiniMax: $6
Total: $42

Average: $22.50/month
vs $3400 on ChatGPT API!

Best Practices

1. Track Daily Quota

Dashboard shows:
- GLM quota: 75% used (reset in 6h)
- MiniMax quota: 50% used (reset in 2h)
- Kimi quota: 8M/10M used (reset in 15 days)

Plan heavy tasks around reset times!

2. Use Coding Plan (GLM)

Standard: 1× quota
Coding Plan: 3× quota (same price!)

→ Always choose Coding Plan

3. Combine with Free Tier

Combo:
1. gc/gemini-3-flash (FREE primary)
2. glm/glm-4.7 (cheap backup)
3. minimax/MiniMax-M2.1 (cheapest)
4. if/kimi-k2-thinking (FREE emergency)

Result: Minimize costs, maximize uptime

4. Set Budget Alerts

Dashboard → Settings → Budget Alerts

Daily: $2 limit
Weekly: $10 limit
Monthly: $30 limit

→ Auto switch to free tier when limit reached

Troubleshooting

"Quota exhausted"

Solution:

GLM: Wait until 10:00 AM Beijing time
MiniMax: Wait 5 hours from first use
Kimi: Wait until 1st of next month
Use combo fallback to free tier

"API key invalid"

Solution:

Check API key copied correctly
Verify account has credits
Regenerate API key if needed

"High costs"

Solution:

Check usage stats in Dashboard
Set budget alerts
Switch to MiniMax ($0.2/1M cheapest)
Use free tier for non-critical tasks

Next Steps

Add free fallback: Free Providers
Setup subscriptions: Subscription Providers
Create combos: Dashboard → Combos → Create New

8.9 KiB Raw Blame History Unescape Escape

Cheap Providers - Ultra-Cheap Backup

Overview

GLM-4.7 (Daily Reset)

Pricing

Setup

Available Models

Pro Tips

Quota Reset

MiniMax M2.1 (5-Hour Reset)

Pricing

Setup

Available Models

Pro Tips

Quota Reset

Kimi K2 (Flat $9/month)

Pricing

Setup

Available Models

Pro Tips

Quota Reset

Pricing Comparison

Usage Example

Cursor IDE Setup

Create Combo (Recommended)

Cost Optimization

Strategy 1: Daily Reset Routine

Strategy 2: Budget-First

Strategy 3: Maximize Subscriptions First

Real-World Examples

Example 1: Heavy Coding Month (100M tokens)

Example 2: Budget Coder ($10/month)

Example 3: Freelancer (Variable Usage)

Best Practices

1. Track Daily Quota

2. Use Coding Plan (GLM)

3. Combine with Free Tier

4. Set Budget Alerts

Troubleshooting

"Quota exhausted"

"API key invalid"

"High costs"

Next Steps

8.9 KiB

Raw Blame History