Auto Mode: How Smart Routing Stretches Your Token Budget

Not every message you send needs GPT-5.6. File reads, shell commands, and simple edits run just as well on cheaper models. Auto mode routes each task to the most cost-effective model that can handle it, helping your pool last the full month without sacrificing quality where it matters.

How Auto Mode Works

Auto classifies each message into one of four task types: tool operations (file reads, shell commands), code generation, reasoning, or premium (complex architecture). It then picks the cheapest model that handles that task well. Tool operations go to DeepSeek Flash, code generation goes to GLM-5.2, and complex reasoning goes to Gemini 3.1 Pro or GPT-5.6. On Auto you pay a flat 1.0x: one token from your pool per token used.
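As a rough illustration of the routing described above, here is a minimal Python sketch. It is not Multos's actual implementation: the names `TaskType`, `AUTO_ROUTES`, `route`, and `auto_pool_charge` are hypothetical, the classifier itself is left out (the task type is passed in directly), and the split of Gemini 3.1 Pro for reasoning versus GPT-5.6 for premium is an assumption, since the article only says complex reasoning goes to one of the two.

```python
from enum import Enum


class TaskType(Enum):
    """The four task types Auto is described as using."""
    TOOL_OPS = "tool_ops"      # file reads, shell commands
    CODE_GEN = "code_gen"
    REASONING = "reasoning"
    PREMIUM = "premium"        # complex architecture


# Routing table based on the description above.
# ASSUMPTION: the REASONING / PREMIUM split between Gemini 3.1 Pro and
# GPT-5.6 is illustrative; the article does not say how the two are chosen.
AUTO_ROUTES = {
    TaskType.TOOL_OPS: "DeepSeek Flash",
    TaskType.CODE_GEN: "GLM-5.2",
    TaskType.REASONING: "Gemini 3.1 Pro",
    TaskType.PREMIUM: "GPT-5.6",
}

AUTO_MULTIPLIER = 1  # Auto is billed at 1.0x: one pool token per token used


def route(task_type: TaskType) -> str:
    """Return the model Auto would pick for an already-classified task."""
    return AUTO_ROUTES[task_type]


def auto_pool_charge(tokens_used: int) -> int:
    """Pool tokens deducted for a request sent through Auto."""
    return tokens_used * AUTO_MULTIPLIER


if __name__ == "__main__":
    for task in TaskType:
        print(f"{task.value}: {route(task)}")
```

In this sketch the model choice and the billing rate are kept separate on purpose: the routed model changes per task, while the Auto charge stays at face value.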

Manual Selection: You Choose, You Pay

Want a specific model? Go for it. Each model has a transparent multiplier that sets how fast your pool drains. DeepSeek Flash at 0.10x stretches your pool 10x. GLM-5.2 at 1.18x drains about 18% faster than Auto's 1.0x rate. GPT-5.6 at 3.90x drains nearly 4x faster. The multiplier is shown in the model dropdown, so you always know the cost before sending.
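The multiplier arithmetic can be sketched in a few lines of Python. This assumes the pool charge is simply tokens used times the model's multiplier; the function name `pool_charge` is hypothetical, and how fractional charges are rounded is not stated in the article, so the sketch leaves them unrounded.

```python
from decimal import Decimal

# Multipliers quoted above; Decimal avoids binary floating-point drift.
MULTIPLIERS = {
    "DeepSeek Flash": Decimal("0.10"),
    "GLM-5.2": Decimal("1.18"),
    "GPT-5.6": Decimal("3.90"),
}


def pool_charge(model: str, tokens_used: int) -> Decimal:
    """Pool tokens deducted when a model is selected manually.

    ASSUMPTION: charge = tokens used x multiplier, with no rounding.
    """
    return tokens_used * MULTIPLIERS[model]


if __name__ == "__main__":
    # Cost of the same 10,000-token request on each model.
    for model in MULTIPLIERS:
        print(f"{model}: {pool_charge(model, 10_000)} pool tokens")
```

For a 10,000-token request, this works out to 1,000 pool tokens on DeepSeek Flash, 11,800 on GLM-5.2, and 39,000 on GPT-5.6.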

What You Get Per Plan

Free: 1M tokens (Auto only). Lite: 20M tokens. Starter: 30M tokens. Pro: 75M tokens. Elite: 100M tokens. On Auto, these amounts are face value. On manual selection, the effective amount depends on the model: cheaper models stretch further, premium models use more. The dropdown shows your effective balance for each model in real time.
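To make "effective balance" concrete, the sketch below assumes it is the plan's pool divided by the model's multiplier, rounded down. That formula is an inference from the multipliers quoted earlier in the article, not a documented rule, and the names `PLAN_POOLS`, `MULTIPLIERS_X100`, and `effective_balance` are hypothetical.

```python
# Plan pools in tokens, as listed above.
PLAN_POOLS = {
    "Free": 1_000_000,
    "Lite": 20_000_000,
    "Starter": 30_000_000,
    "Pro": 75_000_000,
    "Elite": 100_000_000,
}

# Multipliers stored as integer hundredths (0.10x -> 10) so the
# division below stays in exact integer arithmetic.
MULTIPLIERS_X100 = {
    "DeepSeek Flash": 10,
    "GLM-5.2": 118,
    "GPT-5.6": 390,
}


def effective_balance(plan: str, model: str) -> int:
    """Tokens a full pool would cover on one manually selected model.

    ASSUMPTION: effective balance = pool / multiplier, rounded down.
    """
    if plan == "Free":
        # The Free plan is Auto only, so manual selection does not apply.
        raise ValueError("Free plan does not allow manual model selection")
    return PLAN_POOLS[plan] * 100 // MULTIPLIERS_X100[model]


if __name__ == "__main__":
    for model in MULTIPLIERS_X100:
        print(f"Pro on {model}: {effective_balance('Pro', model):,} tokens")
```

Under these assumptions, a Pro pool of 75M covers 750,000,000 tokens on DeepSeek Flash but about 19.2M tokens on GPT-5.6.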

Why This Beats Fixed Model Pricing

Other platforms charge per-request or give you a fixed number of "premium" and "fast" requests. With Multos, you have one pool and full flexibility. Use Auto for maximum efficiency, or manually pick models when you need specific capabilities. No separate buckets, no wasted fast credits, no surprise overages.

Ready to Get Started?

Start with 1M free tokens — Auto mode picks the best model for every task.
