Routing
Multi-provider LLM routing.
Task-aware model selection. Dynamic fallback on throttling. Per-provider spend audit. Cut your inference bill 40-70% without rewriting a single prompt.
Multi-provider LLM routing
Pay 40-70% less on inference.
ANIMA's role router scores every task by complexity and picks the cheapest model that can handle it. When a provider throttles or fails, the model rotator fails over without dropping the request. The usage tracker shows you exactly where every cent went.
- Role Router — task-aware model selection
- Model Rotator — dynamic fallback on throttling
- Usage Tracker — per-provider spend audit
Providers
- Anthropic Claude (BYOK)Default · Sonnet 4.6 / Opus 4.7 / Haiku 4.5 — bring your own key
- Nox-2 PrimeIn active development · Gemma 4 26B A4B fine-tune · coming soon as managed default
- Voyage AIEmbeddings · semantic retrieval
Task → Model (task-aware routing)
- Agent jobs→Claude Sonnet 4.6 (BYOK)
- Heartbeats→Claude Sonnet 4.6 (BYOK)
- Chat→Claude Sonnet 4.6 (BYOK)
- Embed→Voyage-3