Routing

Multi-provider LLM routing.

Task-aware model selection. Dynamic fallback on throttling. Per-provider spend audit. Cut your inference bill 40-70% without rewriting a single prompt.

Multi-provider LLM routing

Pay 40-70% less on inference.

ANIMA's role router scores every task by complexity and picks the cheapest model that can handle it. When a provider throttles or fails, the model rotator fails over without dropping the request. The usage tracker shows you exactly where every cent went.

  • Role Router — task-aware model selection
  • Model Rotator — dynamic fallback on throttling
  • Usage Tracker — per-provider spend audit

Providers

  • Anthropic Claude (BYOK)
    Default · Sonnet 4.6 / Opus 4.7 / Haiku 4.5 — bring your own key
  • Nox-2 Prime
    In active development · Gemma 4 26B A4B fine-tune · coming soon as managed default
  • Voyage AI
    Embeddings · semantic retrieval

Task → Model (task-aware routing)

  • Agent jobsClaude Sonnet 4.6 (BYOK)
  • HeartbeatsClaude Sonnet 4.6 (BYOK)
  • ChatClaude Sonnet 4.6 (BYOK)
  • EmbedVoyage-3