Definitive reference for AI model selection. 119 models × 55+ categories × 54 benchmarks. Sources: LM Council, Artificial Analysis, Scale AI SEAL, BFCL V4, BenchLM.ai, MTEB, Zyte, Speko.
Click header to sort. Filter by tier or search by name.
| Model | $/M in→out | Ctx | Max Out | Caps | SWE-V% | SWE-Pro% | GPQA% | HLE% | ARC-AGI-2% | Tau2% | BenchLM | Best for |
|---|
Quality #1/#2 — best by benchmark. Budget #1/#2 — best under $1/M input. Free — $0.
26 models. MTEB scores, pricing, OpenRouter IDs. For RAG use Retrieval NDCG@10, not average MTEB.
| Model | OR model ID | MTEB | Dims | Context | $/M tokens | Best for |
|---|
Which embedding model for which use case? Best → Budget → Free.
| Use Case | Best Model | Budget Alt | Free Alt |
|---|
27 free models on OpenRouter + 5 CLI tools. Which one to use for which task?
| Task | Best FREE (OR) | Backup FREE (OR) | FREE CLI |
|---|
Which model to use for each pipeline phase.
Cache = real savings. Batch tasks with repeated system prompts reduce input cost by 50–90%.
Fastest lookup. Verified 2026-04-01.
Only unsaturated benchmarks with real model differentiation.