AI WAR ROOM — 350+ AI models, ranked by live arena ELO

AI WAR ROOM is an independent, daily-updated leaderboard of the strongest large language models in the world. We pull live ELO scores from arena.ai's human-vote battle system, combine them with current pricing from OpenRouter, and present the entire field — 350+ models from OpenAI, Anthropic, Google, xAI, Meta, Alibaba, DeepSeek, Moonshot, Z.ai, Mistral, Baidu, ByteDance, Xiaomi, Tencent and more — in one ranked, filterable view.

Every score comes from real human preference: anonymous head-to-head battles where voters pick the better of two responses. The result is a relative skill rating — the same ELO system used in competitive chess — that is harder to game than benchmark numbers and more useful than marketing claims.

What you'll find here

Leaderboard — every active model ranked by ELO, with confidence intervals, votes, live pricing, context length, licence, and per-model descriptions.
The Guide — a plain-English walkthrough of how arena ELO works, the 2026 lab landscape, how to pick a model, and common pitfalls.
FAQ — honest answers to the questions we get most often about ratings, pricing, thinking variants, open-weight models, and arena mechanics.
Blog — long-form analysis on reading ELO leaderboards, open vs closed weights in 2026, and when thinking models help or hurt.
Model and lab profiles — in-depth pages for the top frontier models and the labs building them.
Methodology — the exact rules behind the rankings, the ELO tiers (S/A/B/C), and how pricing data is sourced.

How to choose a model

The #1 model is rarely the model you want. Top-ranked frontier models are the most expensive, slowest, and built for the hardest reasoning. For most real workloads — chat, summarisation, classification, RAG pipelines — a model in positions 5–20 will perform indistinguishably for a fraction of the cost. Start by sorting on ELO to identify the top tier, then filter on context window, licence (open vs proprietary), and price. Run your own evals on the three candidates with the best fit. The arena gives you the shortlist; your own judgement picks the winner.

How the data is sourced

ELO ratings come live from arena.ai, refreshed every 30 minutes. Pricing comes live from OpenRouter, refreshed on the same cadence. Model descriptions are maintained by hand from public lab announcements and model cards. AI WAR ROOM is not affiliated with any AI lab, with arena.ai, or with OpenRouter — we surface their public data and add editorial context.

Start exploring

Open the leaderboard · Read the guide · Browse the FAQ · Visit the blog · See the methodology

Loading interactive view… If you're seeing this for more than a few seconds, JavaScript may be disabled in your browser. The content above is a summary; the full leaderboard requires JavaScript to load.