Live AI intelligence

Choose the right AI model faster.

Start with the benchmark-weighted leaderboard, check the frontier line, then scan what changed today.

42 ranked LLMs

17 weighted benchmarks

20 digest stories

Latest full refresh: 11 Apr 2026

Homepage ranking

The benchmark-weighted composite AI leaderboard.

Open the full ranking

Latest tracked release: Gemma 4 31B on 2 Apr 2026. It stays out of the scored set until public benchmark and quality coverage are strong enough to rank it honestly.

| # | Model | Composite | Bench | Coverage | Price |
|---|---|---|---|---|---|
| 01 | o3 (OpenAI / General use / 200K context; API, Vision) | 86.9 | 86.4 | 11 tracks, 68% weighted | $2.00 / $8.00 |
| 02 | Gemini 2.5 Pro Preview 06-05 (Google / General use / 1.0M context; API, Vision, Audio) | 82.1 | 81.7 | 11 tracks, 68% weighted | $1.25 / $10.00 |
| 03 | GPT-5.2 (OpenAI / Chat / 400K context; API, Vision) | 79.8 | 77.9 | 8 tracks, 53% weighted | $1.75 / $14.00 |
| 04 | R1 (DeepSeek / General use / 64K context; Open, API) | 78.2 | 75.5 | 11 tracks, 68% weighted | $0.70 / $2.50 |
| 05 | Grok 4 (xAI / Chat / 256K context; API, Vision) | 74.6 | 77.4 | 7 tracks, 46% weighted | $3.00 / $15.00 |
| 06 | Grok 3 Beta (xAI / Coding / 131K context; API) | 73.2 | 85.0 | 6 tracks, 38% weighted | $3.00 / $15.00 |
| 07 | Claude Opus 4.6 (Anthropic / Chat / 1.0M context; API, Vision) | 72.5 | 74.8 | 7 tracks, 45% weighted | $15.00 / $75.00 |
| 08 | Claude Opus 4 (Anthropic / General use / 200K context; API, Vision) | 72.1 | 83.0 | 6 tracks, 39% weighted | $15.00 / $75.00 |
| 09 | Claude Sonnet 4 (Anthropic / General use / 1.0M context; API, Vision) | 70.3 | 77.9 | 7 tracks, 43% weighted | $3.00 / $15.00 |
| 10 | o4 Mini (OpenAI / Coding / 200K context; API, Vision) | 69.5 | 91.0 | 4 tracks, 26% weighted | $1.10 / $4.40 |

Top evaluated model

o3

OpenAI's o3 currently tops the evaluated benchmark set with a composite score of 86.9.

Benchmark score: 86.4
Coverage: 68%
Best for: General use

Newer tracked launch: Gemma 4 31B. Release coverage is live before it becomes rankable.

Best open model

R1

The strongest open-weight entry on the weighted ranking right now, with benchmark coverage baked into the score.

Open source shortlist

Best value

Mistral Nemo

Strongest quality-per-cost ratio in the current leaderboard, useful when performance still has to fit a budget.

Full value ranking

Breaking news / daily digest

The current brief.

Open the news desk

9 Apr 2026 digest with 20 stories from 677 sources.

Updated data

Pipeline freshness.

Method and sources

Today in AI

The launch birthdays and lab dates that matter.

Open AI Milestones

No exact anniversary lands today. The next one is the Llama 3 release anniversary, in 7 days.

Latest activities

The site changelog, in live form.

Open full activity log

The homepage composite score is a coverage-aware blend of benchmark-normalized results and the existing quality layer. The AGI panel is a derived frontier signal built from ARC-AGI, GPQA Diamond, Humanity’s Last Exam, MMLU-Pro, SWE-bench Verified, and Chatbot Arena. Read the methodology before treating any ranking as gospel.
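
For a rough feel of what a coverage-aware blend can look like, here is a minimal sketch. It is not the site's actual formula, which lives in the methodology. The function name `blend_composite`, the linear weighting, and the example numbers are all assumptions made for illustration: the sketch simply leans on the benchmark score in proportion to weighted coverage and falls back on the quality score for the rest.

```python
def blend_composite(bench_score: float, quality_score: float, coverage: float) -> float:
    """Hypothetical coverage-aware blend (an assumption, not the site's method).

    bench_score   -- benchmark-normalized score on a 0-100 scale
    quality_score -- score from the separate quality layer on a 0-100 scale
    coverage      -- weighted benchmark coverage as a fraction between 0 and 1
    """
    if not 0.0 <= coverage <= 1.0:
        raise ValueError("coverage must be between 0 and 1")
    # More coverage means more trust in the benchmark score;
    # less coverage shifts weight toward the quality layer.
    return coverage * bench_score + (1.0 - coverage) * quality_score


# Illustrative numbers only, not taken from the leaderboard.
print(blend_composite(80.0, 60.0, 0.5))  # 70.0
```

Under this assumed weighting, a model with a high benchmark score but thin coverage gets pulled toward its quality score, which is one way a model could post a strong benchmark number and still sit lower on the composite.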