AI League — Game Day 7: GPT-5.5 Breaks Out with a Season-High 68.2 t/s

AI League — Game Day 7: GPT-5.5 Breaks Out with a Season-High 68.2 t/s

GPT-5.5 hits 68.2 t/s — new season-high at the 60+ index tier. Claude bumps to 63.7. Google fields the fastest pro model in the 57+ club at 138 t/s AND a 187 t/s flash unit. DeepSeek quietly +6.2 t/s. Intelligence board locked at 61. Full June 4 stats. #AILeague

AIL·Stats Board
2026. 6. 4. · 08:13
구독 1개 · 콘텐츠 7개

AI League — Game Day 7: GPT-5.5 Breaks Out, Google's Two-Speed Strategy Pays Off

GPT-5.5 posts 68.2 t/s — a new season-high. Claude bumps to 63.7. Google fields the fastest pro-tier model in the league at 138 t/s AND a 187 t/s flash unit. Intelligence board: locked. Full June 4 stats. #AILeague

Final score: June 4, 2026

The intelligence board didn't move. Five straight days, same standings. But in the speed lane, OpenAI had something to say: GPT-5.5 punched out 68.2 tokens per second today, a +5.6 t/s jump from yesterday's 62.6 — the new season-high for the 60+ index tier. 1
Claude answered with 63.7 t/s, also creeping up from 62.4 yesterday. The two-team dead heat at the top is over: OpenAI now holds a 4.5 t/s edge in raw output speed while Anthropic retains the intelligence crown by one point. Two different definitions of "winning."
Meanwhile, the Google franchise is quietly running the most efficient two-car strategy in the league. Gemini 3.1 Pro at 138.1 t/s is the fastest pro-tier model among the 57+ index club — by a wide margin. Gemini 3.5 Flash sits at 186.6 t/s, making it the fastest model in any bracket that charges under $5/M blended. 2 3

Intelligence board — unchanged, but context is building

RankTeamModelAI IndexΔBlended $/M
1AnthropicClaude Opus 4.8 (max)61$4.10
2OpenAIGPT-5.5 (xhigh)60$4.35
3OpenAIGPT-5.5 (high)59
4AnthropicClaude Opus 4.7 (max)57$4.10
4GoogleGemini 3.1 Pro Preview57$1.74
6GoogleGemini 3.5 Flash (high)55$1.31
7Kimi (challenger)Kimi K2.654$0.70
7Xiaomi (challenger)MiMo-V2.5-Pro54$0.18
9xAIGrok 4.3 (high)53$0.64
10DeepSeekDeepSeek V4 Pro (max)52$0.18
Source: Artificial Analysis Intelligence Index v4.0, June 4, 2026. 4
Five straight days of a locked top-five is unusual. The Build Fast with AI June 2026 leaderboard frames it this way: the gap from Claude Opus 4.8 at 61 to Gemini 3.1 Pro at 57 is "meaningful, but small enough that workload selection matters more than picking the leader." 5 That's the right read. When the top three teams are within 4 points of each other, price becomes the next tiebreaker — and Google wins that fight at $1.74/M versus $4.10 and $4.35 for the top two.
AI model intelligence vs price scatter plot from Artificial Analysis
Artificial Analysis Intelligence Index v4.0 — intelligence vs. price scatter across 392 models. 4

Speed panel — two records, one franchise

TeamModelSpeed (t/s)Δ vs YesterdayPrice $/M blend
OpenAIGPT-5.5 (xhigh)68.2↑ +5.6 🔥$4.35
AnthropicClaude Opus 4.8 (max)63.7↑ +1.3$4.10
GoogleGemini 3.5 Flash (high)186.6$1.31
GoogleGemini 3.1 Pro Preview138.1$1.74
xAIGrok 4.3 (high)147.5↑ +2.3$0.64
DeepSeekDeepSeek V4 Pro (max)53.7↑ +6.2$0.18
Source: Artificial Analysis output tokens per second, June 4, 2026. Speed measured as output tokens/sec after first chunk received (streaming). 4
GPT-5.5's 68.2 t/s is the new season-high for any model sitting above AI Index 59. To be clear about what's happening in the top bracket: Claude is faster than OpenAI's long-form thinking system at reasoning tasks, but OpenAI's raw output throughput is now ahead for the week. These are different clocks measuring different things, and both matter depending on your workload.
June 2026 AI model leaderboard — 10 frontier models ranked by Artificial Analysis Intelligence Index v4.0
June 2026 ten-model leaderboard — intelligence index rankings and category winners. 5
DeepSeek V4 Pro also turned in a quiet standout line today: 53.7 t/s is a +6.2 t/s improvement from yesterday's 47.5. For a model priced at $0.18/M blended, that's a real shift — the dark-horse squad is getting faster without touching price. 6

Pricing war breakdown

No price changes today across the core six. The post-promo DeepSeek rate of $0.18/M blended — permanently set after the promo clock expired on May 31 — continues to compress the midfield. Here's the full pricing grid:
TeamModelInput $/MOutput $/MBlended $/M
AnthropicClaude Opus 4.8$6.25$25.00$4.10
OpenAIGPT-5.5 (xhigh)$5.00$30.00$4.35
GoogleGemini 3.1 Pro Preview$2.00$12.00$1.74
GoogleGemini 3.5 Flash$1.50$9.00$1.31
xAIGrok 4.3 (high)$1.25$2.50$0.64
KimiKimi K2.6$0.95$4.00$0.70
DeepSeekDeepSeek V4 Pro$0.435$0.87$0.18
XiaomiMiMo-V2.5-Pro$0.18
Official API pricing as of June 4, 2026. 7 8
The math that keeps surfacing: DeepSeek V4 Pro and MiMo-V2.5-Pro both sit at $0.18/M blended, both scoring 52-54 on the intelligence index. That puts them at roughly 30× the cost efficiency of GPT-5.5. The Build Fast with AI leaderboard calculates a "capability-per-dollar" score of 171.9 for DeepSeek versus 3.8 for GPT-5.5 — a gap wide enough that any high-volume workload where frontier-adjacent intelligence suffices should be routing through the sub-$0.50 tier, not the $4+ tier. 5
Capability-per-dollar ranking: DeepSeek V4 Pro at 171.9, vs 5.6 for Claude and 3.8 for GPT-5.5
Cost-per-intelligence ranking — DeepSeek V4 Pro dominates at 171.9 capability/dollar vs. 3.8 for GPT-5.5. 5

Challenger watch

Two players continue to hold the open-weight leaderboard above DeepSeek:
Kimi K2.6 (Moonshot AI, modified MIT): AI Index 54, speed 43.7 t/s, $0.70/M blended. Open weights, 300-agent parallel orchestration support. Currently the top open-weight model in the entire 392-model AA evaluation set. 4
MiMo-V2.5-Pro (Xiaomi): AI Index 54, $0.18/M blended. Ties DeepSeek on both intelligence and price, which makes it the most efficient model in the field when running at full price. Neither challenger moved today — holding their spots one and two on the open-weight sub-leaderboard.
Anthropic Mythos is the name to watch over the next four to eight weeks. According to the June 2026 leaderboard analysis, Mythos-class models are in restricted release with approximately 50 partner organizations. When it ships publicly, it will likely push the intelligence index above 61 — and force the rest of the board to respond. For now, it's benched. 5

Post-game take

The speed story from Game Day 7 is OpenAI's sprint. GPT-5.5 at 68.2 t/s puts it 4.5 t/s clear of Claude in raw output throughput — but Claude holds a 1-point lead on intelligence and costs $0.25/M less in blended terms. There is no clean winner here; the two franchises are trading different stats in the same bracket.
Google's two-speed strategy is the less-discussed story. Gemini 3.1 Pro at 138 t/s is twice as fast as any model in the 59+ intelligence tier. Gemini 3.5 Flash at 187 t/s is the league speed record in any bracket that charges under $5/M blended. 2 3 Google doesn't have the intelligence trophy, but for latency-sensitive production workloads, no franchise is currently beating their throughput numbers at comparable price points.
DeepSeek's 53.7 t/s today — up 6.2 from yesterday — is the other quiet standout. At $0.18/M, a speed increase costs nothing. That's the low-budget dark horse doing what low-budget dark horses do: improving without fanfare.
Board stays locked heading into Game Day 8. The next intelligence index movement likely comes when Anthropic ships Mythos publicly — or when one of the incumbents blinks on price. Until then, the stats panel runs the same five names in the same five slots.
#AILeague

이 콘텐츠를 둘러싼 관점이나 맥락을 계속 보강해 보세요.

  • 로그인하면 댓글을 작성할 수 있습니다.