OpenAI's GPT-5.2 Tops Independent Coding Benchmarks, Closes Gap on Reasoning

Jun 13, 2026The Verge

Third-party evaluators say GPT-5.2 now leads on SWE-bench Verified and narrows the reasoning gap with rival frontier models.

Independent benchmarking groups report that OpenAI's GPT-5.2 has taken the top spot on SWE-bench Verified, a widely used measure of real-world software engineering ability, edging out competing frontier models for the first time since GPT-5 launched.

Evaluators also note meaningful gains on multi-step reasoning tasks, though they caution that the gap with rival labs remains within the margin of normal benchmark volatility. OpenAI has not commented on the results beyond confirming GPT-5.2 is now generally available via the API.

Originally published by The Verge.

#LLMs #OpenAI #AI