OpenAI releases GDPval, a benchmark to test AI per

Techmeme · 2025-09-25T22:36:07+0530

Maxwell Zeff / TechCrunch:
OpenAI releases GDPval, a benchmark to test AI performance on “economically valuable, real-world tasks”, and says Claude Opus 4.1 was the best performing model — OpenAI released a new benchmark on Thursday that tests how its AI models perform compared to human professionals across a wide range of industries and jobs.

Search

Search

OpenAI releases GDPval, a benchmark to test AI per

Techmeme

Guest