OpenAI announces o3 and o3-mini, its next simulated reasoning models

o3 matches human levels on ARC-AGI benchmark, and o3-mini exceeds o1 at some tasks.