
By Michael Timothy Bennett & Elija Perrier
A new artificial intelligence (AI) model has just achieved human-level results on a test designed to measure “general intelligence”.
On December 20, OpenAI’s o3 system scored 85 per cent on the ARC-AGI benchmark, well above the previous AI best score of 55 per cent and on par with the average human score. It also scored well on a very difficult mathematics test.
Creating