
ARC-AGI-2: Leading AI models fail new test of artificial general intelligence
The ARC-AGI-2 benchmark is designed to be a difficult test for AI models Just_Super/Getty Images The most sophisticated AI models in existence today have scored poorly on a new benchmark designed to measure their progress towards artificial general intelligence (AGI) – and brute-force computing power won’t be enough to improve, as evaluators are now taking…