A new global benchmark, “Humanity’s Last Exam” (HLE), has been created to test the limits of today’s advanced artificial intelligence systems. The test consists of 2,500 rigorously reviewed questions spanning a wide range of disciplines, with an emphasis on precise, closed-ended answers. Despite high scores on conventional benchmarks, AI models struggled with HLE, answering fewer than 10% of the questions correctly when it was released in 2025. Top models have since improved markedly, however, now scoring just below 40%. The benchmark aims to identify remaining limitations and emerging generalist research capabilities in AI systems.
Source: https://www.manchester.ac.uk/about/news/mathematicians-contribute-to-ai-benchmark