General scales unlock AI evaluation with explanatory and predictive power
Nature, Published online: 01 April 2026; doi:10.1038/s41586-026-10303-2A fully automated methodology based on rubrics capturing a broad range of cognitive and intellectual demands is illustrated us...
Source: www.nature.com
Nature, Published online: 01 April 2026; doi:10.1038/s41586-026-10303-2A fully automated methodology based on rubrics capturing a broad range of cognitive and intellectual demands is illustrated using LLMs and tasks, demonstrating a new way to evaluate the capabilities of AI systems and anticipate their performance.