These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models

01:25 17.02.2025
Every Sunday, NPR host Will Shortz, The New York Times’ crossword puzzle guru, gets to quiz thousands of listeners in a long-running segment called the Sunday Puzzle. While written to be solvable without too much foreknowledge, the brainteasers are usually challenging even for skilled contestants. That’s why some experts think they’re a promising way to […] © 2024 TechCrunch. All rights reserved. For personal use only....
  385