A major new study suggests that many of the tests used to evaluate artificial intelligence are unreliable and frequently exaggerate what AI systems can actually do. Researchers at the Oxford Internet Institute, working with more than thirty collaborators from other institutions, analyzed 445 widely used AI benchmarks. These benchmarks are the primary tools developers use […]