There’s a problem with leading artificial intelligence tools like ChatGPT, Gemini and Claude: We don’t really know how smart they are. That’s because, unlike companies that make cars or drugs or baby formula, A.I.
ORIGINAL LINK: https://www.nytimes.com/2024/04/15/technology/ai-models-measurement.html