News on Artificial Intelligence in Education and Libraries
Monday, May 6, 2024
AI models are getting better at grade school math — but a new study suggests they may be cheating
Large language models (LLMs) that power chatbots like ChatGPT may be getting better at answering benchmark questions that measure mathematical reasoning. But this may actually be a bad thing: a new study suggests the gains may partly reflect data contamination, which occurs when data resembling benchmark questions leaks into training data.