
AI is better than undergraduates at exams

01 July 2024


Not so good when abstract reasoning was required

A top boffin at the University of Reading, Peter Scarfe, has discovered that AI passes exams much better than most undergraduates.

Scarfe ran an experiment to test the vulnerability of the university's examination system to AI-generated answers.

Scarfe's team submitted over 30 AI-generated answers across multiple undergraduate psychology modules. More than 94 per cent of these submissions went undetected, and nearly 84 per cent received higher grades than their human counterparts.

The experiment involved five undergraduate modules spanning all three years of a psychology degree. The AI-generated assignments included 200-word answers and more elaborate 1,500-word essays.

Scarfe's team submitted the answers without editing them, beyond minimal essay formatting. They simply copied and pasted the AI's output, keeping the answers within the required word limits.

Despite no effort to conceal the AI usage, 63 AI-generated submissions slipped into the examination system. The few that were flagged weren't caught for repetition or robotic language; they were flagged because they were too good.

The AI didn't fare as well in the final module, where students must demonstrate deeper insight and analytical skill. Large language models still struggle with abstract reasoning.

While the AI excelled in first- and second-year exams, where the questions were more accessible, the technology is evolving, and Scarfe predicts it will perform better on advanced assignments.

As AI becomes ubiquitous, universities must adapt. Detecting AI cheating is challenging, so integrating AI into the education system is inevitable, Scarfe concluded.
