AI can pass almost all level 3 assessments, study finds

OU recommends that institutions should focus on question design rather than detecting AI misuse

OU recommends that institutions should focus on question design rather than detecting AI misuse

Artificial intelligence (AI) tools can pass almost all types of level 3 assessments, a new study has found.

The Open University found that AI performed “particularly highly” at this advanced level across a range of subjects, although its performance was lower at higher levels of 4 and above.

It also found that while markers’ ability to detect generative AI answers increased after training, this was undermined by an increased number of false positives.

Jonquil Lowe, senior lecturer in economics and personal finance, said that rather than focusing on detection, colleges and universities should use AI to design more “robust questions” that focus on the “added value” that humans bring.

He added: “This shifts us away from merely testing knowledge, towards what is often called ‘authentic assessment’ that requires explicit application of what has been learned in order to derive specific conclusions and solutions.”

The study confirms fears raised in a recent FE Week investigation, that students can ‘cheat’ their way through almost any non-exam assignment by using large language models of generative AI such as ChatGPT.

It also addresses concerns that AI detection tools are unreliable, giving rise to false accusations and a breakdown of trust between educators and students.

The study, funded by awarding body and education charity, NCFE, analysed generative AI’s performance by asking a group of 43 markers to grade almost one thousand scripts and to flag those they suspected were AI-generated.

A review of the results found that the most robust assessment types were audience-tailored, observation by learner and reflection on work practice.

While the study found that subjects did not affect AI’s performance, certain disciplines such as law, were easier to detect.

False positives emerged as “hallmarks” of AI-generated scripts, such as superficial answers or not focusing on the question, are also common in weaker students’ work.

The study recommends that institutions designing assessments should focus on question design marking guidance and student skills interventions rather than detecting AI misuse.

When students are identified as using AI in their assessments, institutions should focus on helping them develop their study skills.

Training for dealing with generative AI in assessments should also be ongoing.

Gray Mytton, assessment innovation manager at NCFE, said: “This report highlights the challenges in detecting genAI misuse in assessments, showing that training markers to spot AI-generated content can lead to an increase in the rate of false positives.  

“To address this, educators could help students develop study skills, including genAI use where appropriate, while awarding bodies can focus on creating more authentic assessments, which will also benefit learners as they enter the workforce.”

Read the full report here.

Latest education roles from

Principal & Chief Executive – Bath College

Principal & Chief Executive – Bath College

Dodd Partners

IT Technician

IT Technician

Harris Academy Morden

Teacher of Geography

Teacher of Geography

Harris Academy Orpington

Lecturer/Assessor in Electrical

Lecturer/Assessor in Electrical

South Gloucestershire and Stroud College

Director of Management Information Systems (MIS)

Director of Management Information Systems (MIS)

South Gloucestershire and Stroud College

Exams Assistant

Exams Assistant

Richmond and Hillcroft Adult & Community College

Sponsored posts

Sponsored post

Skills Bootcamps Are Changing – What FE Colleges Must Know 

Skills Bootcamps are evolving as funding moves to local control and digital skills trends shift. Code Institute, an Ofsted...

Code Institute
Sponsored post

Building Strong Leadership for Effective T Level Implementation

Are you struggling with T Level curriculum and implementation, or building strong employer relationships? Do you want to develop...

Advertorial
Sponsored post

Derby College Group DIRT and TOES: A Story of Enhanced Learning and Reduced Workload

"Feedback is one of the most powerful influences on learning and achievement" - Hattie and Timperley 2007. This powerful...

Advertorial
Sponsored post

Keeping it real – enriching T Level teaching with Industry Insights

T Level teachers across all subjects are getting invaluable support from the Education and Training Foundation’s (ETF) Industry Insights...

Advertorial

More from this theme

Assessment

Ofqual demands ‘honesty’ in new rules for awarding organisations

Proposals include 'principles' that could see sanctions on organisations that undermine public confidence in qualifications

Shane Chowen
Assessment, Curriculum

Revealed: Two college principals appointed to Labour’s curriculum review

Academy trust leaders, a SEND consultant and exam experts among those appointed to work with Professor Becky Francis

Freddie Whittaker
Assessment, Long read

AI’s threat to the essence of learning is far bigger than we realise

FE Week has uncovered significant evidence of the harm that generative AI is doing to the integrity of assessment,...

Jessica Hill
Assessment

Ofqual and DfE studying ‘feasibility’ of ‘fully digital’ exams

Study includes looking at whether national infrastructure up to scratch for on-screen exams

Samantha Booth

Your thoughts

Leave a Reply

Your email address will not be published. Required fields are marked *