澳门跑狗论坛

Assessment

AI May Be Coming for Standardized Testing

The international PISA exam plans to experiment with AI tasks and scoring
By Alyson Klein, March 25, 2024

The highest-profile international test may soon offer clues as to how artificial intelligence can be harnessed to create and score assessments that paint a more detailed picture of how students learn.

The 2025 edition of the Program for International Student Assessment, or PISA, is slated to include performance tasks probing how students approach learning and solve problems, said Andreas Schleicher, the director for education and skills, and special advisor on education policy to the secretary-general at the Organization for Economic Cooperation and Development. He was speaking at a conference of the Council of Chief State School Officers this month in Washington.

Along with PISA's traditional questions on reading, math, and science, the proposal includes an AI-powered twist.

“We are going to incorporate giving them tasks to learn and we’re going to track how they approached” the assignment, to get a sense of how students think critically and creatively, he said.

The tasks would be scored, at least in part, by AI, Schleicher added.

Students would be able to use an AI-powered chatbot to complete their work. They could ask it basic questions about a topic, so that the test could focus on their thinking capability, not whether they possess background knowledge of a particular subject.

“Otherwise, many kids [would] struggle simply because they haven’t learned something specific,” Schleicher said in a follow-up interview.

This could be a step towards figuring out how AI can help educators achieve a long-elusive goal: Creating a new breed of assessments that actually helps inform teaching and learning in real time, he said.

“I think one of the greatest mistakes that we have made in the history of education is to divorce learning from assessment,” Schleicher told the chiefs. “We have young kids pile up years and years and years and years of learning. And then one day we call them back and say, ‘Tell me everything you know’ in this contrived setting. And that has been superficial, that has made teaching superficial.”

OECD administers PISA to 15-year-olds every three years in reading, math, and science, with a special focus on a different subject each time. Some 620,000 students in 38 mostly developed countries and a total of 81 education systems, including four in China, participated in the most recent PISA, in 2022.

‘More freedom to be innovative’

OECD is still working out the details of the AI-informed performance task. But the types of assignments students could begin to tackle include designing a laboratory experiment to test a particular hypothesis or developing an advertising campaign.

AI tools also could be used to help score the tasks, mimicking the role of human scorers. In explaining how this kind of grading would work, Schleicher cited an example he saw at Beijing Normal University in China, in which music students were presented with half a song and asked to compose the remaining half. The assignments were scored both by trained musicians and by AI.

Even though the task is a creative one, the AI scores eventually began to match those of the professionals.

Unlike the annual reading and math tests that U.S. states are required to administer, PISA is only given to a sample of students and is not used for accountability purposes. The test doesn’t generate scores for specific students or even schools.

Because PISA “doesn’t matter for individual students, we have a lot more freedom to be more innovative,” Schleicher said in an interview.

Although PISA is often used to compare different countries’ educational systems, the performance task may not figure into a nation’s overall scores, at least initially, Schleicher said. OECD experts will likely want to see how students’ mastery of the task compares to the more traditional parts of the test, he explained.

‘It would be phenomenal’

Getting AI to assess students’ thinking skills (how they approach learning and process information) could be a game-changer not just for assessment but for teaching and learning, said Scott Marion, the executive director of the National Center for the Improvement of Educational Assessment.

It could start a conversation “maybe on a countrywide basis, maybe just globally at first [about] how kids are approaching problems. And what does that portend for instruction?” Marion said.

Marion still has questions about exactly how the process would work. But he’s hoping that eventually the technology PISA is experimenting with could provide more specific information about how individual students learn.

“Teachers could get really deep feedback on how kids are interacting” with performance tasks and “where they’re struggling, where they’re getting stuck on these things,” Marion said.

Some teachers can do that now, using data from performance tasks, but it “requires a lot of skill, and requires a lot of practice. So that’s why it doesn’t happen,” Marion said. “If AI could shorten that process and improve interpretations teachers make from assessments of all sorts, it would be phenomenal.”

‘Everyone’s gonna be watching it very closely’

State chiefs were also intrigued.

“You have a substantive assessment organization, making a first move in the direction” to see how AI can help assessments capture skills other tests struggle to measure, said Frank Edelblut, New Hampshire’s state chief.

He wants to know: “How is it going to be deployed? What do we expect out of this? Everyone’s going to be watching it very closely.”

The news came as a “pleasant surprise” to Kirsten Baesler, North Dakota’s state chief. “I do believe there’s an ability for us to leverage AI” to improve testing, including interim tests given over the course of the school year to offer a real-time snapshot of student performance.

“We’ve been long talking about performance assessments, and I think to have international leaders also looking at it, I think we’ll go further, faster,” Baesler added.
