Scaling qualitative insight: An agentic workflow for analysing student voices

Bakharia, A; Shibani, A; Pineda Miranda, B; Lim, L; McCluskey, T; Buckingham Shum, S

Publisher: Open Access Publishing Association
Publication Type: Conference Abstract
Citation: https://open-publishing.org/publications/index.php/APUB/issue/view/36, 2025, pp. 44-45
Issue Date: 2025-11-30

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Bakharia, A
dc.contributor.author	Shibani, A
dc.contributor.author	Pineda Miranda, B
dc.contributor.author	Lim, L
dc.contributor.author	McCluskey, T
dc.contributor.author	Buckingham Shum, S
dc.date	2025-11-30
dc.date.accessioned	2025-12-18T08:17:45Z
dc.date.available	2025-12-18T08:17:45Z
dc.date.issued	2025-11-30
dc.identifier.citation	https://open-publishing.org/publications/index.php/APUB/issue/view/36, 2025, pp. 44-45
dc.identifier.issn	2653-665X
dc.identifier.uri	http://hdl.handle.net/10453/190988
dc.language	en
dc.publisher	Open Access Publishing Association
dc.relation.ispartof	https://open-publishing.org/publications/index.php/APUB/issue/view/36
dc.relation.ispartof	ASCILITE
dc.relation.isbasedon	10.65106/apubs.2025.2728
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Scaling qualitative insight: An agentic workflow for analysing student voices
dc.type	Conference Abstract
utslib.location.activity	Adelaide
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Provost
pubs.organisational-group	University of Technology Sydney/Provost/TD School
pubs.organisational-group	University of Technology Sydney/UTS Groups
pubs.organisational-group	University of Technology Sydney/UTS Groups/The Trustworthy Digital Society
utslib.copyright.status	open_access	*
pubs.consider-herdc	true
dc.rights.license	This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). To view a copy of this license, visit https://creativecommons.org/licenses/by/4.0/
dc.date.updated	2025-12-18T08:17:43Z
pubs.finish-date	2025-12-03
pubs.publication-status	Published online
pubs.start-date	2025-11-30

Abstract:

Educators often rely on textual data from student evaluation comments and feedback survey responses to gain insights into students’ learning, to understand their perceptions of educational innovations, and to evaluate curricula with a view to improving educational practice. Such nuanced data capture individual students’ subjective perceptions and experiences, and are analysed through interpretive lenses in qualitative research (Denzin & Lincoln, 2011). However, qualitative analysis is difficult to scale to large corpora. In this poster submission, we present a novel multi-agent architecture that uses large language models (LLMs) to analyse open-text responses as a possible solution to this problem. Building on our previous LLM-based workflow (Bakharia et al., 2025), our agentic workflow responsibly automates the inductive thematic analysis process (Lochmiller, 2021) in multiple steps, including a multi-stage validation process designed to ensure analytical rigour and reliability. The workflow first finds stable themes within each document by making multiple parallel calls to an LLM, generating a wide range of candidate themes. We then use semantic clustering, which goes beyond keyword matching, to identify themes that recur across many runs. A verification step checks that all quoted evidence actually exists in the original text, preventing hallucinated quotes and grounding themes in real student voices. Next, all themes go through a refine-and-review loop: a critic agent gives feedback on the quality of each theme, and a refiner agent improves its name, rationale, and keywords. Once all documents have been processed, the system groups similar themes using hierarchical clustering to find broader categories. To support human interpretation, we built a user interface that includes a Sankey diagram showing how themes connect back to the original documents. Researchers can interact with the diagram to see the actual quotes behind each theme, providing clarity and context. Our approach emphasises trustworthiness through built-in verification and transparency at every level of abstraction. The workflow also incorporates human-in-the-loop processes to ensure rigour.
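
The abstract does not include an implementation, so the following is only a minimal Python sketch of two of the steps it describes: keeping themes that recur across several LLM runs, and checking that quoted evidence occurs in the source text. Everything here is an assumption made for illustration, including the function names, the theme dictionary layout (`name`, `quotes`), and the thresholds. In particular, the token-overlap `similarity` function is a simple stand-in for the semantic clustering mentioned above, which would more plausibly compare embeddings.

```python
import re


def normalise(text: str) -> str:
    """Lower-case and collapse whitespace so formatting differences are ignored."""
    return re.sub(r"\s+", " ", text).strip().lower()


def verify_quotes(theme: dict, document: str) -> dict:
    """Return a copy of the theme keeping only quotes found in the document."""
    doc = normalise(document)
    kept = [q for q in theme["quotes"] if normalise(q) in doc]
    return {**theme, "quotes": kept}


def similarity(a: str, b: str) -> float:
    """Jaccard overlap of word sets (hypothetical stand-in for semantic similarity)."""
    ta, tb = set(normalise(a).split()), set(normalise(b).split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def stable_themes(runs: list, threshold: float = 0.5, min_runs: int = 2) -> list:
    """Greedily group theme names across runs; keep groups seen in >= min_runs runs."""
    clusters = []  # each entry: {"theme": first theme seen, "runs": set of run indices}
    for run_index, themes in enumerate(runs):
        for theme in themes:
            for cluster in clusters:
                if similarity(theme["name"], cluster["theme"]["name"]) >= threshold:
                    cluster["runs"].add(run_index)
                    break
            else:
                # No existing cluster was close enough, so start a new one.
                clusters.append({"theme": theme, "runs": {run_index}})
    return [c["theme"] for c in clusters if len(c["runs"]) >= min_runs]
```

In this sketch, a theme proposed in only one run is discarded as unstable, and a quote that cannot be located in the document is dropped rather than repaired, so every surviving quote can be traced back to the student's own words.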

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/190988