Reinforcement learning in information searching

Cen, Y; Gan, L; Bai, C

Publisher: University of Sheffield
Publication Type: Journal Article
Citation: Bai, Chen, Gan, Liren, and Cen, Yonghua 2013, 'Reinforcement learning in information searching', Information Research, vol. 18, no. 1, pp. 1-24.
Issue Date: 2013

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Cen, Y	en_US
dc.contributor.author	Gan, L	en_US
dc.contributor.author	Bai, C	en_US
dc.date.accessioned	2014-04-03T02:50:02Z
dc.date.available	2014-04-03T02:50:02Z
dc.date.issued	2013	en_US
dc.identifier	2012006221	en_US
dc.identifier.citation	Bai, Chen, Gan, Liren, and Cen, Yonghua 2013, 'Reinforcement learning in information searching', Information Research, vol. 18, no. 1, pp. 1-24.	en_US
dc.identifier.issn	1368-1613	en_US
dc.identifier.other	C1	en_US
dc.identifier.uri	http://hdl.handle.net/10453/23951
dc.description.abstract	Introduction. The study seeks to answer two questions: How do university students learn to use correct strategies to conduct scholarly information searches without instructions? and, What are the differences in learning mechanisms between users at different cognitive levels? Method. Two groups of users, thirteen first year undergraduate students (freshmen) and thirty-four final year undergraduate students (seniors), were recruited into our experimental study and executed ten different search tasks independently. Five reinforcement learning models were introduced to quantitatively simulate the micro process of users' self-regulated learning of search expertise by trial and error. Analysis. The experimental data were divided into two parts. The first 70% of the data was used to estimate the parameters of each model. The remaining 30% was fitted by the estimated models. The model best fitting the data of users in each group was used to explain their learning behaviour. Results. Most undergraduates tended to repeat the strategies that brought success in their earlier experiences. Freshmen's learning behaviour manifested remarkable Markov properties. Their strategy selection was always made according to the feedback obtained in the last search activity. Seniors' strategy adjustment depended on the accumulated effect of past strategy adoptions. They displayed strong characteristics of rational thinking. Conclusions. In the process of learning searching expertise, users demonstrate reinforcement characteristics. Moreover, users at different cognitive levels exhibit different reinforcement patterns. Theoretical and practical implications were proposed from the perspectives of training programme design, adaptive information retrieval system design and information behaviour model development.	en_US
dc.publisher	University of Sheffield	en_US
dc.relation.ispartof	Information Research	en_US
dc.relation.ispartof	Verified OK	en_US
dc.relation.isbasedon	NA	en_US
dc.subject.classification	Library and Information Studies	en_US
dc.title	Reinforcement learning in information searching	en_US
dc.type	Journal article
utslib.citation.volume	18	en_US
utslib.citation.number	1	en_US
utslib.location	UK	en_US
utslib.citation.startpage	1	en_US
utslib.citation.endpage	24	en_US
utslib.identifier.org	FEIT.Faculty of Engineering & Information Technology	en_US
utslib.for	080700	en_US
utslib.percentage	100	en_US
utslib.copyright.status	open_access

Abstract:

Introduction. The study seeks to answer two questions: How do university students learn to use correct strategies to conduct scholarly information searches without instructions? and, What are the differences in learning mechanisms between users at different cognitive levels?

Method. Two groups of users, thirteen first year undergraduate students (freshmen) and thirty-four final year undergraduate students (seniors), were recruited into our experimental study and executed ten different search tasks independently. Five reinforcement learning models were introduced to quantitatively simulate the micro process of users' self-regulated learning of search expertise by trial and error.

Analysis. The experimental data were divided into two parts. The first 70% of the data was used to estimate the parameters of each model. The remaining 30% was fitted by the estimated models. The model best fitting the data of users in each group was used to explain their learning behaviour.

Results. Most undergraduates tended to repeat the strategies that brought success in their earlier experiences. Freshmen's learning behaviour manifested remarkable Markov properties. Their strategy selection was always made according to the feedback obtained in the last search activity. Seniors' strategy adjustment depended on the accumulated effect of past strategy adoptions. They displayed strong characteristics of rational thinking.

Conclusions. In the process of learning searching expertise, users demonstrate reinforcement characteristics. Moreover, users at different cognitive levels exhibit different reinforcement patterns. Theoretical and practical implications were proposed from the perspectives of training programme design, adaptive information retrieval system design and information behaviour model development.
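
The analysis procedure described in the abstract (estimate parameters on the first 70% of a user's search history, then check the fit on the remaining 30%) can be illustrated with a minimal Python sketch. The abstract does not name the five models, so the two learners below are assumptions chosen only to mirror the contrast it reports: a cumulative learner whose propensities build up over past outcomes, and a last-feedback (Markov) learner that reacts only to the most recent search. All function names, the decay parameter `phi`, the baseline weight and the toy data are hypothetical and are not taken from the paper.

```python
import math


def cumulative_update(phi):
    """Hypothetical cumulative learner: past propensities decay by phi,
    and the reward is added to the strategy that was just used."""
    def update(q, strategy, reward):
        q = [(1.0 - phi) * v for v in q]
        q[strategy] += reward
        return q
    return update


def last_feedback_update(q, strategy, reward):
    """Hypothetical Markov learner: propensities depend only on the last trial."""
    new_q = [0.0] * len(q)
    new_q[strategy] = reward
    return new_q


def choice_probs(q, baseline=1.0):
    """Turn propensities into choice probabilities (baseline keeps them positive)."""
    weights = [baseline + v for v in q]
    total = sum(weights)
    return [w / total for w in weights]


def log_likelihood(trials, n_strategies, update, score_from=0):
    """Log-probability of the observed choices, scored from index score_from on.
    Earlier trials still update the learner's state."""
    q = [0.0] * n_strategies
    ll = 0.0
    for t, (strategy, reward) in enumerate(trials):
        if t >= score_from:
            ll += math.log(choice_probs(q)[strategy])
        q = update(q, strategy, reward)
    return ll


def fit_and_score(trials, n_strategies, phis=(0.0, 0.1, 0.3, 0.5)):
    """Pick phi on the first 70% of trials, then score both learners on the rest."""
    cut = len(trials) * 7 // 10
    train = trials[:cut]
    best_phi = max(
        phis,
        key=lambda p: log_likelihood(train, n_strategies, cumulative_update(p)),
    )
    return {
        "cut": cut,
        "best_phi": best_phi,
        "cumulative": log_likelihood(
            trials, n_strategies, cumulative_update(best_phi), score_from=cut
        ),
        "last_feedback": log_likelihood(
            trials, n_strategies, last_feedback_update, score_from=cut
        ),
    }


# Toy history for one user: (strategy index, reward), reward 1 = successful search.
toy_trials = [(0, 1), (0, 1), (1, 0), (0, 1), (0, 1),
              (1, 0), (0, 1), (0, 1), (0, 1), (0, 1)]
result = fit_and_score(toy_trials, n_strategies=2)
print(result)
```

In this sketch the learner with the higher (less negative) held-out log-likelihood would be read as the better description of that user's behaviour. The grid search over `phi` stands in for whatever estimation method the authors actually used, which the abstract does not specify.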

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/23951