Cooperative Markov Decision Process model for human–machine co-adaptation in robot-assisted rehabilitation

Publisher:
Elsevier
Publication Type:
Journal Article
Citation:
Knowledge-Based Systems, 2024, 291
Issue Date:
2024-05-12
Human–machine interaction is a critical component of robotic rehabilitation systems. Mutual learning strategies that combine machine-oriented and human-oriented learning have been shown to improve learning efficiency and user receptiveness. Despite these advances, a theoretical framework that captures high-level human responses during robot-assisted rehabilitation is still needed. This paper introduces a novel human–machine interface built on a Co-adaptive Markov Decision Process (CaMDP) model grounded in cooperative multi-agent reinforcement learning. The CaMDP model measures user adaptation to the machine, treating the entire rehabilitation process as a collaborative learning experience, and quantifies learning rates at a higher level of system abstraction. Policy Iteration from reinforcement learning is employed to cooperatively coordinate the Policy Improvement steps of the human and the machine. Simulation studies demonstrate that the proposed Policy Improvement approach can mitigate non-stationarity and substantially reduce the switching frequency of patients, from 16.6% to 2%, in a sample of 120,000 cases. Through dual-agent simulation, the CaMDP model offers valuable insights into rehabilitation-effect prediction and risk avoidance, thereby enhancing the overall performance of the rehabilitation process.
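The cooperative Policy Improvement idea described in the abstract can be illustrated with a minimal sketch: a shared-reward MDP with two agents (human and machine) whose joint action drives the transitions, where each agent greedily improves its policy while the other's policy is held fixed. This is only an illustrative toy, not the paper's CaMDP implementation; the state space, action sets, random transition model, and the alternating-improvement scheme are all assumptions made for the example.

```python
import numpy as np

# Toy cooperative MDP (illustrative assumptions, not the paper's model):
# N_STATES coarse rehabilitation-progress levels; each agent picks one of
# N_ACTIONS actions (e.g. human: comply/resist, machine: assist/challenge).
N_STATES, N_ACTIONS, GAMMA = 4, 2, 0.9

rng = np.random.default_rng(0)
# Joint transition model P[s, a_h, a_m] -> distribution over next states,
# and shared reward R[s, a_h, a_m] received by both agents.
P = rng.dirichlet(np.ones(N_STATES), size=(N_STATES, N_ACTIONS, N_ACTIONS))
R = rng.normal(size=(N_STATES, N_ACTIONS, N_ACTIONS))

def evaluate(pi_h, pi_m, tol=1e-8):
    """Policy evaluation for the fixed joint policy (pi_h, pi_m)."""
    V = np.zeros(N_STATES)
    while True:
        V_new = np.array([
            R[s, pi_h[s], pi_m[s]] + GAMMA * P[s, pi_h[s], pi_m[s]] @ V
            for s in range(N_STATES)
        ])
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new

def cooperative_policy_iteration():
    pi_h = np.zeros(N_STATES, dtype=int)  # human policy (state -> action)
    pi_m = np.zeros(N_STATES, dtype=int)  # machine policy
    while True:
        stable = True
        # Alternating improvement: each agent improves against the other's
        # *current* policy, so the environment it optimizes against stays
        # stationary during its update -- the coordination idea at the heart
        # of cooperative Policy Improvement.
        V = evaluate(pi_h, pi_m)
        for s in range(N_STATES):
            q = [R[s, a, pi_m[s]] + GAMMA * P[s, a, pi_m[s]] @ V
                 for a in range(N_ACTIONS)]
            best = int(np.argmax(q))
            if best != pi_h[s]:
                pi_h[s], stable = best, False
        V = evaluate(pi_h, pi_m)
        for s in range(N_STATES):
            q = [R[s, pi_h[s], a] + GAMMA * P[s, pi_h[s], a] @ V
                 for a in range(N_ACTIONS)]
            best = int(np.argmax(q))
            if best != pi_m[s]:
                pi_m[s], stable = best, False
        if stable:
            return pi_h, pi_m, evaluate(pi_h, pi_m)

pi_h, pi_m, V = cooperative_policy_iteration()
print("human policy:", pi_h, "machine policy:", pi_m)
```

Because the reward is shared and one agent is frozen during the other's update, each improvement step weakly increases the joint value, so the loop terminates at a joint policy from which neither agent can improve unilaterally.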