pFedRAG: A Personalized Federated Retrieval-Augmented Generation System with Depth-Adaptive Tiered Embedding Tuning

He, H; Yuan, X; Wu, K; Liu, RP; Ni, W

pFedRAG: A Personalized Federated Retrieval-Augmented Generation System with Depth-Adaptive Tiered Embedding Tuning

He, H Yuan, X Wu, K

Liu, RP Ni, W

Permalink

Publisher:: Association for Computational Linguistics (ACL)
Publication Type:: Conference Proceeding
Citation:: Emnlp 2025 2025 Conference on Empirical Methods in Natural Language Processing Findings of Emnlp 2025, 2025, pp. 14255-14268
Issue Date:: 2025-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (975.6 kB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	He, H
dc.contributor.author	Yuan, X
dc.contributor.author	Wu, K https://orcid.org/0000-0003-1298-6404
dc.contributor.author	Liu, RP
dc.contributor.author	Ni, W https://orcid.org/0000-0002-4933-594X
dc.date	2025-11
dc.date.accessioned	2026-05-11T00:43:47Z
dc.date.available	2026-05-11T00:43:47Z
dc.date.issued	2025-01-01
dc.identifier.citation	Emnlp 2025 2025 Conference on Empirical Methods in Natural Language Processing Findings of Emnlp 2025, 2025, pp. 14255-14268
dc.identifier.uri	http://hdl.handle.net/10453/194919
dc.description.abstract	Large Language Models (LLMs) can undergo hallucinations in specialized domains, and standard Retrieval-Augmented Generation (RAG) often falters due to general-purpose embeddings ill-suited for domain-specific terminology. Though domain-specific fine-tuning enhances retrieval, centralizing data introduces privacy risks. The use of federated learning (FL) can alleviate this to some extent, but faces challenges of data heterogeneity, poor personalization, and expensive training data generation. We propose pFedRAG, a novel Personalized Federated RAG framework, which enables efficient collaborative fine-tuning of embedding models to address these challenges. The key contribution is a new Depth-Adaptive Tiered Embedding (DATE) architecture, which comprises a Global Shared Layer, combined using FL to capture common knowledge, and a Personalized Layer with adjustable depth tailored for local data and training results of each client. The depth is locally controlled based on crafted metrics and scoring criteria. Also, pFedRAG incorporates a fully client-side pipeline leveraging local small LLMs and vector database filtering to construct high-quality query-document pairs. Experiments on diverse medical non-IID document datasets demonstrate that pFedRAG significantly reduces communication costs, handles data heterogeneity, and improves retrieval performance. Human evaluations confirm the enhanced response quality of pFedRAG.
dc.language	en
dc.publisher	Association for Computational Linguistics (ACL)
dc.relation.ispartof	Emnlp 2025 2025 Conference on Empirical Methods in Natural Language Processing Findings of Emnlp 2025
dc.relation.ispartof	Findings of the Association for Computational Linguistics: EMNLP 2025
dc.relation.isbasedon	10.18653/v1/2025.findings-emnlp.769
dc.rights	info:eu-repo/semantics/openAccess
dc.title	pFedRAG: A Personalized Federated Retrieval-Augmented Generation System with Depth-Adaptive Tiered Embedding Tuning
dc.type	Conference Proceeding
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	University of Technology Sydney/UTS Groups
pubs.organisational-group	University of Technology Sydney/UTS Groups/Global Big Data Technologies Centre (GBDTC)
pubs.organisational-group	University of Technology Sydney/UTS Groups/Cyber Digital Centre (CDC)
pubs.organisational-group	University of Technology Sydney/UTS Groups/Cyber Digital Centre (CDC)/Cyber Digital Centre (CDC) Associate Members
utslib.copyright.status	open_access	*
dc.rights.license	This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). To view a copy of this license, visit https://creativecommons.org/licenses/by/4.0/
dc.date.updated	2026-05-11T00:43:45Z
pubs.finish-date	2025-11
pubs.publication-status	Published
pubs.start-date	2025-11

Abstract:

Large Language Models (LLMs) can undergo hallucinations in specialized domains, and standard Retrieval-Augmented Generation (RAG) often falters due to general-purpose embeddings ill-suited for domain-specific terminology. Though domain-specific fine-tuning enhances retrieval, centralizing data introduces privacy risks. The use of federated learning (FL) can alleviate this to some extent, but faces challenges of data heterogeneity, poor personalization, and expensive training data generation. We propose pFedRAG, a novel Personalized Federated RAG framework, which enables efficient collaborative fine-tuning of embedding models to address these challenges. The key contribution is a new Depth-Adaptive Tiered Embedding (DATE) architecture, which comprises a Global Shared Layer, combined using FL to capture common knowledge, and a Personalized Layer with adjustable depth tailored for local data and training results of each client. The depth is locally controlled based on crafted metrics and scoring criteria. Also, pFedRAG incorporates a fully client-side pipeline leveraging local small LLMs and vector database filtering to construct high-quality query-document pairs. Experiments on diverse medical non-IID document datasets demonstrate that pFedRAG significantly reduces communication costs, handles data heterogeneity, and improves retrieval performance. Human evaluations confirm the enhanced response quality of pFedRAG.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/194919