Training-free LLM Merging for Multi-task Learning

Fu, Z; Wu, X; Wang, Y; Wang, W; Ye, S; Yin, H; Chang, Y; Zheng, Y; Zhao, X

Training-free LLM Merging for Multi-task Learning

Fu, Z Wu, X Wang, Y Wang, W Ye, S Yin, H Chang, Y Zheng, Y Zhao, X

Permalink

Publisher:: Association for Computational Linguistics (ACL)
Publication Type:: Conference Proceeding
Citation:: Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2025, 1, pp. 33111-33124
Issue Date:: 2025-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (526.9 kB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Fu, Z
dc.contributor.author	Wu, X
dc.contributor.author	Wang, Y
dc.contributor.author	Wang, W
dc.contributor.author	Ye, S
dc.contributor.author	Yin, H
dc.contributor.author	Chang, Y
dc.contributor.author	Zheng, Y
dc.contributor.author	Zhao, X
dc.date	2025-07
dc.date.accessioned	2026-06-02T05:42:45Z
dc.date.available	2026-06-02T05:42:45Z
dc.date.issued	2025-01-01
dc.identifier.citation	Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2025, 1, pp. 33111-33124
dc.identifier.issn	0736-587X
dc.identifier.uri	http://hdl.handle.net/10453/195210
dc.description.abstract	Large Language Models (LLMs) have demonstrated exceptional capabilities across diverse natural language processing (NLP) tasks. The release of open-source LLMs like LLaMA and Qwen has triggered the development of numerous fine-tuned models tailored for various tasks and languages. In this paper, we explore an important question: is it possible to combine these specialized models to create a unified model with multi-task capabilities. We introduces Hierarchical Iterative Merging (Hi-Merging), a training-free method for unifying different specialized LLMs into a single model. Specifically, Hi-Merging employs model-wise and layer-wise pruning and scaling, guided by contribution analysis, to mitigate parameter conflicts. Extensive experiments on multiple-choice and question-answering tasks in both Chinese and English validate Hi-Merging's ability for multi-task learning. The results demonstrate that Hi-Merging consistently outperforms existing merging techniques and surpasses the performance of models fine-tuned on combined datasets in most scenarios. Code is available at Applied-Machine-Learning-Lab/Hi-Merging.
dc.language	en
dc.publisher	Association for Computational Linguistics (ACL)
dc.relation.ispartof	Proceedings of the Annual Meeting of the Association for Computational Linguistics
dc.relation.ispartof	Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
dc.relation.isbasedon	10.18653/v1/2025.acl-long.1588
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Training-free LLM Merging for Multi-task Learning
dc.type	Conference Proceeding
utslib.citation.volume	1
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	open_access	*
dc.rights.license	This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). To view a copy of this license, visit https://creativecommons.org/licenses/by/4.0/
dc.date.updated	2026-06-02T05:42:44Z
pubs.finish-date	2025-07
pubs.publication-status	Published
pubs.start-date	2025-07
pubs.volume	1

Abstract:

Large Language Models (LLMs) have demonstrated exceptional capabilities across diverse natural language processing (NLP) tasks. The release of open-source LLMs like LLaMA and Qwen has triggered the development of numerous fine-tuned models tailored for various tasks and languages. In this paper, we explore an important question: is it possible to combine these specialized models to create a unified model with multi-task capabilities. We introduces Hierarchical Iterative Merging (Hi-Merging), a training-free method for unifying different specialized LLMs into a single model. Specifically, Hi-Merging employs model-wise and layer-wise pruning and scaling, guided by contribution analysis, to mitigate parameter conflicts. Extensive experiments on multiple-choice and question-answering tasks in both Chinese and English validate Hi-Merging's ability for multi-task learning. The results demonstrate that Hi-Merging consistently outperforms existing merging techniques and surpasses the performance of models fine-tuned on combined datasets in most scenarios. Code is available at Applied-Machine-Learning-Lab/Hi-Merging.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/195210