MASS: MoErging through Adaptive Subspace Selection

@article{zirilli2025mass, title={MASS: MoErging through Adaptive Subspace Selection}, author={Donato Crisostomi* and Alessandro Zirilli* and Antonio Andrea Gargiulo and Maria Sofia Bucarelli and Simone Scardapane and Fabrizio Silvestri and Iacopo Masi and Emanuele Rodolà}, journal={arXiv preprint arXiv:2504.05342}, year={2025} } × Copy BibTeX Abstract Model merging has recently emerged as a lightweight alternative to ensembling, combining multiple fine-tuned models into a single set of parameters with no additional training overhead. Yet, existing merging methods fall short of matching the full accuracy of separately fine-tuned end-points. We present MASS (MoErging through Adaptive Subspace Selection), a new approach that closes this gap by unifying multiple fine-tuned models while retaining near state-of-the-art performance across tasks. Building on the low-rank decomposition of per-task updates, MASS stores only the most salient singular components for each task and merges them into a shared model. At inference time, a non-parametric, data-free router identifies which subspace (or combination thereof) best explains an input’s intermediate features and activates the corresponding task-specific block. This procedure is fully training-free and introduces only a two-pass inference overhead plus a∼2×storage factor compared to a single pretrained model, irrespective of the number of tasks. We evaluate MASS on CLIP-based image classification using ViT-B-16, ViT-B-32 and ViT-L-14 for benchmarks of 8, 14 and 20 tasks respectively, establishing a new state-of-the-art. Most notably, MASS recovers up to∼ 98% of the average accuracy of individual fine-tuned models, making it a practical alternative to ensembling at a fraction of the storage cost. ...

April 6, 2025 · Donato Crisostomi*, Alessandro Zirilli*, Antonio Andrea Gargiulo, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Iacopo Masi, Emanuele Rodolà

Solar panels energy production forecasting

This repository contains the code for three distinct solutions developed for the Enefit Kaggle challenge. ⭐️ ...

December 24, 2023 · Luca Zhou, Marincione Davide, Zirilli Alessandro