LanDA: Language-Guided Multi-Source Domain Adaptation
Abstract
LanDA is a language-guided MSDA approach that achieves high performance in adapting knowledge from multiple source domains to a target ___domain using only textual descriptions without any target images.
Multi-Source Domain Adaptation (MSDA) aims to mitigate changes in data distribution when transferring knowledge from multiple labeled source domains to an unlabeled target ___domain. However, existing MSDA techniques assume target ___domain images are available, yet overlook image-rich semantic information. Consequently, an open question is whether MSDA can be guided solely by textual cues in the absence of target ___domain images. By employing a multimodal model with a joint image and language embedding space, we propose a novel language-guided MSDA approach, termed LanDA, based on optimal transfer theory, which facilitates the transfer of multiple source domains to a new target ___domain, requiring only a textual description of the target ___domain without needing even a single target ___domain image, while retaining task-relevant information. We present extensive experiments across different transfer scenarios using a suite of relevant benchmarks, demonstrating that LanDA outperforms standard fine-tuning and ensemble approaches in both target and source domains.
Get this paper in your agent:
hf papers read 2401.14148 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper