Abstract
Open-Set Domain Adaptation (OSDA) aims to transfer knowledge from the labeled source domain to the unlabeled target domain that contains unknown categories, thus facing the challenges of domain shift and unknown category recognition. While recent works have demonstrated the potential of causality for domain alignment, little exploration has been conducted on causal-inspired theoretical frameworks for OSDA. To fill this gap, we introduce the concept of Susceptibility and propose a novel Counterfactual-based susceptibility risk framework for OSDA, termed COSDA. Specifically, COSDA consists of three novel components: (i) a Susceptibility Risk Estimator (SRE) for capturing causal information, along with comprehensive derivations of the computable theoretical upper bound, forming a risk minimization framework under the OSDA paradigm; (ii) a Contrastive Feature Alignment (CFA) module, which is theoretically proven based on mutual information to satisfy the Exogeneity assumption and facilitate cross-domain feature alignment; (iii) a Virtual Multi-unknown-categories Prototype (VMP) pseudo-labeling strategy, providing label information by measuring how similar samples are to known and multiple virtual unknown category prototypes, thereby assisting in open-set recognition and intra-class discriminative feature learning. Extensive experiments demonstrate that our approach achieves state-of-the-art performance.
Contributions
In this paper we build our framework for the MTDA pivoted around two key concepts: feature aggregation and curriculum learning.
Promising Way: We propose a structural causal model within the OSDA paradigm and perform theoretical derivations and algorithm design centered on the counterfactual probability of susceptibility. This addresses its optimizability, evaluability, and identifiability, thus presenting a principled causal framework for OSDA tasks.
Systematic Theoretical Framework: We introduce a novel counterfactual-based risk—Susceptibility risk—along with its evaluator. A theoretically computable upper bound for the target domain’s susceptibility risk is derived within the OSDA paradigm.
Innovative Techniques: We identify an optimization objective that satisfies the Exogeneity causal assumption, which is recognized as maximizing the mutual information between the target sample and the source domain prototype. Based on this, we propose a Virtual Multi-unknown-categories Prototype (VMP) pseudo-labeling strategy and a Contrastiveinspired Feature Alignment (CFA) module.
Comprehensive Experiments: We validate the effectiveness of the model on three benchmark datasets, achieving improvements of 2.9%, 2.2%, and 1.0% respectively, compared to state-of-the-art (SOTA) algorithms. Ablation studies and experiments on synthetic datasets confirm the effectiveness of each proposed module
Pipeline

Figure: An overview of our proposed COSDA.
In Sub-Figure (a), it describes the deployment of feature interventions. We add an intervention module utilizing Multi-Layer Perception (MLP) following feature C, executing an intervention via a nonlinear transformation of feature C. The feature distribution is a Gaussian distribution with parameterized mean and variance, as popularized by VIB (Kingma et al, 2015).
In Sub-Figure (b), the VMP and CFA are proposed to address Exogeneity. The central idea of VMP (1.-4.) is to build the centroids of all known and unknown classes, and then generate the pseudo label for samples by comparing the distance between samples and the centroids. CFA (5.) introduces the concept of feature alignment, which reduces the distance to the appropriate class centroid while maximizing the distance to the centroids of other classes, establishing class-level cross-domain feature alignment. Through adaptation, we align the features of known classes while also learning the decision boundary for unknown classes.
Code
Contact
You can reach us at the following email addresses: