Conditional CVaR DRO¶
- class dro.linear_model.conditional_dro.ConditionalCVaRDRO(input_dim, model_type='svm', fit_intercept=True, solver='MOSEK', kernel='linear')¶
- Bases: - BaseLinearDRO- Y|X (ConditionalShiftBased) Conditional Value-at-Risk (Conditional-CVaR) Distributionally Robust Optimization (DRO) model that only allow likelihood ratio changes in Y|X. - This model minimizes a robust loss function for both regression and classification tasks under a CVaR constraint only for the distribution of Y|X. - Conditional CVaR DRO model following Theorem 2 in: with alpha(x) to be beta^T x for simplicity alpha corresponds to Gamma in the paper. - Reference: <https://arxiv.org/pdf/2209.01754.pdf> - Initialize the linear DRO model. - Parameters:
- input_dim (int) – Number of input features. Must be ≥ 1. 
- model_type (str) – Type of base model. Valid options: ‘svm’, ‘logistic’, ‘ols’, ‘lad’. Defaults to ‘svm’. 
- fit_intercept (bool) – Whether to learn an intercept term. Set False for pre-centered data. Defaults to True. 
- solver (str) – Optimization solver. See class-level documentation for recommended options. Defaults to ‘MOSEK’. 
- kernel (str) – the kernel type to be used in the optimization model, default = ‘linear’ 
 
- Raises:
- If - input_dim< 1
- If - model_typeis not in [‘svm’, ‘logistic’, ‘ols’, ‘lad’]
 
 - Example:
- >>> model = ConditionalCVaRDRO(input_dim=5, model_type='logistic') >>> print(model.model_type) 'logistic' >>> print(model.alpha) 1.0 
 - update(config)¶
- Update Conditional CVaR-DRO model configuration. - Modifies control features and risk sensitivity parameters dynamically. Changes affect subsequent optimization but require manual re-fitting. - Parameters:
- config (Dict[str, Any]) – - Dictionary of configuration updates. Valid keys: - control_name: Indices of controlled features (0 ≤ index < input_dim)
- alpha: Risk level for CVaR constraint (0 < alpha ≤ 1)
 
- Raises:
- ConditionalCVaRDROError – - If - control_namecontains invalid indices
- If - alphais outside (0, 1]
- If unrecognized configuration keys are provided 
 
- Return type:
 - Control Features: - Controlled features ( - control_name) are protected from distribution shifts
- Indices must satisfy: \(0 \leq \text{index} < \text{input_dim}\) 
 - Example:
- >>> model = ConditionalCVaRDRO(input_dim=5) >>> model.update({ ... 'control_name': [0, 2], # Protect 1st and 3rd features ... 'alpha': 0.95 ... }) >>> model.update({'control_name': [5]}) # Invalid index for input_dim=5 Traceback (most recent call last): ... ConditionalCVaRDROError: All indices in 'control_name' must be in [0, input_dim - 1] 
 - Note - Setting - control_name=Nonedisables feature protection
- Lower - alphavalues reduce conservatism (focus on average risk)
- Configuration changes invalidate previous solutions (requires re-fitting) 
 
 - fit(X, y)¶
- Solve the Conditional CVaR-constrained DRO problem via convex optimization. - Constructs and optimizes a distributionally robust model that minimizes the worst-case conditional expected loss, where the ambiguity set is constrained by both CVaR and feature control parameters. - Parameters:
- X (numpy.ndarray) – Training feature matrix of shape (n_samples, n_features). Must satisfy n_features == self.input_dim. 
- y (numpy.ndarray) – - Target values of shape (n_samples,). Format requirements: - Classification: ±1 labels 
- Regression: Continuous values 
 
 
- Returns:
- Dictionary containing trained parameters: - theta: Weight vector of shape (n_features,)
- b: Intercept term (exists if fit_intercept=True)
- cvar_threshold: Optimal CVaR threshold value
 
- Return type:
- Dict[str, Any] 
- Raises:
- ConditionalCVaRDROError – - If optimization fails (solver error/infeasible) 
- If X and control features have dimension mismatch 
 
- If X.shape[1] != self.input_dim 
- If X.shape[0] != y.shape[0] 
 
 
 - Optimization Formulation:
- \[\min_{\theta,b} \ \sup_{Q \in \mathcal{Q}} \mathbb{E}_Q[\ell(\theta,b;X,y)]\]where the ambiguity set \(\mathcal{Q}\) satisfies: \[\text{CVaR}_\alpha(\ell) \leq \tau \quad \text{and} \quad X_{\text{control}} = \mathbb{E}_P[X_{\text{control}}]\]- \(X_{\text{control}}\) = features specified by control_name 
- \(\tau\) = CVaR threshold (optimization variable) 
 
- Example:
- >>> model = ConditionalCVaRDRO(input_dim=5, control_name=[0,2], alpha=0.9) >>> X_train = np.random.randn(100, 5) >>> y_train = np.sign(np.random.randn(100)) >>> params = model.fit(X_train, y_train) >>> assert params["theta"].shape == (5,) >>> assert "b" in params >>> print(f"CVaR threshold: {params['cvar_threshold']:.4f}") 
 - Note - Controlled features ( - control_name) are assumed fixed under distribution shifts
- Solution cache is invalidated after calling - update()