MMD-DRO¶
We set \(d(P, Q)\) as the kernel distance with the Gaussian kernel.
Reference: Zhu, Jia-Jie, et al. “Kernel distributionally robust optimization: Generalized duality theorem and stochastic approximation.” International Conference on Artificial Intelligence and Statistics. PMLR, 2021.