Difference of Convex Relaxation (DC)

news2025/7/12 13:29:19

问题背景

$\begin{aligned}&\underset{m}{\operatorname*{minimize}}\quad\|\boldsymbol{m}\|^2\\&\mathrm{subject~to}\quad\|\boldsymbol{m}^\mathsf{H}\boldsymbol{h}_k^e\|^2\geq1,\forall k.\end{aligned}$

$\begin{aligned}\mathscr{P}_{1}:&\underset{M}{\operatorname*{minimize}}\quad\mathrm{Tr}(M)\\&\mathrm{subject~to}\quad\mathrm{Tr}(\boldsymbol{M}\boldsymbol{H}_{k})\geq1,\forall k,\\&\boldsymbol{M}\succeq0,\mathrm{rank}(\boldsymbol{M})=1,\end{aligned}$

$\begin{aligned}\text{find}&v\\\text{subject to}&\tilde{v}^\mathsf{H}\boldsymbol{R}_k\tilde{\boldsymbol{v}}+|c_k|^2\geq1,\forall k,\\&|v_n|^2=1,\forall v=1,\cdots,N,\end{aligned}$

$\begin{aligned}\mathscr{P}_{2}:\quad\mathrm{find}\quad\boldsymbol{V}\\\mathrm{subject~to}&\mathrm{Tr}(\boldsymbol{R}_{k}\boldsymbol{V})+|c_{k}|^{2}\geq1,\forall k,\\&\boldsymbol{V}_{n,n}=1,\forall n=1,\cdots,N,\\&\boldsymbol{V}\succeq0,\quad\mathrm{rank}(\boldsymbol{V})=1.\end{aligned}$

为了进一步解决问题P1和问题P2中的非凸性，一种流行的方法是通过SDR技术简单地丢弃非凸的秩一约束[17]。由此产生的SDP问题可以通过现有的凸优化解算器（例如CVX [19]）来有效地求解。如果SDP问题的最优解是秩一的，则可以通过秩一分解来恢复原始问题的最优解。另一方面，如果SDP问题的最优解不是秩1，则需要应用高斯随机化[17]等附加步骤来提取原始问题的次优解。然而，观察到对于高维优化问题（例如，天线数量N增加），则对于SDR方法返回秩一解的概率变低，这产生显著的性能恶化[7]、[18]。为了克服SDR方法的局限性，我们在下面的章节中提出了一种新的DC框架来解决问题P1和问题P2。

所针对的一般问题

一阶约束问题的DC框架

为了便于介绍，我们首先考虑具有秩1约束的一般低秩矩阵优化问题的DC算法，如下所示，

$\begin{aligned}&\underset{\boldsymbol{X}\in\mathcal{C}}{\operatorname*{minimize}}\quad\mathrm{Tr}(\boldsymbol{A}_{0}\boldsymbol{X})\\&\mathrm{subject~to}\quad\mathrm{Tr}(\boldsymbol{A}_{k}\boldsymbol{X})\geq d_{k},\forall k,\\&\boldsymbol{X}\succeq0,\mathrm{rank}(\boldsymbol{X})=1, \tag{18} \end{aligned}$

其中约束集 $\mathcal{C}$ 是凸的。关于rank-one约束的一个关键观察是，它可以等效地写为DC函数约束，这在以下Proposition中正式陈述。

Proposition 1. For positive semidefinite (PSD) matrix $\in \mathbb{C}^{N\times N}$ and $\mathrm{Tr}( \boldsymbol{X}) \geq 1, \textit{we have}$
$\operatorname{rank}(\boldsymbol{X})=1\Longleftrightarrow\operatorname{Tr}(\boldsymbol{X})-\|\boldsymbol{X}\|_2=0, \tag{19}$

$\begin{aligned}&where\textit{ trace norm }\operatorname{Tr}(\boldsymbol{X})=\sum_{i=1}^N\sigma_i(\boldsymbol{X})\textit{ and spectral norm}\\&\|\boldsymbol{X}\|_2=\sigma_1(\boldsymbol{X})\textit{ with }\sigma_i(\boldsymbol{X})\textit{ denoting the i-th largest singular}\\&\text{value of matrix X.}\end{aligned}$

为了增强问题（18）的低秩解，我们提出将（19）中的DC函数作为惩罚分量添加到目标函数中，而不是经由SDR方法移除非凸秩一约束，从而得到

$\begin{aligned}&\underset{X\in\mathcal{C}}{\operatorname*{minimize}}\quad\mathrm{Tr}(\boldsymbol{A}_0\boldsymbol{X})+\rho\cdot(\mathrm{Tr}(\boldsymbol{X})-\|\boldsymbol{X}\|_2)\\&\mathrm{subject~to}\quad\mathrm{Tr}(\boldsymbol{A}_k\boldsymbol{X})\geq d_k,\forall k,\\&\boldsymbol{X}\succeq0, \tag{20} \end{aligned}$

where $\rho>0$ is the penalty parameter. Note that we are able to obtain an exact rank-one solution $X^*$ when the nonnegative component (Tr $(\boldsymbol{X}^*)-\|\boldsymbol{X}^*\|_2)$ in the objective function is enforced to be zero.

DC算法

虽然问题（20）仍然是非凸的，但它可以通过利用优化最小化技术(MM方法，见我之前的博客)以迭代方式求解，从而产生DC算法[20]。主要思想是通过线性化凹项 $-\rho\|X\|_{2}$ 将问题（20）转化为一系列简单的子问题X2、目标函数。具体地说，我们需要在第 $t$ 次迭代时求解由下式给出的子问题：

$\begin{aligned}&\operatorname*{minimize}_{\boldsymbol{X}\in\mathcal{C}}\quad\mathrm{Tr}(\boldsymbol{A}_{0}\boldsymbol{X})+\rho\cdot\langle\boldsymbol{X},\boldsymbol{I}-\partial\|\boldsymbol{X}^{t-1}\|_{2}\rangle\\&\mathrm{subject~to}\quad\mathrm{Tr}(\boldsymbol{A}_{k}\boldsymbol{X})\geq d_{k},\forall k,\\&\boldsymbol{X}\succeq0, \tag{21} \end{aligned}$

where $X^{t-1}$ is the optimal solution of the subproblem at iteration $t - 1.$ It is clear that the subproblem (21) is convex and can be solved efficiently by existing solvers such as CVX [19]. In addition, the subgradient $\partial\|\tilde{X}\|_2$ can be computed efficiently by the following proposition [3].

Proposition $\textit{ For given PSD matrix }\boldsymbol{X}\in \mathbb{C} ^{N\times N}, thesub$ - gradient $\partial \| \boldsymbol{X}\| _2\textit{can be computed as }v_1\boldsymbol{v}_1^\mathrm{H} , where\boldsymbol{v}_1\in \mathbb{C} ^N$ is the leading eigenvector of matrix X.

所提出的DC算法从任意初始点收敛到问题（20）的临界点[20]。因此，我们在算法1中总结了所提出的DC算法。

在这里插入图片描述

Proposed Alternating DC Approach

In this subsection, we apply the proposed DC framework to problem $\mathscr{P}_1$ and problem $\mathscr{P}_2.$ Specifically, to find a rank-one solution to problem $\mathscr{P}_1$ , we propose to solve the following DC programming problem

$\begin{aligned}&\underset{M}{\operatorname*{minimize}}\quad\mathrm{Tr}(\boldsymbol{M})+\rho(\mathrm{Tr}(\boldsymbol{M})-\|\boldsymbol{M}\|_{2})\\&\mathrm{subject~to}\quad\mathrm{Tr}(\boldsymbol{MH}_{k})\geq1,\forall k,\\&\boldsymbol{M}\succeq0, \tag{22}\end{aligned}$

where $\rho>0$ is the penalty parameter. When the penalty component is enforced to be zero, problem (22) shall induce a rank-one solution $M^\star$ , we can thus recover the solution $m$ to problem (12) through Cholesky decomposition $M^\star=\boldsymbol{m}m^\mathrm{H}.$

To detect feasibility for problem $\mathscr{P}_2$ , we propose to minimize the difference between trace norm and spectral norm as follows,

$\begin{aligned}&\underset{V}{\operatorname*{minimize}}\quad\mathrm{Tr}(\boldsymbol{V})-\|\boldsymbol{V}\|_{2}\\&\mathrm{subject~to}\quad\mathrm{Tr}(\boldsymbol{R}_{k}\boldsymbol{V})+|c_{k}|^{2}\geq1,\forall k,\\&\boldsymbol{V}_{n,n}=1,\forall n=1,\cdots,N,\\&\boldsymbol{V}\succeq0. \tag{23}\end{aligned}$

When the objective value of problem (23) becomes zero, we shall find an exact rank-one optimal solution $V^{\star}$ . By Cholesky decomposition $V^\star=\tilde{v}\tilde{v}\tilde{v}^\mathrm{H}$ , we can obtain a feasible solution $\tilde{v}$ to problem (14). If the objective value fails to be zero, we claim that problem $\mathscr{P}_2$ (i.e., problem (14)) is infeasible.