奇异值分解（Singular Values Decomposition，SVD）

article/2025/10/14 0:41:36

奇异值分解

1.奇异值分解
- 1.1 变换（Transformations）
- 1.2 线性变换（Linear Transformations）
- 1.3 降维（Dimensionality Reduction）
- 1.4 奇异值分解（SVD）
- - 1.4.1 如果矩阵 $A$ 是方阵（Square Matrix）
  - 1.4.2 如果矩阵 $A$ 是非方阵（Non-Square Matrix）
- 1.5 图像压缩（Image Compression）
- 1.6 如何计算分解的三个矩阵？
- 1.7 联系与区别
- - 1.7.1 特征值和特征向量
  - 1.7.2 奇异值和奇异向量
- 1.8 矩阵估计（Matrix Approximation with SVD）

1.奇异值分解

笔记来源：Singular Value Decomposition (SVD) and Image Compression

1.1 变换（Transformations）

左侧图形先进行水平方向缩放，再进行垂直方向缩放，而后进行旋转，发现经过一系列变换后与右侧图形不符，说明缩放和旋转的操作有时是有顺序的

正确操作：先进行旋转，再进行水平方向缩放，而后进行垂直方向缩放，得到与右侧一致的图形

1.2 线性变换（Linear Transformations）

一步复杂变换 $A$ （主要考虑高维）分解为了三个简单变换 $V^{\dagger}、\Sigma、U$

1.3 降维（Dimensionality Reduction）

我们将上述矩阵 $\Sigma$ 中的 $\sigma_2=0.44$ 修改为 $\sigma_2=0$ ，二维图形直接被压缩成了一维

降维的好处：存储数据减少

1.4 奇异值分解（SVD）

1.4.1 如果矩阵 $A$ 是方阵（Square Matrix）

例子：

1.4.2 如果矩阵 $A$ 是非方阵（Non-Square Matrix）

1.5 图像压缩（Image Compression）

1.6 如何计算分解的三个矩阵？

$U=\begin{bmatrix}\boldsymbol{u}_1 & \boldsymbol{u}_2 & \cdots \end{bmatrix}\\ ~\\ \Sigma=diag\begin{bmatrix}\sigma_1 & \sigma_2 & \cdots \end{bmatrix}\\ ~\\ V^{T}=\begin{bmatrix}v_1^T \\ v_2^T\\ \vdots\end{bmatrix}$

计算矩阵 $A$ 的特征值
$det\ A=\lambda_1\lambda_2=15\\ ~\\ tr\ A=\lambda_1+\lambda_2=8\\ ~\\ \lambda_1=3、\lambda_2=5$
计算 $AA^T$
$AA^T\boldsymbol{u}_i=\sigma^2_i\boldsymbol{u}_i\\ ~\\ AA^T= \begin{bmatrix}3 & 0\\ 4 & 5\end{bmatrix} \begin{bmatrix}3 & 4\\ 0 & 5\end{bmatrix}= \begin{bmatrix}25 & 20\\ 20 & 25\end{bmatrix}$
计算 $AA^T$ 的特征值 $\sigma^2$ 【这里的 $\sigma$ 叫做矩阵 $A$ 的奇异值】
法一： $det(AA^T-\sigma I)=0$
法二：
$det\ AA^T=225=\sigma_1^2\sigma_2^2\\ tr\ AA^T=50=\sigma_1^2+\sigma_2^2\\ \sigma_1^2=45、\sigma_2^2=5\\ \sigma_1=\sqrt{45}、\sigma_2=\sqrt{5}\\ \sigma_1\sigma_2=15=det\ A$

计算 $A^TA$
$A^TA\boldsymbol{v}_i=\sigma^2_i\boldsymbol{v}_i\\ ~\\ A^TA=\begin{bmatrix}3 & 4\\ 0 & 5\end{bmatrix} \begin{bmatrix}3 & 0\\ 4 & 5\end{bmatrix}= \begin{bmatrix}9 & 12\\ 12 & 41\end{bmatrix}$
计算 $A^TA$ 的特征向量 $\boldsymbol{v}_1$ 【这里的 $\boldsymbol{v}_1$ 是矩阵 $A$ 的右奇异向量】
$A^TA\boldsymbol{v}_1=\sigma_1^2\boldsymbol{v}_1\\ ~\\ \begin{bmatrix}9 & 12\\ 12 & 41\end{bmatrix} \begin{bmatrix}a_1 \\ a_2\end{bmatrix}=45 \begin{bmatrix}a_1 \\ a_2\end{bmatrix}\\ ~\\ a_1=a_2\\ ~\\ \begin{bmatrix}9 & 12\\ 12 & 41\end{bmatrix} \begin{bmatrix}1 \\ 1\end{bmatrix}=45 \begin{bmatrix}1 \\ 1\end{bmatrix}$
计算 $A^TA$ 的特征向量 $\boldsymbol{v}_2$ 【这里的 $\boldsymbol{v}_2$ 是矩阵 $A$ 的右奇异向量】
$A^TA\boldsymbol{v}_2=\sigma_2^2\boldsymbol{v}_2\\ ~\\ \begin{bmatrix}9 & 12\\ 12 & 41\end{bmatrix} \begin{bmatrix}a_3 \\ a_4\end{bmatrix}=5 \begin{bmatrix}a_3 \\ a_4\end{bmatrix}\\ ~\\ a_3=-a_4\\ ~\\ \begin{bmatrix}9 & 12\\ 12 & 41\end{bmatrix} \begin{bmatrix}-1 \\ 1\end{bmatrix}=5 \begin{bmatrix}-1 \\ 1\end{bmatrix}$
单位化后的矩阵 $A$ 的右奇异向量 $\boldsymbol{v}_1、\boldsymbol{v}_2$
$\boldsymbol{v}_1=\frac{1}{\sqrt{2}}\begin{bmatrix}1 \\ 1\end{bmatrix}、\boldsymbol{v}_2=\frac{1}{\sqrt{2}}\begin{bmatrix}-1 \\ 1\end{bmatrix}$
计算矩阵 $A$ 的左奇异向量 $\boldsymbol{u}_1、\boldsymbol{u}_2$
$A\boldsymbol{v}_1=\sigma_1\boldsymbol{u}_1\\ ~\\ \boldsymbol{u}_1=\frac{A\boldsymbol{v}_1}{\sigma_1}=\begin{bmatrix}1 \\ 3\end{bmatrix}\\ ~\\ A\boldsymbol{v}_2=\sigma_2\boldsymbol{u}_2\\ ~\\ \boldsymbol{u}_2=\frac{A\boldsymbol{v}_2}{\sigma_2}=\begin{bmatrix}-3 \\ 1\end{bmatrix}$
单位化矩阵 $A$ 的左奇异向量 $\boldsymbol{u}_1、\boldsymbol{u}_2$
$\boldsymbol{u}_1=\frac{1}{\sqrt{10}}\begin{bmatrix}1 \\ 3\end{bmatrix}、\boldsymbol{u}_2=\frac{1}{\sqrt{10}}\begin{bmatrix}-3 \\ 1\end{bmatrix}$

分解成的三个矩阵

1.7 联系与区别

笔记来源：Understanding Eigenvalues and Singular Values

联系：特征值和奇异值都描述了线性变换的量级或者说是变换幅度

They (eigenvalues and singular values) both describe the behavior of a matrix on a certain set of vectors.
And the corresponding eigen- and singular values describe the magnitude of that action.

1.7.1 特征值和特征向量

本人相关博客：特征值、特征向量、迹

笔记来源：Understanding Eigenvalues and Singular Values

The eigenvectors of a matrix describe the directions of its invariant action.（不变作用方向）

That eigenvectors give the directions of invariant action is obvious from the definition. The definition says that when A acts on an eigenvector, it just multiplies it by a constant, the corresponding eigenvalue. In other words, when a linear transformation acts on one of its eigenvectors, it shrinks the vector or stretches it and reverses its direction if λ is negative, but never changes the direction otherwise. The action is invariant.

$A\boldsymbol{v}=\lambda\boldsymbol{v}$

the vector $\boldsymbol{v}$ is called eigenvector
A scalar $\lambda$ is called eigenvalue of $A$

1.7.2 奇异值和奇异向量

笔记来源：Understanding Eigenvalues and Singular Values
$A\boldsymbol{v}=\sigma\boldsymbol{u}\\ A^*\boldsymbol{u}=\sigma\boldsymbol{v}$
where $A^*$ is the conjugate transpose of $A$

$A\boldsymbol{v}_1=\sigma_1\boldsymbol{u}_1\\ A\boldsymbol{v}_2=\sigma_2\boldsymbol{u}_2$

A scalar $\sigma$ is a singular value of $A$
$\sigma_1$ is the largest singular value of $A$ with right singular vector $\boldsymbol{v}$
$\sigma_2$ is the least singular value of $A$ with left singular vector $\boldsymbol{u}$

the vectors $\boldsymbol{u}、\boldsymbol{v}$ are singular vectors
the vector $\boldsymbol{u}$ is called a left singular vectors
the vector $\boldsymbol{v}$ is called a right singular vectors