1、直和
给定两个向量空间$V,W$,维数分别是$n,m$。$V$里有一组向量基底$\left\{\vec{e}_{1}, \vec{e}_{2}, \cdots, \vec{e}_{n}\right\}$,同样$W$里也有一组基底$\left\{\vec{f}_{1}, \vec{f}_{2}, \cdots, \vec{f}_{m}\right\}$.我们现在定义$n+m$个向量构成基组$\left\{\vec{e}_{1}, \cdots, \vec{e}_{n}, \vec{f}_{1}, \cdots, \vec{f}_{m}\right\}$. 由这些基组生成的线性空间叫做$V$与$W$生成的直和空间$V\oplus W$,线性空间维数为$n+m$.
任意向量$\vec{v} \in V$都可以写成基底线性叠加的形式$\vec{v}=v_{1} \vec{e}_{1}+v_{2} \vec{e}_{2}+\cdots+v_{n} \vec{e}_{n}$. 线性代数里我们可以用这些叠加系数构成的列向量来代表这个向量。
\begin{equation}
\left.\vec{v}=\left(\begin{array}{c}
v_{1} \\
v_{2} \\
\vdots \\
v_{n}
\end{array}\right)\right\} n
\end{equation}值得注意,这个列向量是线性空间$V$的一个元素,列向量有$n$个分量。特别地,向量基底可以写成
\begin{equation}
\vec{e}_{1}=\left(\begin{array}{c}
1 \\
0 \\
\vdots \\
0
\end{array}\right), \quad \vec{e}_{2}=\left(\begin{array}{c}
0 \\
1 \\
\vdots \\
0
\end{array}\right)_{V}, \quad \cdots, \quad \vec{e}_{n}=\left(\begin{array}{c}
0 \\
0 \\
\vdots \\
1
\end{array}\right)_{V}
\end{equation}若基组是正交组,$\vec{e}_{i} \cdot \vec{e}_{j}=\delta_{i j}$,则有$v_i=\vec{e}_i\cdot\vec{v}$.同样对于$w$中的任意向量也可以有
\begin{equation}
\left.\vec{w}=\left(\begin{array}{c}
w_{1} \\
w_{2} \\
\vdots \\
w_{n}
\end{array}\right)\right\} m
\end{equation}基向量为
\begin{equation}
\vec{f}_{1}=\left(\begin{array}{c}
1 \\
0 \\
\vdots \\
0
\end{array}\right)_{W}, \quad \vec{f}_{2}=\left(\begin{array}{c}
0 \\
1 \\
\vdots \\
0
\end{array}\right)_{W}, \quad \cdots, \quad \vec{f}_{m}=\left(\begin{array}{c}
0 \\
0 \\
\vdots \\
1
\end{array}\right)_{W}
\end{equation}自然地,对于元素$\vec{v}\in V$和$\vec{w}\in W$在直和空间的列向量形式可以推广成
\begin{equation}
\left.\left.\vec{v}=\left(\begin{array}{c}
v_{1} \\
\vdots \\
v_{n} \\
\hline 0 \\
\vdots \\
0
\end{array}\right)\right\} n+m, \quad \vec{w}=\left(\begin{array}{c}
0 \\
\vdots \\
0 \\
w_{1} \\
\vdots \\
w_{m}
\end{array}\right)\right\} n+m
\end{equation}以看到,只是把其余部分用0填充。
我们也可以定义两个非零向量的直和
\begin{equation}
\left.\vec{v} \oplus \vec{w}=\left(\begin{array}{c}
v_{1} \\
\vdots \\
v_{n} \\
\hline w_{1} \\
\vdots \\
w_{m}
\end{array}\right)=\left(\frac{\vec{v}}{\vec{w}}\right)\right\} n+m
\end{equation}数学上矩阵表示一个线性空间到另一个线性空间的映射。这里我们只讨论一个$V\mapsto V$的特殊情况,因为量子力学里大部分情况便是如此。力学量$A:V\mapsto V$
\begin{equation}
\vec{v}\mapsto A\vec{v}
\end{equation}\begin{equation}
\left(\begin{array}{c}
v_{1} \\
v_{2} \\
\vdots \\
v_{n}
\end{array}\right) \longmapsto\left(\begin{array}{cccc}
a_{11} & a_{12} & \cdots & a_{1 n} \\
a_{22} & a_{22} & \cdots & a_{2 n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{n 1} & a_{n 2} & \cdots & a_{n n}
\end{array}\right)_{V}\left(\begin{array}{c}
v_{1} \\
v_{2} \\
\vdots \\
v_{n}
\end{array}\right)_{V}
\end{equation}类似的有矩阵$B:W\mapsto W,\vec{w}\mapsto B\vec{w}$
在直和空间里,我们希望矩阵$A,B$经过某种运算后作用在向量上保持$\vec{v}\mapsto A\vec{v}$和$\vec{w}\mapsto B\vec{w}$。这样的运算记成$A\oplus B$. 运算结果可以利用分块对角矩阵构造
\begin{equation}
A \oplus B=\left(\begin{array}{c|c}
A & 0_{n \times m} \\
\hline 0_{m \times n} & B
\end{array}\right)
\end{equation}例如$n=2,m=3$的情况
\begin{equation}
A=\left(\begin{array}{cc}
a_{11} & a_{12} \\
a_{21} & a_{22}
\end{array}\right)_{V}, \quad B=\left(\begin{array}{ccc}
b_{11} & b_{12} & b_{13} \\
b_{21} & b_{22} & b_{23} \\
b_{31} & b_{32} & b_{33}
\end{array}\right)_{W}
\end{equation}他们的直和为
\begin{equation}
A \oplus B=\left(\begin{array}{cc|ccc}
a_{11} & a_{12} & 0 & 0 & 0 \\
a_{21} & a_{22} & 0 & 0 & 0 \\
\hline 0 & 0 & b_{11} & b_{12} & b_{13} \\
0 & 0 & b_{21} & b_{22} & b_{23} \\
0 & 0 & b_{31} & b_{32} & b_{33}
\end{array}\right)
\end{equation}显然$A,B$分别作用在不同的子空间里。显然直和矩阵$A\oplus B$作用在直和向量$\vec{v} \oplus \vec{w}$为
\begin{equation}
(A \oplus B)(\vec{v} \oplus \vec{w})=\left(\begin{array}{c|c}
A & 0_{n \times m} \\
\hline 0_{m \times n} & B
\end{array}\right)\left(\frac{\vec{v}}{\vec{w}}\right)=\left(\frac{A \vec{v}}{B \vec{w}}\right)=(A \vec{v}) \oplus(B \vec{w})
\end{equation}若有两个直和矩阵相乘,利用对角矩阵运算显然得到
\begin{equation}
\left(A_{1} \oplus B_{1}\right)\left(A_{2} \oplus B_{2}\right)=\left(A_{1} A_{2}\right) \oplus\left(B_{1} B_{2}\right)
\end{equation}直和空间$V\oplus W$一共有$(n+m)^2$个线性无关矩阵,而子空间$V$和$W$分别只有$n^2,m^2$个线性无关矩阵,这说明并不是所有在直和空间中的矩阵都能表示成直和形式。
利用分块矩阵行列式规则显然得到
\begin{equation}
\operatorname{det}(A \oplus B)=(\operatorname{det} A)(\operatorname{det} B)
\end{equation}分块矩阵求迹得到的是各个子块的迹之和
\begin{equation}
\operatorname{Tr}(A \oplus B)=(\operatorname{Tr} A)+(\operatorname{Tr} B)
\end{equation}2、直积
直积这个词在有些书上叫张量积、Kronecker积
和之前一样,给定两个线性空间$V,W$,分别$n,m$维. $V$中有一组基底$\left\{\vec{e}_{1}, \vec{e}_{2}, \cdots, \vec{e}_{n}\right\}$,$W$中也有一组基底$\left\{\vec{f}_{1}, \vec{f}_{2}, \cdots, \vec{f}_{m}\right\}$.我们现在定义$nm$个基向量$\vec{e}_{i} \otimes \vec{f}_{j}$,其中$i=1,\cdots,n$和$j=1,\cdots,m$.$\otimes$具体运算下面将会讲明,可以看到$\vec{e}_i\otimes\vec{e}_j$有两个指标,这也是为什么直积也叫张量积的原因。
张量积是双线性的,即在$V$中是线性的,在$W$中也是线性的. 如果有多于两个线性空间,则叫多重线性的。这意味着$\vec{v} \otimes \vec{w}=\left(\sum_{i}^{n} v_{i} \vec{e}_{i}\right) \otimes\left(\sum_{j}^{m} w_{j} \vec{f}_{j}\right)=\sum_{i}^{n} \sum_{j}^{m} v_{i} w_{j}\left(\vec{e}_{i} \otimes \vec{f}_{j}\right)$. 换句话说由向量基底$\vec{e}_i\otimes\vec{f}_i$生成的线性空间$V\otimes W$中的向量由这组基$\vec{v}_i\otimes\vec{w}_j$线性表出,系数为$v_iw_j$
一种使它非常明确的方法是把它写出来。例如假设$n=2,m=3$。张量积空间的维数为$nm=6$新的基底分别是$\vec{e}_{1} \otimes \vec{f}_{1}, \vec{e}_{1} \otimes \vec{f}_{2}, \vec{e}_{1} \otimes \vec{f}_{3}, \vec{e}_{2} \otimes \vec{f}_{1}, \vec{e}_{2} \otimes \vec{f}_{2}, \vec{e}_{2} \otimes \vec{f}_{3}$. 写出这6个基底对应的列向量
\begin{equation}
\begin{split}
&\vec{e}_{1} \otimes \vec{f}_{1}=\left(\begin{array}{c}
1 \\
0 \\
0 \\
\hline 0 \\
0 \\
0
\end{array}\right), \quad \vec{e}_{1} \otimes \vec{f}_{2}=\left(\begin{array}{c}
0 \\
1 \\
0 \\
0 \\
0 \\
0
\end{array}\right), \quad \vec{e}_{1} \otimes \vec{f}_{3}=\left(\begin{array}{c}
0 \\
0 \\
1 \\
\hline 0 \\
0 \\
0
\end{array}\right)\\
&\vec{e}_{2} \otimes \vec{f}_{1}=\left(\begin{array}{c}
0 \\
0 \\
0 \\
\hline 1 \\
0 \\
0
\end{array}\right), \quad \vec{e}_{2} \otimes \vec{f}_{2}=\left(\begin{array}{c}
0 \\
0 \\
0 \\
\hline 0 \\
1 \\
0
\end{array}\right), \quad \vec{e}_{2} \otimes \vec{f}_{3}=\left(\begin{array}{c}
0 \\
0 \\
0 \\
\hline 0 \\
0 \\
1
\end{array}\right)
\end{split}
\end{equation}这里横线把基向量分割成了两个不同的子集,第一部分由$\vec{e}_1$生成,第二部分由$\vec{e}_2$生成。对于一般的向量$\vec{v}\in V,\vec{w}\in W$,张量积为
\begin{equation}
\vec{v} \otimes \vec{w}=\left(\begin{array}{l}
v_{1} w_{1} \\
v_{1} w_{2} \\
v_{1} w_{3} \\
\hline v_{2} w_{1} \\
v_{2} w_{2} \\
v_{2} w_{3}
\end{array}\right)
\end{equation}在数学上矩阵是一个线性空间到另一个线性空间的线性映射。同样的我们只关注量子力学感兴趣的$A:V\mapsto V$
或者说
\begin{equation}
\left(\begin{array}{c}
v_{1} \\
v_{2} \\
\vdots \\
v_{n}
\end{array}\right) \longmapsto\left(\begin{array}{cccc}
a_{11} & a_{12} & \cdots & a_{1 n} \\
a_{22} & a_{22} & \cdots & a_{2 n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{n 1} & a_{n 2} & \cdots & a_{n n}
\end{array}\right)\left(\begin{array}{c}
v_{1} \\
v_{2} \\
\vdots \\
v_{n}
\end{array}\right)_{V}
\end{equation}在张量积空间上,相同的矩阵仍然可以作用在矢量上,使得$\vec{v} \mapsto A \vec{v}$但$\vec{w} \mapsto \vec{w}$保持不变。这个矩阵可以写成$A\otimes I$,其中$I$是恒等矩阵。在之前的$n=2,m=3$例子里,矩阵$A$是$2\times2$维矩阵,显然$A\otimes I$是$6\times6$维矩阵
\begin{equation}
A \otimes I=\left(\begin{array}{ccc|ccc}
a_{11} & 0 & 0 & a_{12} & 0 & 0 \\
0 & a_{11} & 0 & 0 & a_{12} & 0 \\
0 & 0 & a_{11} & 0 & 0 & a_{12} \\
\hline a_{21} & 0 & 0 & a_{22} & 0 & 0 \\
0 & a_{21} & 0 & 0 & a_{22} & 0 \\
0 & 0 & a_{21} & 0 & 0 & a_{22}
\end{array}\right)
\end{equation}把这个矩阵作用在$\vec{v}\otimes\vec{w}$上就知道了之所以这样构造的原因
\begin{equation}
\begin{aligned}
(A \otimes I)(\vec{v} \otimes \vec{w}) &=\left(\begin{array}{ccc|ccc}
a_{11} & 0 & 0 & a_{12} & 0 & 0 \\
0 & a_{11} & 0 & 0 & a_{12} & 0 \\
0 & 0 & a_{11} & 0 & 0 & a_{12} \\
\hline a_{21} & 0 & 0 & a_{22} & 0 & 0 \\
0 & a_{21} & 0 & 0 & a_{22} & 0 \\
0 & 0 & a_{21} & 0 & 0 & a_{22}
\end{array}\right)\left(\begin{array}{c}
v_{1} w_{1} \\
v_{1} w_{2} \\
v_{1} w_{3} \\
\hline v_{2} w_{1} \\
v_{2} w_{2} \\
v_{2} w_{3}
\end{array}\right) \\
=&\left(\begin{array}{c}
\left(a_{11} v_{1}+a_{12} v_{2}\right) w_{1} \\
\left(a_{11} v_{1}+a_{12} v_{2}\right) w_{2} \\
\left(a_{11} v_{1}+a_{12} v_{2}\right) w_{3} \\
\hline\left(a_{21} v_{1}+a_{22} v_{2}\right) w_{1} \\
\left(a_{21} v_{1}+a_{22} v_{2}\right) w_{2} \\
\left(a_{21} v_{1}+a_{22} v_{2}\right) w_{3}
\end{array}\right) &=(A \vec{v}) \otimes \vec{w}
\end{aligned}
\end{equation}显然矩阵$A$仅仅只作用在$\vec{v}\in V$上并且保持$\vec{w}\in W$不变。
类似的对于矩阵$B:W\mapsto W$, $\vec{w}\mapsto B\vec{w}$. 同样的也能用$I\otimes B$作用在$V\otimes W$中的矢量。其中
\begin{equation}
I \otimes B=\left(\begin{array}{ccc|ccc}
b_{11} & b_{12} & b_{13} & 0 & 0 & 0 \\
b_{21} & b_{22} & b_{23} & 0 & 0 & 0 \\
b_{31} & b_{32} & b_{33} & 0 & 0 & 0 \\
\hline 0 & 0 & 0 & b_{11} & b_{12} & b_{13} \\
0 & 0 & 0 & b_{21} & b_{22} & b_{23} \\
0 & 0 & 0 & b_{31} & b_{32} & b_{33}
\end{array}\right)
\end{equation}作用在$\vec{v}\otimes\vec{w}$为
\begin{equation}
\begin{aligned}
(I \otimes B)(\vec{v} \otimes \vec{w}) &=\left(\begin{array}{ccc|ccc}
b_{11} & b_{12} & b_{13} & 0 & 0 & 0 \\
b_{21} & b_{22} & b_{23} & 0 & 0 & 0 \\
b_{31} & b_{32} & b_{33} & 0 & 0 & 0 \\
\hline 0 & 0 & 0 & b_{11} & b_{12} & b_{13} \\
0 & 0 & 0 & b_{21} & b_{22} & b_{23} \\
0 & 0 & 0 & b_{31} & b_{32} & b_{33}
\end{array}\right)\left(\begin{array}{c}
v_{1} w_{1} \\
v_{1} w_{2} \\
v_{1} w_{3} \\
\hline v_{2} w_{1} \\
v_{2} w_{2} \\
v_{2} w_{3}
\end{array}\right) \\
=&\left(\begin{array}{c}
v_{1}\left(b_{11} w_{1}+b_{12} w_{2}+b_{13} w_{3}\right) \\
v_{1}\left(b_{21} w_{1}+b_{22} w_{2}+b_{23} w_{3}\right) \\
v_{1}\left(b_{31} w_{1}+b_{32} w_{2}+b_{33} w_{3}\right) \\
\hline v_{2}\left(b_{11} w_{1}+b_{12} w_{2}+b_{13} w_{3}\right) \\
v_{2}\left(b_{21} w_{1}+b_{22} w_{2}+b_{23} w_{3}\right) \\
v_{2}\left(b_{31} w_{1}+b_{32} w_{2}+b_{33} w_{3}\right)
\end{array}\right) &=\vec{v} \otimes(B \vec{w})
\end{aligned}
\end{equation}一般而言
\begin{equation}
\begin{split}
(A \otimes I)(\vec{v} \otimes \vec{w})&=(A \vec{v}) \otimes \vec{w}\\
(I \otimes B)(\vec{v} \otimes \vec{w})&=\vec{v} \otimes(B \vec{w})
\end{split}
\end{equation}利用上面式子很容易证明下列式子
\begin{equation}
\begin{array}{l}
\left(A_{1} \otimes I\right)\left(A_{2} \otimes I\right)=\left(A_{1} A_{2}\right) \otimes I \\
\left(I \otimes B_{1}\right)\left(I \otimes B_{2}\right)=I \otimes\left(B_{1} B_{2}\right) \\
\left(A_{1} \otimes B_{1}\right)\left(A_{2} \otimes B_{2}\right)=\left(A_{1}A_{2}\right) \otimes\left(B_{1} B_{2}\right) \\
(A \otimes I)(I \otimes B)=(I \otimes B)(A \otimes I)=(A \otimes B)
\end{array}
\end{equation}证明:
\begin{equation}
(A_1\otimes I)(A_2\otimes I)(\vec{v}\otimes\vec{w})=(A_1\otimes I)(A_2\vec{v}\otimes\vec{w})=(A_1A_2\vec{v}\otimes\vec{w})=((A_1 A_2)\otimes I)(\vec{v}\otimes\vec{w})
\end{equation}第二三式类似证明,不再赘述
\begin{equation}
(A\otimes I)(I\otimes B)(\vec{v}\otimes\vec{w})=(A\otimes I)(v\otimes B\vec{w})=(A\vec{v})\otimes(B\vec{w})=(A\otimes B)(\vec{v}\otimes\vec{w})=(I \otimes B)(A \otimes I)(\vec{v}\otimes\vec{w})
\end{equation}最后一式定义了$A\otimes B$,也可以写出矩阵形式
\begin{equation}
\begin{split}
A \otimes B&=(A\otimes I)(I\otimes B)\\
&=\left(\begin{array}{ccc|ccc}
a_{11} & 0 & 0 & a_{12} & 0 & 0 \\
0 & a_{11} & 0 & 0 & a_{12} & 0 \\
0 & 0 & a_{11} & 0 & 0 & a_{12} \\
\hline a_{21} & 0 & 0 & a_{22} & 0 & 0 \\
0 & a_{21} & 0 & 0 & a_{22} & 0 \\
0 & 0 & a_{21} & 0 & 0 & a_{22}
\end{array}\right)\left(\begin{array}{ccc|ccc}
b_{11} & b_{12} & b_{13} & 0 & 0 & 0 \\
b_{21} & b_{22} & b_{23} & 0 & 0 & 0 \\
b_{31} & b_{32} & b_{33} & 0 & 0 & 0 \\
\hline 0 & 0 & 0 & b_{11} & b_{12} & b_{13} \\
0 & 0 & 0 & b_{21} & b_{22} & b_{23} \\
0 & 0 & 0 & b_{31} & b_{32} & b_{33}
\end{array}\right)\\
&=\left(\begin{array}{ccc|ccc}
a_{11} b_{11} & a_{11} b_{12} & a_{11} b_{13} & a_{12} b_{11} & a_{12} b_{12} & a_{12} b_{13} \\
a_{11} b_{21} & a_{11} b_{22} & a_{11} b_{23} & a_{12} b_{21} & a_{12} b_{22} & a_{12} b_{23} \\
a_{11} b_{31} & a_{11} b_{32} & a_{11} b_{33} & a_{12} b_{31} & a_{12} b_{32} & a_{12} b_{33} \\
\hline a_{21} b_{11} & a_{21} b_{12} & a_{21} b_{13} & a_{22} b_{11} & a_{22} b_{12} & a_{22} b_{13} \\
a_{21} b_{21} & a_{21} b_{22} & a_{21} b_{23} & a_{22} b_{21} & a_{22} b_{22} & a_{22} b_{23} \\
a_{21} b_{31} & a_{21} b_{32} & a_{21} b_{33} & a_{22} b_{31} & a_{22} b_{32} & a_{22} b_{33}
\end{array}\right)
\end{split}
\end{equation}值得注意,张量积空间中不是所有矩阵都能写成两个子空间矩阵张量积的形式。此外还有两个常用公式
\begin{equation}
\begin{array}{l}
\operatorname{det}(A \otimes B)=(\operatorname{det} A)^{m}(\operatorname{det} B)^{n} \\
\operatorname{Tr}(A \otimes B)=(\operatorname{Tr} A)(\operatorname{Tr} B)
\end{array}
\end{equation}