$$ \newcommand{\qed}{\tag*{$\square$}} \newcommand{\span}{\operatorname{span}} \newcommand{\dim}{\operatorname{dim}} \newcommand{\rank}{\operatorname{rank}} \newcommand{\norm}[1]{\|#1\|} \newcommand{\grad}{\nabla} \newcommand{\prox}[1]{\operatorname{prox}_{#1}} \newcommand{\inner}[2]{\langle{#1}, {#2}\rangle} \newcommand{\mat}[1]{\mathcal{M}[#1]} \newcommand{\null}[1]{\operatorname{null} \left(#1\right)} \newcommand{\range}[1]{\operatorname{range} \left(#1\right)} \newcommand{\rowvec}[1]{\begin{bmatrix} #1 \end{bmatrix}^T} \newcommand{\Reals}{\mathbf{R}} \newcommand{\RR}{\mathbf{R}} \newcommand{\Complex}{\mathbf{C}} \newcommand{\Field}{\mathbf{F}} \newcommand{\Pb}{\operatorname{Pr}} \newcommand{\E}[1]{\operatorname{E}[#1]} \newcommand{\Var}[1]{\operatorname{Var}[#1]} \newcommand{\argmin}[2]{\underset{#1}{\operatorname{argmin}} {#2}} \newcommand{\optmin}[3]{ \begin{align*} & \underset{#1}{\text{minimize}} & & #2 \\ & \text{subject to} & & #3 \end{align*} } \newcommand{\optmax}[3]{ \begin{align*} & \underset{#1}{\text{maximize}} & & #2 \\ & \text{subject to} & & #3 \end{align*} } \newcommand{\optfind}[2]{ \begin{align*} & {\text{find}} & & #1 \\ & \text{subject to} & & #2 \end{align*} } $$
In this section, we introduce the concepts of eigenvectors and eigenvalues of square matrices. The eigenvectors and eigenvalues of a matrix provide insight into its behavior as an operator; for this reason, the list of eigenvalues of a matrix is referred to as its spectrum. An important result that we will present is that if a matrix has a basis of eigenvectors, then it is diagonal with respect to that basis.
A nonzero vector is an eigenvector of a matrix if there exists a scalar such that
The scalar is referred to as the eigenvalue corresponding to the eigenvector . Notice that is an eigenvalue of if and only if is not injective, or, equivalently, is not invertible.
It is a fact that eigenvectors corresponding to distinct eigenvalues are linearly independent; for a proof, see 5.10 of Axler. Notice that this means that has at most distinct eigenvalues.
Let be a basis of consisting of eigenvectors of , with corresponding eigenvalues (note that such a basis need not exist). Then has a diagonal matrix with respect to the basis :
Whenever has a diagonal matrix with respect to some basis, we say that is diagonalizable. Concretely, letting , we have that
or equivalently that
The matrix is referred to as an eigendecomposition of .
Diagonalizability of is equivalent to having linearly independent eigenvectors. We can also give a sufficient (but not necessary) condition for diagonalizability: any matrix with distinct eigenvalues is diagonalizable, since eigenvectors corresponding to distinct eigenvalues are linearly independent.
Notice that it is easy to compute powers of diagonalizable matrices:
The rank of an matrix is equal to the number of nonzero eigenvalues; in other words, if the eigenvalue is repeated times, the rank of the matrix is . The eigendecomposition of a diagonalizable matrix of rank is of the form