Symmetry and definiteness
Tobin A. Driscoll and Richard J. Braun
As we saw in Exploiting matrix structure, symmetry can simplify the LU factorization into the symmetric form $\mathbf{A}=\mathbf{L}\mathbf{D}\mathbf{L}^T$. Important specializations occur as well for the eigenvalue and singular value factorizations. In this section we stay with complex-valued matrices, so we are interested in the case when $\mathbf{A}^*=\mathbf{A}$, i.e., when $\mathbf{A}$ is hermitian. However, we often loosely speak of symmetry to mean this property even in the complex case. All the statements in this section easily specialize to the real case.
7.4.1 Normality

Suppose now that $\mathbf{A}^*=\mathbf{A}$ and that $\mathbf{A}=\mathbf{U}\mathbf{S}\mathbf{V}^*$ is an SVD. Since $\mathbf{S}$ is real and square, we have
$$\mathbf{A}^* = \mathbf{V} \mathbf{S}^* \mathbf{U}^* = \mathbf{V} \mathbf{S} \mathbf{U}^*,$$

and it's tempting to conclude that $\mathbf{U}=\mathbf{V}$. Happily, this is nearly true. The following theorem is typically proved in an advanced linear algebra course.
Theorem 7.4.1 (Spectral decomposition)
If $\mathbf{A}=\mathbf{A}^*$, then $\mathbf{A}$ has a diagonalization $\mathbf{A}=\mathbf{V}\mathbf{D}\mathbf{V}^{-1}$ in which $\mathbf{V}$ is unitary and $\mathbf{D}$ is diagonal and real. In other words, $\mathbf{A}$ is a normal matrix with real eigenvalues.
Because hermitian matrices are normal, their eigenvalue condition number is guaranteed to be 1 by Theorem 7.2.3. That fact makes eigenvalues a robust computational target in the hermitian case.
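As a quick numerical illustration of that robustness, consider perturbing a hermitian matrix by a tiny hermitian amount; the eigenvalues should shift by no more than the size of the perturbation. The following is a minimal Julia sketch using a random matrix of our own choosing, not an example from the text:

using LinearAlgebra
A = Hermitian(randn(6, 6))
E = Hermitian(randn(6, 6))
E = 1e-8 * E / opnorm(E)                  # hermitian perturbation of norm 1e-8
# Sorted eigenvalues of A and A + E differ by at most opnorm(E);
# a condition number of 1 means no amplification of the perturbation.
maximum(abs, eigvals(A + E) - eigvals(A))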
For a hermitian matrix, the EVD

$$\mathbf{A}=\mathbf{V}\mathbf{D}\mathbf{V}^{-1}=\mathbf{V} \mathbf{D} \mathbf{V}^*$$

is almost an SVD.
If $\mathbf{A}^*=\mathbf{A}$ and $\mathbf{A}=\mathbf{V}\mathbf{D}\mathbf{V}^{-1}$ is a unitary diagonalization, then

$$\mathbf{A} = (\mathbf{V}\mathbf{T})\cdot |\mathbf{D}|\cdot \mathbf{V}^*$$

is an SVD, where $|\mathbf{D}|$ is the elementwise absolute value and $\mathbf{T}$ is diagonal with $|T_{ii}|=1$ for all $i$. In particular, the absolute values of the eigenvalues of $\mathbf{A}$ are the singular values of $\mathbf{A}$.
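That last fact is easy to verify numerically. Here is a minimal Julia sketch, using a random symmetric matrix of our own construction rather than one from the text:

using LinearAlgebra
B = randn(5, 5)
A = B + B'                            # a random real symmetric matrix
abs_eigs = sort(abs.(eigvals(A)), rev=true)
abs_eigs ≈ svdvals(A)                 # true: |eigenvalues| match the singular values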
7.4.2 Rayleigh quotient

For a hermitian matrix $\mathbf{A}$, the number $\mathbf{x}^* \mathbf{A} \mathbf{x}$ acts much like a scalar quadratic term $ax^2$. In fact, it is always a real number.
To see why, note that the dimensions of the factors ensure that $\mathbf{x}^* \mathbf{A} \mathbf{x}$ is a scalar. Therefore, its complex conjugate is the same as its hermitian, and
$$\begin{align*}
\overline{\mathbf{x}^* \mathbf{A} \mathbf{x}} & = (\mathbf{x}^* \mathbf{A} \mathbf{x})^* \\
& = \mathbf{x}^* \mathbf{A}^* (\mathbf{x}^*)^* \\
& = \mathbf{x}^* \mathbf{A} \mathbf{x},
\end{align*}$$

where the last step uses the given hermitian property. Since any complex number that equals its own conjugate is real, the result follows.

Normalizing this quadratic form by the squared length of $\mathbf{x}$ gives the Rayleigh quotient,

$$R_{\mathbf{A}}(\mathbf{x}) = \frac{\mathbf{x}^* \mathbf{A} \mathbf{x}}{\mathbf{x}^* \mathbf{x}}.$$
The following facts can be established by straightforward calculations.
Theorem 7.4.4 (Rayleigh quotient)
If $\mathbf{A}^*=\mathbf{A}$, $\mathbf{v}$ is an eigenvector of $\mathbf{A}$, and $R_{\mathbf{A}}$ is the Rayleigh quotient, then:

1. $R_{\mathbf{A}}(\mathbf{v})=\lambda$, the associated eigenvalue, and
2. the gradient satisfies $\nabla R_{\mathbf{A}}(\mathbf{v})=\boldsymbol{0}$.

As a consequence of Theorem 7.4.4, the Rayleigh quotient can be used to turn an estimate of the eigenvector $\mathbf{v}$ into an estimate of its eigenvalue $\lambda$. Specifically,
$$R_{\mathbf{A}}(\mathbf{v}+\delta\mathbf{z}) = \lambda + O(\delta^2) \quad \text{as } \delta \to 0.$$
Example 7.4.1 (Rayleigh quotient)
Julia. We will use a symmetric matrix with a known EVD and eigenvalues equal to the integers from 1 to 20.
using LinearAlgebra
n = 20;
λ = 1:n
D = diagm(λ)
V, _ = qr(randn(n, n))    # get a random orthogonal V
A = V * D * V';
The Rayleigh quotient is a scalar-valued function of a vector.
R = x -> (x' * A * x) / (x' * x);
The Rayleigh quotient evaluated at an eigenvector gives the corresponding eigenvalue.
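For instance, by the construction above, column 7 of V is an eigenvector for the eigenvalue 7, so the following should return a value very close to 7 (a quick check we add here for illustration):

R(V[:, 7])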
If the input to the Rayleigh quotient is within a small δ of an eigenvector, its output is within $O(\delta^2)$ of the corresponding eigenvalue. In this experiment, we observe that each additional digit of accuracy in an approximate eigenvector gives two more digits to the eigenvalue estimate coming from the Rayleigh quotient.
δ = @. 1 / 10^(1:5)
eval_diff = zeros(size(δ))
for (k, delta) in enumerate(δ)
e = randn(n)
e = delta * e / norm(e)
x = V[:, 7] + e
eval_diff[k] = R(x) - 7
end
using PrettyTables
labels = ["perturbation δ", "δ²", "R(x) - λ"]
@pt :header = labels [δ δ .^ 2 eval_diff]
MATLAB. We will use a symmetric matrix with a known EVD and eigenvalues equal to the integers from 1 to 20.
n = 20;
lambda = 1:n;
D = diag(lambda);
[V, ~] = qr(randn(n, n)); % get a random orthogonal V
A = V * D * V';
The Rayleigh quotient is a scalar-valued function of a vector.
R = @(x) (x' * A * x) / (x' * x);
The Rayleigh quotient evaluated at an eigenvector gives the corresponding eigenvalue.
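For instance, by the construction above, column 6 of V is an eigenvector for the eigenvalue 6, so the following should return a value very close to 6 (a quick check we add here for illustration):

R(V(:, 6))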
If the input to the Rayleigh quotient is within a small δ of an eigenvector, its output is within $O(\delta^2)$ of the corresponding eigenvalue. In this experiment, we observe that each additional digit of accuracy in an approximate eigenvector gives two more digits to the eigenvalue estimate coming from the Rayleigh quotient.
delta = 1 ./ 10 .^ (1:5)';
dif = zeros(size(delta));
for k = 1:length(delta)
e = randn(n, 1);
e = delta(k) * e / norm(e);
x = V(:, 6) + e;
dif(k) = R(x) - lambda(6);
end
table(delta, dif, VariableNames=["perturbation size", "R(x) - lambda"])
Python. We will use a symmetric matrix with a known EVD and eigenvalues equal to the integers from 1 to 20.
from numpy import arange, diag, dot, random
from numpy.linalg import qr, norm
n = 20
d = arange(n) + 1
D = diag(d)
V, _ = qr(random.randn(n, n))  # get a random orthogonal V
A = V @ D @ V.T
The Rayleigh quotient is a scalar-valued function of a vector.
R = lambda x: dot(x, A @ x) / dot(x, x)
The Rayleigh quotient evaluated at an eigenvector gives the corresponding eigenvalue.
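For instance, by the construction above, column 5 of V is an eigenvector for the eigenvalue d[5] = 6, so the following should return a value very close to 6 (a quick check we add here for illustration):

R(V[:, 5])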
If the input to the Rayleigh quotient is within a small δ of an eigenvector, its output is within $O(\delta^2)$ of the corresponding eigenvalue. In this experiment, we observe that each additional digit of accuracy in an approximate eigenvector gives two more digits to the eigenvalue estimate coming from the Rayleigh quotient.
from prettytable import PrettyTable
results = PrettyTable(["perturbation size", "R.Q. - λ"])
for delta in 1 / 10 ** arange(1, 6):
e = random.randn(n)
e = delta * e / norm(e)
x = V[:, 5] + e
quotient = R(x)
results.add_row([delta, quotient - d[5]])
print(results)
+-------------------+------------------------+
| perturbation size | R.Q. - λ |
+-------------------+------------------------+
| 0.1 | 0.0388685275091305 |
| 0.01 | 0.00048415445331340123 |
| 0.001 | 1.7378343324381262e-06 |
| 0.0001 | 6.607612057507595e-08 |
| 1e-05 | 5.722506912775316e-10 |
+-------------------+------------------------+
7.4.3 Definite, semidefinite, and indefinite matrices

In the real case, we called a symmetric matrix $\mathbf{A}$ SPD if $\mathbf{x}^T \mathbf{A}\mathbf{x} > 0$ for all nonzero vectors $\mathbf{x}$. There is an analogous definition for complex matrices: a hermitian matrix $\mathbf{A}$ is hermitian positive definite (HPD) if $\mathbf{x}^* \mathbf{A}\mathbf{x} > 0$ for all nonzero vectors $\mathbf{x}\in\mathbb{C}^n$.
It's common to use the term positive definite without either the "symmetric" or the "hermitian" adjective. This can be confusing, because the definiteness property is of little value without the symmetry property, which is therefore generally considered to be implied.
Putting the HPD property together with the Rayleigh quotient leads to the following.

Theorem 7.4.5
If $\mathbf{A}^*=\mathbf{A}$, then the following statements are equivalent:

1. $\mathbf{A}$ is HPD.
2. The eigenvalues of $\mathbf{A}$ are all positive.
Suppose item 1 is true. If $\mathbf{A}\mathbf{x} = \lambda \mathbf{x}$ is an eigenpair, then a Rayleigh quotient implies that

$$\lambda = \frac{ \mathbf{x}^*\mathbf{A}\mathbf{x} }{\mathbf{x}^*\mathbf{x}} > 0.$$

Hence, item 2 is true. Conversely, suppose item 2 is known. Then we can write the EVD as $\mathbf{A}=\mathbf{V}\mathbf{S}^2\mathbf{V}^*$, where the $S_{ii}$ are positive square roots of the eigenvalues. Hence, for any nonzero $\mathbf{x}$,

$$\mathbf{x}^*\mathbf{A}\mathbf{x} = \mathbf{x}^*\mathbf{V}\mathbf{S}^2\mathbf{V}^*\mathbf{x} = \|\mathbf{S}\mathbf{V}^*\mathbf{x}\|^2 > 0,$$

as both $\mathbf{S}$ and $\mathbf{V}$ are invertible. Thus, item 1 is true.
According to Theorem 7.4.5, for an HPD matrix, the EVD $\mathbf{A}=\mathbf{V}\mathbf{D}\mathbf{V}^*$ meets all the requirements of an SVD, provided the ordering of eigenvalues is chosen appropriately.
A hermitian matrix with all negative eigenvalues is called negative definite, and one with eigenvalues of different signs is indefinite. Finally, if one or more eigenvalues are zero and the rest have one sign, it is positive or negative semidefinite.
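Since these categories are determined entirely by the signs of the eigenvalues, they are easy to test numerically. Here is a minimal Julia sketch; the helper name definiteness and the tolerance are our own choices, not from the text, and a tolerance is needed because computed eigenvalues carry roundoff:

using LinearAlgebra
# Hypothetical helper: classify a hermitian matrix by its eigenvalue signs.
function definiteness(A; tol=1e-10)
    λ = eigvals(Hermitian(A))    # real eigenvalues, in increasing order
    if all(λ .> tol)
        return "positive definite"
    elseif all(λ .< -tol)
        return "negative definite"
    elseif all(λ .> -tol)
        return "positive semidefinite"
    elseif all(λ .< tol)
        return "negative semidefinite"
    else
        return "indefinite"
    end
end

definiteness([2 1; 1 2])    # positive definite (eigenvalues 1 and 3)
definiteness([1 2; 2 1])    # indefinite (eigenvalues -1 and 3)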
7.4.4 Exercises

✍ Each line below is an EVD for a hermitian matrix. State whether the matrix is definite, indefinite, or semidefinite. Then state whether the given factorization is also an SVD, and if it is not, modify it to find an SVD.
(a)
$$\begin{bmatrix}
0 & 0 \\ 0 & -1
\end{bmatrix} = \begin{bmatrix}
0 & 1 \\ 1 & 0
\end{bmatrix} \begin{bmatrix}
-1 & 0 \\ 0 & 0
\end{bmatrix} \begin{bmatrix}
0 & 1 \\ 1 & 0
\end{bmatrix}$$
(b)
$$\begin{bmatrix}
4 & -2 \\ -2 & 1
\end{bmatrix} = \begin{bmatrix}
1 & -0.5 \\ -0.5 & -1
\end{bmatrix} \begin{bmatrix}
5 & 0 \\ 0 & 0
\end{bmatrix} \begin{bmatrix}
0.8 & -0.4 \\ -0.4 & -0.8
\end{bmatrix}$$
(c)
$$\begin{bmatrix}
-5 & 3 \\ 3 & -5
\end{bmatrix} = \begin{bmatrix}
\alpha & \alpha \\ \alpha & -\alpha
\end{bmatrix} \begin{bmatrix}
-2 & 0 \\ 0 & -8
\end{bmatrix} \begin{bmatrix}
\alpha & \alpha \\ \alpha & -\alpha
\end{bmatrix}, \quad \alpha=1/\sqrt{2}$$
⌨ The matrix names below are found in MatrixDepot for Julia, gallery for MATLAB, and rogues for Python; you will have to adjust the syntax accordingly. For each matrix, determine whether it is positive definite, negative definite, positive or negative semidefinite, or indefinite.
(a) pei(5) $- 6\mathbf{I}$
(b) hilb(8) $- 2\mathbf{I}$
(c) dingdong(20)
(d) lehmer(100)
(e) fiedler(200)
⌨ The range of the function $R_{\mathbf{A}}(\mathbf{x})$ is a subset of the complex plane known as the field of values of the matrix $\mathbf{A}$.
(a) Use 1000 random real vectors to plot points in the field of values of the matrix
$$\begin{bmatrix}
1 & 0 & -2\\
0 & 2 & 0\\
-2 & 0 & 1
\end{bmatrix}.$$

You should get 1000 dots lying on the real axis.
(b) Compute the eigenvalues of the matrix. By comparison to the plot from (a), guess what the exact field of values is.
✍ The matrix $\mathbf{A}=\begin{bmatrix} 3 & -2 \\ -2 & 0 \end{bmatrix}$ has an eigenvector $[1,\, 2]$.

(a) Write out $R_{\mathbf{A}}(\mathbf{x})$ explicitly as a function of $x_1$ and $x_2$.

(b) Find $R_{\mathbf{A}}(\mathbf{x})$ for $x_1=1$, $x_2=2$.

(c) Find the gradient vector $\nabla R_{\mathbf{A}}(\mathbf{x})$.

(d) Show that the gradient vector is zero when $x_1=1$, $x_2=2$.
⌨ Thanks largely to Theorem 7.4.1, the eigenvalue problem for symmetric/hermitian matrices is easier than for general matrices.

(a) Let $\mathbf{A}$ be a $1000\times 1000$ random real matrix, and let $\mathbf{S}=\mathbf{A}+\mathbf{A}^T$. Time finding the eigenvalues of $\mathbf{A}$ and then of $\mathbf{S}$. You should find that the computation for $\mathbf{S}$ is around an order of magnitude faster.

(b) Perform the experiment from part (a) on $n\times n$ matrices for $n=200,300,\ldots,1600$. Plot running time as a function of $n$ for both matrices on a single log-log plot. Is the ratio of running times roughly constant, or does it grow with $n$?