
9.3 Gram-Schmidt orthogonalization

Apply Gram-Schmidt to turn a basis into an orthogonal or orthonormal basis while preserving the same span.

MATH1030: Linear algebra I

An orthogonal basis is valuable, but most bases are not orthogonal when they are first given to you. Gram-Schmidt is the standard procedure that repairs that problem.

It starts from any linearly independent list and systematically removes the components that point in already-used directions. What remains is orthogonal, and the span is preserved at every step.

The projection idea behind Gram-Schmidt

Suppose $v_1,\dots,v_k$ are already orthogonal and $w$ is any vector. If you want the part of $w$ that is perpendicular to every $v_i$, you subtract the components of $w$ along those vectors:

$$w - \frac{\langle w,v_1\rangle}{\|v_1\|^2}\,v_1 - \cdots - \frac{\langle w,v_k\rangle}{\|v_k\|^2}\,v_k.$$

Theorem

Orthogonal remainder theorem

Let $S=\{v_1,\dots,v_k\}$ be an orthogonal set of nonzero vectors in $\mathbb{R}^m$, and let $w\in\mathbb{R}^m$. Define

$$v = w - \frac{\langle w,v_1\rangle}{\|v_1\|^2}\,v_1 - \cdots - \frac{\langle w,v_k\rangle}{\|v_k\|^2}\,v_k.$$

Then $v$ is perpendicular to each $v_i$.

This theorem is the engine of the algorithm. It tells you exactly how to build a new vector orthogonal to all the old ones.
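The subtraction can be checked numerically. Here is a minimal Python sketch (the vectors and the `dot` helper are our own, chosen for illustration) that forms the remainder for a pair of orthogonal vectors in $\mathbb{R}^3$ and confirms it is perpendicular to both:

```python
# Numeric check of the orthogonal remainder theorem in R^3.
# v1 and v2 are orthogonal; subtract w's components along them
# and verify the remainder is perpendicular to both.

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

v1 = [1.0, 0.0, 1.0]
v2 = [0.0, 1.0, 0.0]           # orthogonal to v1
w  = [2.0, 3.0, 5.0]           # an arbitrary vector

c1 = dot(w, v1) / dot(v1, v1)  # component of w along v1
c2 = dot(w, v2) / dot(v2, v2)  # component of w along v2
v  = [w[i] - c1 * v1[i] - c2 * v2[i] for i in range(3)]

print(dot(v, v1), dot(v, v2))  # 0.0 0.0
```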

Gram-Schmidt process

Theorem

Gram-Schmidt orthogonalization process

Let $\{w_1,\dots,w_k\}$ be a linearly independent subset of $\mathbb{R}^m$. Define

$$v_1 = w_1,$$

and for $\ell = 2,\dots,k$,

$$v_\ell = w_\ell - \frac{\langle w_\ell,v_1\rangle}{\|v_1\|^2}\,v_1 - \cdots - \frac{\langle w_\ell,v_{\ell-1}\rangle}{\|v_{\ell-1}\|^2}\,v_{\ell-1}.$$

Then:

  1. $\{v_1,\dots,v_k\}$ is orthogonal;
  2. for each $\ell$,

$$\operatorname{span}\{w_1,\dots,w_\ell\} = \operatorname{span}\{v_1,\dots,v_\ell\}.$$

In particular, the final orthogonal set spans the same subspace as the original linearly independent set.

The second conclusion matters as much as the first. Gram-Schmidt does not merely produce some orthogonal vectors. It preserves the exact subspace you started with.

How to read the algorithm

The first vector is unchanged:

$$v_1 = w_1.$$

The second vector is the part of $w_2$ orthogonal to $v_1$.

The third vector is the part of $w_3$ orthogonal to both $v_1$ and $v_2$.

And so on. Each step removes everything already accounted for by the earlier orthogonal directions.
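The recursion above can be sketched directly in Python. The function name `gram_schmidt` and the `dot` helper are our own; this is a minimal classical Gram-Schmidt, not an official implementation from the course:

```python
def gram_schmidt(ws):
    """Classical Gram-Schmidt on a linearly independent list of vectors."""
    def dot(x, y):
        return sum(a * b for a, b in zip(x, y))

    vs = []
    for w in ws:
        v = [float(x) for x in w]
        for u in vs:                   # earlier orthogonal vectors v_1, ..., v_{l-1}
            c = dot(w, u) / dot(u, u)  # coefficient of w along u
            v = [vi - c * ui for vi, ui in zip(v, u)]
        vs.append(v)                   # the new orthogonal vector v_l
    return vs

print(gram_schmidt([[2, 0], [1, 1]]))  # [[2.0, 0.0], [0.0, 1.0]]
```

Each new vector is built only from the orthogonal vectors collected so far, exactly as in the theorem's formula.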

Worked example

A short Gram-Schmidt computation in $\mathbb{R}^3$

Start with the linearly independent vectors

$$w_1 = \begin{bmatrix} 1\\0\\1 \end{bmatrix}, \qquad w_2 = \begin{bmatrix} 1\\1\\1 \end{bmatrix}, \qquad w_3 = \begin{bmatrix} 0\\1\\2 \end{bmatrix}.$$

Set

$$v_1 = w_1 = \begin{bmatrix} 1\\0\\1 \end{bmatrix}.$$

Now compute

$$\frac{\langle w_2,v_1\rangle}{\|v_1\|^2} = \frac{2}{2} = 1,$$

so

$$v_2 = w_2 - v_1 = \begin{bmatrix} 0\\1\\0 \end{bmatrix}.$$

Next,

$$\frac{\langle w_3,v_1\rangle}{\|v_1\|^2} = \frac{2}{2} = 1, \qquad \frac{\langle w_3,v_2\rangle}{\|v_2\|^2} = \frac{1}{1} = 1.$$

Therefore

$$v_3 = w_3 - v_1 - v_2 = \begin{bmatrix} -1\\0\\1 \end{bmatrix}.$$

The resulting set

$$\left\{ \begin{bmatrix} 1\\0\\1 \end{bmatrix}, \begin{bmatrix} 0\\1\\0 \end{bmatrix}, \begin{bmatrix} -1\\0\\1 \end{bmatrix} \right\}$$

is orthogonal and spans the same subspace as the original list.
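Orthogonality is easy to confirm by hand via the three pairwise inner products; a quick Python check of the vectors computed above:

```python
# Pairwise inner products of the three vectors from the example;
# all should be zero.
def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

v1, v2, v3 = [1, 0, 1], [0, 1, 0], [-1, 0, 1]
print(dot(v1, v2), dot(v1, v3), dot(v2, v3))  # 0 0 0
```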

From orthogonal to orthonormal

Gram-Schmidt gives an orthogonal basis. To get an orthonormal basis, normalize each nonzero vector:

$$u_i = \frac{v_i}{\|v_i\|}.$$

Theorem

Every subspace has an orthonormal basis

Let $V$ be a subspace of $\mathbb{R}^m$. Then $V$ has an orthogonal basis, and after normalization it also has an orthonormal basis.

So orthonormal bases are not rare or special accidents. Every finite-dimensional subspace of $\mathbb{R}^m$ admits one.
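As a small illustration of the formula $u_i = v_i/\|v_i\|$, here is a minimal Python helper (the name `normalize` is our own, not from the notes):

```python
import math

def normalize(v):
    """Divide v by its Euclidean norm ||v||."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

u = normalize([1, 0, 1])        # norm of (1, 0, 1) is sqrt(2)
print(sum(x * x for x in u))    # approximately 1
```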

Worked example

Normalize the Gram-Schmidt output

Using the vectors from the previous example:

$$\|v_1\| = \sqrt2, \qquad \|v_2\| = 1, \qquad \|v_3\| = \sqrt2.$$

So an orthonormal basis is

$$u_1 = \frac{1}{\sqrt2}\begin{bmatrix} 1\\0\\1 \end{bmatrix}, \qquad u_2 = \begin{bmatrix} 0\\1\\0 \end{bmatrix}, \qquad u_3 = \frac{1}{\sqrt2}\begin{bmatrix} -1\\0\\1 \end{bmatrix}.$$

Common mistake

Do not subtract projections onto the original w-vectors

In the recursive formula, each new vector must be built using the already constructed orthogonal vectors $v_1,\dots,v_{\ell-1}$, not the original $w_1,\dots,w_{\ell-1}$. If you subtract components along the original non-orthogonal vectors, the output need not be orthogonal.
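The failure is easy to see with the vectors from the worked example. This short Python check (the names are our own) subtracts components of $w_3$ along the original $w_1, w_2$ instead of $v_1, v_2$:

```python
def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

w1, w2, w3 = [1, 0, 1], [1, 1, 1], [0, 1, 2]

# WRONG: project onto the original, non-orthogonal w1 and w2.
c1 = dot(w3, w1) / dot(w1, w1)   # 2/2 = 1.0
c2 = dot(w3, w2) / dot(w2, w2)   # 3/3 = 1.0
bad = [w3[i] - c1 * w1[i] - c2 * w2[i] for i in range(3)]

print(bad, dot(bad, w1))  # [-2.0, 0.0, 0.0] -2.0  (not orthogonal to the first direction)
```

Compare with the correct step in the worked example, where subtracting along $v_1, v_2$ gave $v_3 = (-1, 0, 1)$, which is orthogonal to both.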

Quick check

What is the first Gram-Schmidt vector $v_1$?

Look at the definition.

Quick check

What property does Gram-Schmidt preserve besides orthogonality?

Think about the span conclusion in the theorem.

Quick check

How do you turn an orthogonal basis into an orthonormal basis?

Use normalization.

Exercises

Quick check

Apply the first two steps of Gram-Schmidt to $w_1=(1,1,0)$ and $w_2=(1,0,1)$.

Compute $v_1=w_1$, then subtract the component of $w_2$ along $v_1$.

Quick check

Why does Gram-Schmidt require the original list $w_1,\dots,w_k$ to be linearly independent?

Think about what could happen to one of the new vectors.

Quick check

Suppose Gram-Schmidt gives orthogonal vectors $v_1,v_2$ with norms 3 and 4. What are the corresponding orthonormal vectors?

Normalize each one separately.

Read 9.2 Orthogonal sets and orthonormal bases first, because Gram-Schmidt is built from the orthogonal coefficient formula.

The existence theorem here is one of the main practical tools for working with subspaces introduced earlier in 6.2 Subspaces and 6.5 Basis and dimension.

The equality cases in the next note, 9.4 Cauchy-Schwarz and triangle inequalities, explain why the projection steps here behave geometrically.
