
3.1 Matrix multiplication and identity matrices

Learn when matrix products are defined, how the row-by-column rule works, and why identity matrices matter for systems and inverses.


MATH1030: Linear algebra I

Rigorous linear algebra notes on systems, matrices, structure, and proof, with interaction used only where it clarifies the mathematics.


Matrix multiplication is the first matrix operation that genuinely mixes rows with columns. It is also the operation that lets matrices encode composition, systems of equations, and later inverse matrices. Because of that, you should not memorize the rule as a pattern of symbols only. You should know what the dimensions are doing at each step.

Why multiplication is more subtle than addition

Addition and scalar multiplication act entry by entry. Matrix multiplication is different. To compute one output entry, you compare one row of the left matrix with one column of the right matrix.

That is why dimensions matter so strictly.

Definition

When a matrix product is defined

If A is an m×n matrix and B is an n×p matrix, then the product AB is defined and is an m×p matrix.

If the number of columns of A does not equal the number of rows of B, then the product AB is undefined.

The inner dimensions must match. The outer dimensions tell you the size of the result.
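This dimension check is easy to mechanize. A minimal sketch in plain Python (the function name `product_shape` is illustrative, not notation from this course):

```python
def product_shape(shape_a, shape_b):
    """Return the shape of AB, or None if the product is undefined."""
    m, n = shape_a            # A is m x n
    rows_b, p = shape_b       # B must be n x p
    if n != rows_b:           # inner dimensions must match
        return None
    return (m, p)             # outer dimensions give the result size

print(product_shape((2, 3), (3, 5)))  # (2, 5): inner 3 = 3
print(product_shape((2, 3), (4, 5)))  # None: inner 3 != 4
```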

The row-by-column rule

Definition

Matrix multiplication

Suppose A = [a_{ij}] is an m×n matrix and B = [b_{jk}] is an n×p matrix.

Then the (i,k) entry of AB is

(AB)_{ik} = a_{i1}b_{1k} + a_{i2}b_{2k} + \cdots + a_{in}b_{nk}.

So each output entry is the dot-product-style combination of row i of A with column k of B.

This rule explains three important facts at once:

  • multiplication is not entrywise;
  • the inner dimensions must match;
  • the output entry uses every matched position in the row and column.
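All three facts can be read directly off a bare-hands implementation of the rule. A sketch in plain Python, with matrices stored as lists of rows (the helper name `matmul` is mine, not the course's):

```python
def matmul(A, B):
    """Multiply matrices (lists of rows) by the row-by-column rule."""
    m, n = len(A), len(A[0])
    if len(B) != n:
        raise ValueError("inner dimensions do not match")
    p = len(B[0])
    # (AB)[i][k] combines row i of A with column k of B, using every j.
    return [[sum(A[i][j] * B[j][k] for j in range(n)) for k in range(p)]
            for i in range(m)]

A = [[1, 2], [3, -1]]
B = [[4, 0], [5, 1]]
print(matmul(A, B))  # [[14, 2], [7, -1]]
```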

Worked example

Compute a product carefully

Let

A = \begin{bmatrix} 1 & 2 \\ 3 & -1 \end{bmatrix}, \qquad B = \begin{bmatrix} 4 & 0 \\ 5 & 1 \end{bmatrix}.

Then AB is defined because both matrices are 2×2. Its entries are:

(AB)_{11} = 1 \cdot 4 + 2 \cdot 5 = 14, \qquad (AB)_{12} = 1 \cdot 0 + 2 \cdot 1 = 2,
(AB)_{21} = 3 \cdot 4 + (-1) \cdot 5 = 7, \qquad (AB)_{22} = 3 \cdot 0 + (-1) \cdot 1 = -1.

So

AB = \begin{bmatrix} 14 & 2 \\ 7 & -1 \end{bmatrix}.

Matrix-vector multiplication is a system statement

If x is a column vector, then Ax is a special case of matrix multiplication. It packages the left-hand sides of a linear system into one object.

For

A = \begin{bmatrix} 1 & 2 & -1 \\ 3 & -1 & 5 \end{bmatrix}, \qquad x = \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix},

we have

Ax = \begin{bmatrix} x_1 + 2x_2 - x_3 \\ 3x_1 - x_2 + 5x_3 \end{bmatrix}.

So the system Ax = b is not merely shorthand. It is a matrix product whose entries reproduce the equations of the system.
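With a concrete input, Ax literally evaluates the left-hand sides of the system. A sketch (the helper name `mat_vec` is illustrative), taking x = (1, 1, 1):

```python
def mat_vec(A, x):
    """Compute Ax: entry i combines row i of A with the column vector x."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

A = [[1, 2, -1], [3, -1, 5]]
x = [1, 1, 1]
# Entry 1: 1 + 2 - 1 = 2; entry 2: 3 - 1 + 5 = 7
print(mat_vec(A, x))  # [2, 7]
```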

Identity matrices do nothing, on purpose

Definition

Identity matrix

For each positive integer n, the identity matrix I_n is the n×n square matrix with 1 on the main diagonal and 0 everywhere else.

For example,

I_2 = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \qquad I_3 = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}.

The identity matrix matters because it preserves any compatible matrix:

AI_n = A, \qquad I_m A = A

whenever the sizes match.
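Both identities can be spot-checked numerically; a sketch with the illustrative helpers `identity` and `matmul`:

```python
def identity(n):
    """Build I_n: 1 on the main diagonal, 0 everywhere else."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(A, B):
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))] for i in range(len(A))]

A = [[2, -1], [4, 3]]
print(matmul(A, identity(2)) == A)  # True: A I_2 = A
print(matmul(identity(2), A) == A)  # True: I_2 A = A
```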

Worked example

Why multiplying by the identity changes nothing

Let

A = \begin{bmatrix} 2 & -1 \\ 4 & 3 \end{bmatrix}.

Then

AI_2 = \begin{bmatrix} 2 & -1 \\ 4 & 3 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} 2 & -1 \\ 4 & 3 \end{bmatrix}.

The first column of AI_2 reproduces the first column of A, and the second column reproduces the second column of A.

That is exactly why inverse matrices are defined through the identity later: if A^{-1} exists, then AA^{-1} = I.

Multiplication is usually not commutative

One of the first conceptual shocks in linear algebra is that

AB \ne BA

in general.

Sometimes both products are defined and differ. Sometimes one product is defined and the other is not. So order matters twice: it matters for meaning, and it matters for the final answer.
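The 2×2 pair from the worked example already shows this: both products are defined, yet they differ. A numeric check (helper `matmul` is illustrative):

```python
def matmul(A, B):
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))] for i in range(len(A))]

A = [[1, 2], [3, -1]]
B = [[4, 0], [5, 1]]
print(matmul(A, B))  # [[14, 2], [7, -1]]
print(matmul(B, A))  # [[4, 8], [8, 9]] -- a different matrix
```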

Use the figure below to watch one output entry being built from a selected row and a selected column.

Read and try

Follow one matrix product entry

(Interactive widget: it updates each entry of AB as you change the entries of A and B, showing how one output entry, e.g. 8 = 1×2 + 2×3, is built from a selected row and a selected column.)
Read the product by columns as well as by entries

The row-by-column rule is the standard local computation rule, but it is not the only useful interpretation.

Write the columns of BB as

B = [b_1\ b_2\ \cdots\ b_p].

Then the product can be read as

AB = [Ab_1\ Ab_2\ \cdots\ Ab_p].

So each column of AB is obtained by applying A to the corresponding column of B.

Worked example

One product read column by column

Let

A = \begin{bmatrix} 1 & 2 \\ 3 & -1 \end{bmatrix}, \qquad B = \begin{bmatrix} 4 & 0 \\ 5 & 1 \end{bmatrix}.

If

b_1 = \begin{bmatrix} 4 \\ 5 \end{bmatrix}, \qquad b_2 = \begin{bmatrix} 0 \\ 1 \end{bmatrix},

then

Ab_1 = \begin{bmatrix} 14 \\ 7 \end{bmatrix}, \qquad Ab_2 = \begin{bmatrix} 2 \\ -1 \end{bmatrix}.

Therefore

AB = \begin{bmatrix} 14 & 2 \\ 7 & -1 \end{bmatrix}.

This is the same answer as the entrywise row-by-column computation. The point is that matrix multiplication packages several matrix-vector products together.
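The column reading can also be checked mechanically: extract the columns of B, apply A to each, and reassemble. A sketch with the illustrative helper `mat_vec`:

```python
def mat_vec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

A = [[1, 2], [3, -1]]
B = [[4, 0], [5, 1]]
cols_B = [[row[k] for row in B] for k in range(len(B[0]))]  # [b_1, b_2]
cols_AB = [mat_vec(A, b) for b in cols_B]                   # [Ab_1, Ab_2]
print(cols_AB)  # [[14, 7], [2, -1]]
# Reassemble AB from its columns:
AB = [[col[i] for col in cols_AB] for i in range(len(A))]
print(AB)       # [[14, 2], [7, -1]]
```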

Matrix multiplication represents composition

The multiplication rule is not arbitrary. It is the rule that makes matrices encode linear transformations in sequence.

If a vector x is first sent to Bx, and then that result is sent to A(Bx), the combined effect is

(AB)x.

That is why the inner dimensions must match. The output of the first map must be a valid input for the second one.
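A numeric check that applying B and then A agrees with applying AB once, using the 2×2 matrices from the worked example with x = (1, 2) (helpers `mat_vec` and `matmul` are illustrative):

```python
def mat_vec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def matmul(A, B):
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))] for i in range(len(A))]

A = [[1, 2], [3, -1]]
B = [[4, 0], [5, 1]]
x = [1, 2]
print(mat_vec(A, mat_vec(B, x)))  # A(Bx) = [18, 5]
print(mat_vec(matmul(A, B), x))   # (AB)x = [18, 5], the same vector
```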

Theorem

Associativity matches repeated composition

Whenever the products are defined,

A(BC) = (AB)C.

So we may regroup a chain of matrix products without changing the final linear transformation.

This does not mean that order may be changed. Associativity lets us change parentheses, not the order of the factors themselves.

Worked example

Grouping may change, but order may not

Suppose A is 2×3, B is 3×4, and C is 4×2.

Then both AB and BC are defined, so both (AB)C and A(BC) make sense, and associativity says they are equal.

But BA is not defined at all, because the inner dimensions 4 and 2 do not match. So matrix multiplication is associative, but not commutative.
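This example can be checked numerically with matrices of exactly these sizes (the particular entries below are mine, chosen arbitrarily; the helper `matmul` is illustrative):

```python
def matmul(A, B):
    if len(A[0]) != len(B):
        raise ValueError("inner dimensions do not match")
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))] for i in range(len(A))]

A = [[1, 0, 2], [0, 1, 1]]                      # 2 x 3
B = [[1, 2, 0, 1], [0, 1, 1, 0], [2, 0, 1, 1]]  # 3 x 4
C = [[1, 0], [0, 1], [1, 1], [2, 0]]            # 4 x 2
print(matmul(matmul(A, B), C) == matmul(A, matmul(B, C)))  # True
try:
    matmul(B, A)  # inner dimensions 4 and 2 do not match
except ValueError as err:
    print("BA is undefined:", err)
```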

Standard basis vectors explain why columns behave so cleanly

The standard basis vectors make the column interpretation precise. In R^n, the vector e_k has a 1 in position k and 0 everywhere else. If A is an m×n matrix, then Ae_k is exactly the kth column of A.

This is why the identity matrix behaves so naturally. The columns of I_n are e_1, e_2, \ldots, e_n, so right-multiplying by I_n simply reproduces the columns of A one by one.

This also explains why a compatible zero matrix on the right forces the product to be zero: every column of the zero matrix is the zero vector, so every column of the product is A0 = 0.
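A quick check that Ae_k picks out the kth column, using the matrix from the matrix-vector example (helper `mat_vec` is illustrative):

```python
def mat_vec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

A = [[1, 2, -1], [3, -1, 5]]
e2 = [0, 1, 0]            # standard basis vector e_2 in R^3
print(mat_vec(A, e2))     # [2, -1], the second column of A
```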

The first algebra laws worth remembering

Once multiplication is defined, the next issue is how it interacts with the other matrix operations you already know.

Whenever the sizes are compatible, matrix multiplication satisfies:

A(B + C) = AB + AC, \qquad (A + B)C = AC + BC,

and scalar multiplication may be moved in or out:

(cA)B = c(AB) = A(cB).

The zero matrix is the simplest sanity check for these rules. If 0 is a compatible zero matrix, then

A0 = 0, \qquad 0A = 0.

The reason is that every row-by-column product uses only zero entries from the zero matrix, so every output entry is zero as well.
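Both the distributive law and the zero-matrix rule can be spot-checked numerically (helpers `matmul` and `add` are illustrative):

```python
def matmul(A, B):
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))] for i in range(len(A))]

def add(X, Y):
    """Entrywise sum of two same-sized matrices."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

A = [[1, 2], [3, -1]]
B = [[4, 0], [5, 1]]
C = [[0, 1], [2, 2]]
# A(B + C) = AB + AC
print(matmul(A, add(B, C)) == add(matmul(A, B), matmul(A, C)))  # True
Z = [[0, 0], [0, 0]]
print(matmul(A, Z))  # [[0, 0], [0, 0]]
```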

These identities are basic, but they matter because later arguments about inverse matrices, row operations, and block-matrix computation assume them silently. If you do not know them explicitly, longer calculations become much harder to audit.

Worked example

A zero product does not force a zero factor

Let

A = \begin{bmatrix} 1 & -1 \end{bmatrix}, \qquad B = \begin{bmatrix} 1 \\ 1 \end{bmatrix}.

Neither factor is the zero matrix, yet

AB = \begin{bmatrix} 1 \cdot 1 + (-1) \cdot 1 \end{bmatrix} = \begin{bmatrix} 0 \end{bmatrix}.

So matrix multiplication behaves differently from real-number multiplication: AB = 0 does not imply A = 0 or B = 0.

Theorem

The identity matrix is unique

If E is an n×n matrix such that

EA = A \qquad \text{and} \qquad AE = A

for every n×n matrix A, then E = I_n.

Proof

Why no second identity matrix can exist

Suppose E satisfies both conditions. Apply AE = A with A = I_n to get I_n E = I_n. But I_n E = E, because multiplying by the identity preserves any compatible matrix. Therefore E = I_n.

Common mistakes

Common mistake

Matrix multiplication is not entrywise multiplication

The entry (AB)_{ik} is not a_{ik}b_{ik}. It is built from the whole ith row of A and the whole kth column of B.

Common mistake

Defined products can still appear in only one order

If A is 2×3 and B is 3×4, then AB is defined but BA is not. Never assume the reverse order makes sense automatically.

Quick checks

Quick check

If A is 2×3 and B is 3×5, what is the size of AB?

Use the inner dimensions to test whether the product is defined, then read the outer dimensions.

Solution

The inner dimensions match (3 = 3), so AB is defined, and its size is given by the outer dimensions: 2×5.

Quick check

What does multiplying by I_n do to a compatible matrix?

Answer in one sentence.

Solution

Nothing: multiplying a compatible matrix by I_n leaves it unchanged, since AI_n = A and I_n A = A.

Quick check

If the columns of B are b_1 and b_2, how should you read the columns of AB?

Use the column interpretation of matrix multiplication.

Solution

The columns of AB are Ab_1 and Ab_2: each column of AB is A applied to the corresponding column of B.

Exercise

Quick check

Why does Ax = 0 always have at least one solution, no matter what A is?

Think of x as a column vector.

Solution

Take x = 0, the zero column vector. Every row-by-column combination against the zero vector is zero, so A0 = 0. Hence x = 0 solves Ax = 0 for every A.

Quick check

Explain why BA may be undefined even when AB is defined.

Answer in terms of the inner dimensions, not just by giving one example.

Solution

AB is defined when the number of columns of A equals the number of rows of B. BA instead requires the number of columns of B to equal the number of rows of A, which is an independent condition; when it fails, BA is undefined even though AB is defined.

This note depends on 2.1 Matrix basics. Continue to 3.2 Transpose and special matrices or jump ahead to 5.1 Invertible matrices.

Section mastery checkpoint

Answer each question correctly to complete this section checkpoint.

Skills: matrix-multiplication, dimensions

If A is 2×3 and B is 3×4, what is the size of AB?

Attempts used: 0

Attempts remaining: Unlimited attempts

Preview does not consume an attempt.

Submit records a graded attempt.

Skills: matrix-multiplication, row-by-column

Let A = [[1,2],[3,4]] and B = [[0,5],[1,-1]]. Fill in the blank: the (1,2)-entry of AB is ____.


Syntax guidance: Enter a single scalar.

Skills: identity-matrix, matrix-multiplication

Which statement best describes the role of the identity matrix?

