
Section 1.1 Vector spaces

A vector space is simply a mathematical set on which we can perform addition and scalar multiplication. We already have some familiarity with vector spaces since \(\real^n\) is a good example. However, as mentioned in the introduction to this chapter, polynomials have similar operations, so we would like to create a mathematical structure that allows us to study vectors and polynomials as equals. This is why the concept of a vector space is so useful.

Subsection 1.1.1 Vector spaces

The usual place to get started would be with a general definition of a vector space. However, this is one place in mathematics, among others, where a general definition can obscure the underlying idea. For that reason, let’s just start with some examples.

Example 1.1.1. Matrices.

Let’s look at the set of all \(3\times 2\) matrices, which includes the matrices
\begin{equation*} A=\begin{bmatrix} 3 \amp -1 \\ 0 \amp 2 \\ 4 \amp -3 \\ \end{bmatrix}, \hspace{0.5in} B=\begin{bmatrix} 1 \amp 3 \\ -1 \amp 0 \\ 2 \amp 4 \\ \end{bmatrix}\text{.} \end{equation*}
As we saw in our earlier course, we can multiply a matrix by a scalar and we can add matrices:
\begin{equation*} -3A=\begin{bmatrix} -9 \amp 3 \\ 0 \amp -6 \\ -12 \amp 9 \\ \end{bmatrix}, \hspace{24pt} A+B=\begin{bmatrix} 4 \amp 2 \\ -1 \amp 2 \\ 6 \amp 1 \\ \end{bmatrix}\text{.} \end{equation*}
Notice that both operations produce a new object that is also a \(3\times2\) matrix. We say that the set is closed under these operations.
With these operations, the set of \(3\times2\) matrices becomes a vector space.
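Although the text does not depend on software, these computations are easy to spot-check. Here is a minimal sketch in Python with NumPy, included only as an illustration:

```python
import numpy as np

# The 3x2 matrices A and B from this example.
A = np.array([[3, -1], [0, 2], [4, -3]])
B = np.array([[1, 3], [-1, 0], [2, 4]])

# Scalar multiplication and addition each yield another 3x2 matrix,
# illustrating closure under both operations.
print(-3 * A)          # [[-9  3] [ 0 -6] [-12  9]]
print(A + B)           # [[ 4  2] [-1  2] [ 6  1]]
print((A + B).shape)   # (3, 2)
```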
Notice that the entries in our matrices are real numbers; that is, they belong to \(\real\text{.}\) We could instead change the example so that we consider \(3\times 2\) matrices whose entries are in the complex numbers \(\complex\text{.}\)

Example 1.1.2. Complex matrices.

Consider now the set of \(1\times2\) matrices with complex entries. For example,
\begin{equation*} A=\begin{bmatrix} 2-3i \amp 4 \\ \end{bmatrix},\hspace{24pt} B=\begin{bmatrix} i \amp 1+i \\ \end{bmatrix}\text{.} \end{equation*}
Scalar multiplication includes multiplication by complex numbers so we have
\begin{equation*} (3+i)A = \begin{bmatrix} 9-7i \amp 12+4i \\ \end{bmatrix},\hspace{24pt} A+B = \begin{bmatrix} 2-2i \amp 5+i \end{bmatrix}\text{.} \end{equation*}
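As a quick check (a Python sketch using the built-in complex type, where `1j` denotes \(i\)), one can verify these computations entrywise:

```python
# Entries of the 1x2 matrices A and B; 1j denotes the imaginary unit i.
A = [2 - 3j, 4]
B = [1j, 1 + 1j]

# Scalar multiplication by the complex scalar 3 + i, applied entrywise.
print([(3 + 1j) * a for a in A])      # [(9-7j), (12+4j)]

# Entrywise addition of the two matrices.
print([a + b for a, b in zip(A, B)])  # [(2-2j), (5+1j)]
```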
These examples show that vector spaces have an underlying field, which is the set of scalars by which we can multiply. You may or may not know about fields depending on whether you have studied abstract algebra. In either case, the underlying field of our vector spaces will always be either the real numbers or the complex numbers, which we will write as \(\field=\real\) or \(\field=\complex\text{.}\)
Having seen some examples, we offer a general definition of a vector space.

Definition 1.1.3. Vector space.

A vector space over a field \(\field\) is a set \(V\) with two operations, scalar multiplication by elements of \(\field\) and addition, under which \(V\) is closed. Moreover, these operations satisfy the following natural properties:
  • Addition is commutative; that is, \(\vvec+\wvec=\wvec+\vvec\) for every pair \(\vvec,\wvec\in V\text{.}\)
  • There is an additive identity; that is, there is an element \(0\in V\) such that \(0+\vvec = \vvec\) for every \(\vvec\in V\text{.}\)
  • Every element \(\vvec\) has an additive inverse \(\wvec\) such that \(\vvec + \wvec = 0\text{.}\) We will usually write the additive inverse as \(-\vvec\text{.}\)
  • Addition is associative, which means that we can regroup a sum in the following way:
    \begin{equation*} (\uvec+\vvec) + \wvec = \uvec + (\vvec+\wvec)\text{.} \end{equation*}
  • For every \(\vvec\in V\text{,}\) we have \(1\vvec = \vvec\text{.}\)
  • Scalar multiplication is associative and distributive in the sense that
    \begin{equation*} s(t\vvec) = (st)\vvec, \hspace{24pt} (s+t)\vvec = s\vvec + t\vvec, \hspace{24pt} s(\vvec+\wvec) = s\vvec + s\wvec\text{.} \end{equation*}
That is a long list of properties. Technically speaking, if we want to check that some set is a vector space, we need to check each one of those properties. In practice, however, we will know a vector space when we see one, and we will be fairly loose with these details.

Example 1.1.4. Polynomials.

If \(\field=\real\) or \(\field=\complex\text{,}\) the set of polynomials whose coefficients are in \(\field\) forms a vector space \(\pbb\text{.}\)

Example 1.1.5. Polynomials of degree \(n\).

Rather than the set of all polynomials, we define the set \(\pbb_n\) to be the set of all polynomials whose degree is \(n\) or less. For example, \(\pbb_2\) contains all polynomials of degree two or less:
\begin{equation*} p(x) = a_2x^2 + a_1x + a_0 \end{equation*}
where the coefficients \(a_j\) are assumed to be either real or complex, as will be either specified or clear from the context.
Of course, the set of all polynomials is larger than the set of polynomials of degree two or less, and we have \(\pbb_2\subset \pbb\text{.}\) We say that \(\pbb_2\) is a vector subspace of \(\pbb\text{.}\)

Definition 1.1.6. Vector subspace.

A subset \(W\) of a vector space \(V\) is called a subspace of \(V\) if \(W\) is closed under the operations of scalar multiplication and addition that it inherits from \(V\text{.}\)
Notice that a subspace is itself a vector space and that the underlying fields of \(V\) and \(W\) are the same.
Every vector space \(V\) has two subspaces that we will frequently need to consider. Namely, the subspace consisting of only the zero vector \(W=\{0\}\) and the entire vector space \(W=V\) itself.

Example 1.1.7. Function spaces.

Let \(\fcal\) be the set of functions whose domain is \(\real\) and whose codomain is \(\complex\text{;}\) that is, functions of the form \(f:\real\to\complex\text{.}\) With addition and scalar multiplication defined pointwise, \(\fcal\) is a complex vector space.
If we were to consider functions \(f:\real\to\real\text{,}\) we would obtain a real vector space. This is not a subspace of \(\fcal\text{,}\) however, since the underlying fields are different. Rather, here are some natural subspaces of \(\fcal\text{.}\)

Example 1.1.8.

The following are subspaces of \(\fcal\text{:}\)
  • The set of functions \(f:\real\to\complex\) for which \(f(17)=0\text{.}\)
  • The set of periodic functions whose period is 7; that is, functions that satisfy \(f(x+7)=f(x)\) for all \(x\text{.}\)
  • The set of continuous functions.
The set of functions that satisfy \(f(17)=1\) is, however, not a subspace since it is not closed under scalar multiplication or vector addition.
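To see concretely why the first of these subspaces is closed while the set with \(f(17)=1\) is not, here is a small Python sketch; the particular functions are hypothetical choices made only for illustration:

```python
# Two functions from R to C that vanish at 17.
f = lambda x: (x - 17) * (1 + 2j)
g = lambda x: (x - 17) ** 2

add = lambda x: f(x) + g(x)        # the sum of two such functions
scale = lambda x: (2 - 1j) * f(x)  # a complex scalar multiple

print(add(17), scale(17))  # both print 0j: the condition f(17) = 0 is preserved

# In contrast, if m(17) = 1, then (2m)(17) = 2, so scaling leaves the set.
m = lambda x: x / 17
print(2 * m(17))  # 2.0, not 1
```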

Example 1.1.9.

If \(V\) is a vector space and \(V_1\) and \(V_2\) are subspaces, then \(V_1\cap V_2\) is also a subspace of \(V\) since the intersection is closed under scalar multiplication and addition.
When working with a vector space \(V\text{,}\) we will frequently refer to the elements of \(V\) as vectors even though they may be polynomials, matrices, functions, or even something entirely different.

Subsection 1.1.2 Linear combinations

Our study of linear algebra really began once we introduced linear combinations. Of course, linear combinations are defined purely in terms of scalar multiplication and addition so we can form linear combinations of elements in a vector space.

Definition 1.1.10.

Suppose that \(\vvec_1,\ldots,\vvec_m\) is a set of vectors in a vector space \(V\) over a field \(\field\text{.}\) A linear combination of these vectors is a vector of the form
\begin{equation*} a_1\vvec_1 + a_2\vvec_2 + \ldots + a_m\vvec_m \end{equation*}
where the scalars \(a_j\) belong to the field \(\field\text{.}\)

Example 1.1.11.

Consider the vector space \(\pbb_2\) consisting of polynomials having degree two or less and the polynomials \(p_1(x)=3x+4\) and \(p_2(x)=7x^2-2x+1\text{.}\) We can form the linear combination
\begin{equation*} 2p_1(x)-3p_2(x) = -21x^2 +12x+5\text{.} \end{equation*}
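Such computations are easy to verify with a computer algebra system; here is an illustrative check using SymPy:

```python
import sympy as sp

x = sp.symbols('x')
p1 = 3*x + 4
p2 = 7*x**2 - 2*x + 1

# Expand the linear combination 2*p1 - 3*p2.
print(sp.expand(2*p1 - 3*p2))  # -21*x**2 + 12*x + 5
```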
We can also think about concepts like span and linear independence.

Definition 1.1.12. Span.

The span of a set of vectors in a vector space is the set of all linear combinations that can be formed from the set.
It’s not hard to see that the span of a set of vectors \(\vvec_1,\vvec_2,\ldots,\vvec_m\) in \(V\) forms a subspace. We just have to check that the span is closed under scalar multiplication and addition. So we will consider vectors
\begin{align*} \uvec \amp = a_1\vvec_1 + a_2 \vvec_2 + \ldots + a_m\vvec_m\\ \wvec \amp = b_1\vvec_1 + b_2 \vvec_2 + \ldots + b_m\vvec_m\text{.} \end{align*}
If we multiply \(\uvec\) by the scalar \(s\text{,}\) we have
\begin{equation*} s\uvec = (sa_1)\vvec_1 + (sa_2) \vvec_2 + \ldots + (sa_m)\vvec_m\text{,} \end{equation*}
which is in the span of the set of vectors. Similarly,
\begin{equation*} \uvec+\wvec = (a_1+b_1)\vvec_1 + (a_2+b_2) \vvec_2 + \ldots + (a_m+b_m)\vvec_m\text{,} \end{equation*}
which is also in the span. This demonstrates the following proposition.

Proposition 1.1.13.

The span of a set of vectors \(\vvec_1,\vvec_2,\ldots,\vvec_m\) in a vector space \(V\) is a subspace of \(V\text{.}\)

We can also define linear dependence as before.

Definition 1.1.14. Linear independence.

A set of vectors in \(V\) is linearly dependent if one of the vectors can be written as a linear combination of the others. Otherwise, we say that the set is linearly independent.

Example 1.1.15.

In \(\pbb_2\text{,}\) consider the polynomials
\begin{equation*} p_1(x)=x^2-x+2,\hspace{24pt} p_2(x)=3x^2+4x-1,\hspace{24pt} p_3(x)=-7x+7\text{.} \end{equation*}
This set of polynomials is linearly dependent because \(p_3=3p_1-p_2\text{.}\)
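One can check this relation directly, for instance with a short SymPy computation (included only as an illustration):

```python
import sympy as sp

x = sp.symbols('x')
p1 = x**2 - x + 2
p2 = 3*x**2 + 4*x - 1
p3 = -7*x + 7

# The relation p3 = 3*p1 - p2 certifies linear dependence.
print(sp.expand(3*p1 - p2 - p3))  # 0
```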
Notice that this also says that
\begin{equation*} 3p_1 - p_2 - p_3 = 0\text{,} \end{equation*}
which leads to the next proposition.

Proposition 1.1.16.

A set of vectors \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is linearly dependent if and only if there are scalars \(a_1,a_2,\ldots,a_m\text{,}\) not all of which are zero, such that
\begin{equation*} a_1\vvec_1+a_2\vvec_2+\ldots+a_m\vvec_m = 0\text{.} \end{equation*}
Equivalently, the set is linearly independent if and only if the only such scalars are \(a_1=a_2=\ldots=a_m=0\text{.}\)

Proof.

The second statement is logically equivalent to the first so our proof will focus on the first statement. Suppose that the set \(\vvec_1,\ldots,\vvec_m\) is linearly dependent and that \(\vvec_k\) is the first vector that is a linear combination of vectors that occur previously in the list. This means that there are scalars \(c_1,c_2,\ldots,c_{k-1}\) such that
\begin{equation*} \vvec_k = c_1\vvec_1+c_2\vvec_2+\ldots+c_{k-1}\vvec_{k-1}\text{.} \end{equation*}
We can rewrite this expression as
\begin{equation*} c_1\vvec_1+c_2\vvec_2+\ldots+c_{k-1}\vvec_{k-1}-\vvec_k = 0\text{,} \end{equation*}
which means that there are scalars \(a_j\text{,}\) not all of which are zero since the coefficient of \(\vvec_k\) is \(-1\text{,}\) with
\begin{equation*} a_1\vvec_1+a_2\vvec_2+\ldots+a_m\vvec_m = 0\text{.} \end{equation*}
Conversely, suppose that
\begin{equation*} a_1\vvec_1+a_2\vvec_2+\ldots+a_m\vvec_m = 0 \end{equation*}
for some set of scalars, not all of which are zero, and let \(a_k\) be the last nonzero scalar. We can rewrite this expression as
\begin{equation*} \vvec_k = -\frac{a_1}{a_k}\vvec_1 -\frac{a_2}{a_k}\vvec_2 -\ldots -\frac{a_{k-1}}{a_k}\vvec_{k-1}\text{.} \end{equation*}
This shows that \(\vvec_k\) is a linear combination of the other vectors and that the set of vectors is therefore linearly dependent.

Proposition 1.1.17.

Suppose that one of the vectors in the set \(\vvec_1,\vvec_2,\ldots,\vvec_m\text{,}\) say \(\vvec_j\text{,}\) is a linear combination of the others. Then removing \(\vvec_j\) from the set does not change the span.

Proof.

If \(\wvec = a_1\vvec_1+a_2\vvec_2+\ldots+a_m\vvec_m\text{,}\) then we can replace \(\vvec_j\) in this expression with a linear combination of the other vectors. This shows that \(\wvec\) can be written as a linear combination of the set of vectors with \(\vvec_j\) removed.

Subsection 1.1.3 Bases

Definition 1.1.18.

A set of vectors in a vector space \(V\) forms a basis for \(V\) if the set is linearly independent and its span is \(V\text{.}\)

Example 1.1.19.

We can see that the polynomials
\begin{equation*} p_1(x)=1,\hspace{12pt} p_2(x)=x,\hspace{12pt} p_3(x)=x^2 \end{equation*}
form a basis of \(\pbb_2\text{.}\) Notice that this statement is true for both \(\field=\real\) and \(\field=\complex\text{.}\)
First, every polynomial \(p\) in \(\pbb_2\) can be written as
\begin{equation*} p(x)=a_0 + a_1x + a_2x^2\text{,} \end{equation*}
showing that \(p_1\text{,}\) \(p_2\text{,}\) and \(p_3\) span \(\pbb_2\text{.}\) To see that these polynomials are linearly independent, suppose that
\begin{equation*} a_1 p_1(x) + a_2p_2(x) + a_3p_3(x) = 0\text{,} \end{equation*}
the additive identity in \(\pbb_2\text{.}\) We therefore have
\begin{equation*} a_1 + a_2x + a_3x^2 = 0 + 0x+0x^2 \end{equation*}
from which we conclude that \(a_1=0\text{,}\) \(a_2=0\text{,}\) and \(a_3=0\text{.}\) Therefore, \(p_1\text{,}\) \(p_2\text{,}\) and \(p_3\) are linearly independent by Proposition 1.1.16 and hence form a basis for \(\pbb_2\text{.}\)

Example 1.1.20.

The polynomials
\begin{equation*} q_1(x)=x^2+3x,\hspace{24pt} q_2(x)=-x^2+x,\hspace{24pt} q_3(x)=2x^2+4x+2 \end{equation*}
form a basis of \(\pbb_2\text{.}\)
To see this, suppose that \(p(x)=a_0 + a_1x + a_2x^2\) is a polynomial in \(\pbb_2\text{.}\) We wish to see that \(p\) can be written as a linear combination of \(q_1\text{,}\) \(q_2\text{,}\) and \(q_3\text{.}\) This means that there are scalars \(c_1\text{,}\) \(c_2\text{,}\) and \(c_3\) such that
\begin{align*} c_1q_1 + c_2q_2 +c_3q_3 \amp = p \\ c_1(x^2+3x) + c_2(-x^2+x) + c_3(2x^2 + 4x + 2) \amp = a_0 + a_1x + a_2x^2\\ (c_1-c_2+2c_3)x^2 + (3c_1+c_2+4c_3)x + 2c_3 \amp = a_0 + a_1x + a_2x^2\text{.} \end{align*}
This is a linear system of three equations in the three variables \(c_1\text{,}\) \(c_2\text{,}\) and \(c_3\text{,}\) which may be written as
\begin{equation*} \begin{bmatrix} 1 \amp -1 \amp 2 \\ 3 \amp 1 \amp 4 \\ 0 \amp 0 \amp 2 \\ \end{bmatrix} \threevec{c_1}{c_2}{c_3} = \threevec{a_2}{a_1}{a_0}\text{,} \end{equation*}
which has a unique solution for every vector \(\threevec{a_2}{a_1}{a_0}\) since the coefficient matrix is invertible. This says that \(\laspan{q_1,q_2,q_3} = \pbb_2\) and, considering the case \(p=0\text{,}\) that these polynomials are linearly independent.
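To make this concrete, here is a SymPy sketch (an illustration, not part of the text) that confirms the coefficient matrix is invertible and solves for the coordinates \(c_1\text{,}\) \(c_2\text{,}\) \(c_3\text{:}\)

```python
import sympy as sp

a0, a1, a2 = sp.symbols('a0 a1 a2')

# Coefficient matrix from matching the x^2, x, and constant terms.
M = sp.Matrix([[1, -1, 2],
               [3,  1, 4],
               [0,  0, 2]])
print(M.det())  # 8, which is nonzero, so each system has a unique solution

# Coordinates of an arbitrary polynomial a0 + a1*x + a2*x**2.
print(M.solve(sp.Matrix([a2, a1, a0])))
```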

Example 1.1.21.

Consider the set of polynomials
\begin{align*} p_0 \amp = 1 \\ p_1 \amp = 1 +x \\ p_2 \amp = 1 +x + x^2\\ \amp \vdots \\ p_n \amp = 1 +x+x^2+\ldots+x^n \end{align*}
in \(\pbb_n\text{.}\) We claim that these polynomials form a basis for \(\pbb_n\text{.}\)
To see that they are linearly independent, we will suppose that they are linearly dependent and derive a contradiction. Suppose that
\begin{equation*} c_0p_0 + c_1p_1 + \ldots + c_np_n = 0 \end{equation*}
and that some of the scalars are nonzero. Let \(c_k\) be the last nonzero scalar so that
\begin{equation*} c_0p_0 + c_1p_1 + \ldots + c_kp_k = c_kx^k + \text{ lower order terms}\text{.} \end{equation*}
That is, \(c_kx^k\) is the only term involving \(x^k\text{,}\) so the coefficient of \(x^k\) in the linear combination is \(c_k\text{.}\) Since the combination is the zero polynomial, we have \(c_k=0\text{,}\) which contradicts our assumption that \(c_k\neq 0\text{.}\)
To see that these polynomials span \(\pbb_n\text{,}\) we offer a proof by induction. When \(n=0\text{,}\) we see that \(p_0 = 1\) spans \(\pbb_0\text{.}\) Now suppose that \(p_0,p_1,\ldots,p_{n-1}\) span \(\pbb_{n-1}\) and that \(p(x)=a_0+a_1x+a_2x^2 + \ldots + a_nx^n\) is a polynomial in \(\pbb_n\text{.}\) Notice that the polynomials \(p(x)\) and \(a_np_n(x)\) have the same coefficient of \(x^n\text{.}\) Therefore,
\begin{equation*} p(x) - a_np_n(x) \end{equation*}
is a polynomial in \(\pbb_{n-1}\) and can be written as a linear combination of \(p_0,p_1,\ldots,p_{n-1}\text{.}\) This means that
\begin{align*} p - a_np_n \amp = c_0p_0+c_1p_1+\ldots+c_{n-1}p_{n-1}\\ p \amp = c_0p_0+c_1p_1+\ldots+c_{n-1}p_{n-1} + a_np_n\text{,} \end{align*}
which expresses \(p\) as a linear combination of \(p_0,p_1,\ldots,p_n\text{.}\) This completes the induction and shows that these polynomials span \(\pbb_n\text{.}\)
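For a small case, one can guess and verify explicit coordinates. The closed form below, with \(c_k = a_k - a_{k+1}\) and \(c_n = a_n\text{,}\) is not derived in the text; the SymPy sketch simply checks it when \(n=2\text{:}\)

```python
import sympy as sp

x, a0, a1, a2 = sp.symbols('x a0 a1 a2')
p0, p1, p2 = sp.Integer(1), 1 + x, 1 + x + x**2

# Conjectured coordinates: c0 = a0 - a1, c1 = a1 - a2, c2 = a2.
combo = (a0 - a1)*p0 + (a1 - a2)*p1 + a2*p2
print(sp.expand(combo))  # a0 + a1*x + a2*x**2
```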

Example 1.1.22.

There is no finite set that forms a basis for \(\pbb\text{,}\) the set of all polynomials. Given any finite set of polynomials, there is a maximal degree \(m\) among them, so every polynomial in the span has degree at most \(m\text{.}\) Therefore, the polynomial \(x^{m+1}\) is not in the span, and the set cannot be a basis.

Definition 1.1.23.

We say that a vector space \(V\) is finite dimensional if there is a finite set whose span is \(V\text{.}\) Otherwise, we say that \(V\) is infinite dimensional.
Notice that any finite dimensional vector space must have a basis, as the following proposition records.

Proposition 1.1.24.

Every finite dimensional vector space has a basis.

Proof.

If \(V\) is a finite dimensional vector space, there is a finite set of vectors whose span is \(V\text{.}\) If this set of vectors is linearly independent, then it forms a basis. If not, we can remove one vector that is a linear combination of the others. Proposition 1.1.17 says that the span of the remaining vectors is still \(V\text{,}\) so we continue removing vectors one at a time until we obtain a linearly independent set, which must be a basis.
Notice that the two bases for \(\pbb_2\) in Example 1.1.19 and Example 1.1.20 both consist of three polynomials. This is generally true as we will begin to explain. First, we will prove a more technical, but still useful, result.

Proposition 1.1.25.

If \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is a linearly independent set in \(V\) and \(\wvec_1,\wvec_2,\ldots,\wvec_n\) is a set whose span is \(V\text{,}\) then \(m\leq n\text{.}\)

Proof.

Suppose that \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is a linearly independent set in the vector space \(V\) and that \(\wvec_1,\wvec_2,\ldots,\wvec_n\) is a set whose span is \(V\text{.}\) We wish to show that \(m\leq n\text{.}\)
We first construct a new list
\begin{equation*} \vvec_m,\wvec_1,\wvec_2,\ldots,\wvec_n \end{equation*}
whose span is \(V\text{.}\) Because the span of the \(\wvec\) vectors is \(V\text{,}\) \(\vvec_m\) is a linear combination of the \(\wvec\) vectors, which means that this set of vectors is linearly dependent. We let \(\uvec\) be the first vector in the list that is a linear combination of vectors that occur previously in the list. Since the set of \(\vvec\) vectors is linearly independent, \(\vvec_m\) is nonzero, which means that \(\uvec\) must be one of the \(\wvec\) vectors. If we remove \(\uvec\text{,}\) we have a new list
\begin{equation*} \vvec_m,\wvec_1,\ldots,\widehat{\wvec_j},\ldots,\wvec_n \end{equation*}
whose span is \(V\) by Proposition 1.1.17. Notice that the cardinality of this new list is \(n\text{.}\)
We can repeat this process. We prepend \(\vvec_{m-1}\) to the list to obtain
\begin{equation*} \vvec_{m-1},\vvec_m,\wvec_1,\ldots,\widehat{\wvec_j},\ldots,\wvec_n, \end{equation*}
which must be linearly dependent. Let \(\uvec\) be the first vector in the list that is a linear combination of vectors that occur previously in the list. Once again, since the \(\vvec\) vectors form a linearly independent set, we know that \(\uvec\) is one of the \(\wvec\) vectors. We can remove \(\uvec\) to obtain a new list of vectors whose span is \(V\text{.}\) Again, the cardinality of this new list is \(n\text{.}\)
We continue this process until all the \(\vvec\) vectors have been added to the beginning of the list. At each step, the vector we remove is one of the \(\wvec\) vectors since the \(\vvec\) vectors are linearly independent. Therefore, we have a list of \(n\) vectors that contains \(\vvec_1,\vvec_2,\ldots,\vvec_m\text{,}\) which says that \(m\leq n\text{.}\)

Proposition 1.1.26.

Any two bases of a finite dimensional vector space have the same number of vectors.

Proof.

Suppose that \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is one basis and that \(\wvec_1,\wvec_2,\ldots,\wvec_n\) is another. The set of \(\vvec\) vectors forms a linearly independent set and the set of \(\wvec\) vectors spans \(V\text{.}\) By Proposition 1.1.25, we know that \(m\leq n\text{.}\)
We can repeat this argument interchanging the two bases to conclude that \(n\leq m\text{.}\) Put together, these two facts mean that \(m=n\text{.}\)
If \(V\) is a finite dimensional vector space, we define its dimension to be the number of vectors in a basis. In this case, the number of vectors in any basis is the same so this definition does not depend on which basis we choose.

Definition 1.1.27.

If a vector space \(V\) has a basis with \(n\) vectors, we say that the dimension of \(V\) is \(n\) and write \(\dim V = n\text{.}\)
We may informally think of the dimension of a vector space as a measure of its size. Therefore, it should follow that the dimension of a subspace cannot be larger than the dimension of the vector space in which it resides. We first call attention to a useful fact.

Proposition 1.1.28.

If \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is a linearly independent set in \(V\) whose span is not \(V\text{,}\) then there is a vector \(\uvec\) in \(V\) such that \(\vvec_1,\vvec_2,\ldots,\vvec_m,\uvec\) is linearly independent.

Proof.

Under the assumptions of this proposition, the span of \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is not \(V\) so there is a vector \(\uvec\) that is not in the span of the \(\vvec\) vectors. This means that it is not a linear combination of the \(\vvec\) vectors and therefore
\begin{equation*} \vvec_1,\vvec_2,\ldots,\vvec_m,\uvec \end{equation*}
is a linearly independent set. Indeed, if \(a_1\vvec_1+a_2\vvec_2+\ldots+a_m\vvec_m+b\uvec = 0\) and \(b\neq 0\text{,}\) then \(\uvec\) would be a linear combination of the \(\vvec\) vectors. Therefore \(b=0\text{,}\) and the linear independence of the \(\vvec\) vectors then implies that each \(a_j=0\text{.}\)

Proposition 1.1.29.

If \(\dim V = n\text{,}\) then any linearly independent set of \(n\) vectors \(\vvec_1,\vvec_2,\ldots,\vvec_n\) spans \(V\) and therefore forms a basis for \(V\text{.}\)

Proof.

Any linearly independent subset of \(V\) can have no more than \(n\) vectors by Proposition 1.1.25. If this linearly independent set of \(n\) vectors does not span \(V\text{,}\) then by Proposition 1.1.28, we can add a vector \(\uvec\) so that
\begin{equation*} \vvec_1,\vvec_2,\ldots,\vvec_n,\uvec \end{equation*}
is a linearly independent subset of \(V\text{.}\) This cannot happen, however, since this set would have \(n+1\) vectors. Therefore, \(\vvec_1,\vvec_2,\ldots,\vvec_n\) must span \(V\text{.}\)

Proposition 1.1.30.

If \(W\) is a subspace of a finite dimensional vector space \(V\text{,}\) then \(W\) is finite dimensional and \(\dim W \leq \dim V\text{.}\)

Proof.

We will first explain why \(W\) is a finite dimensional vector space, which means we need to explain why there is a finite set of vectors in \(W\) whose span is \(W\text{.}\) We begin with any set of vectors \(\wvec_1,\wvec_2,\ldots,\wvec_m\) in \(W\text{.}\) By Proposition 1.1.17, we can remove vectors one at a time until we obtain a linearly independent set in \(W\text{.}\) If this set does not span \(W\text{,}\) then we can add vectors in \(W\) one at a time to obtain new linearly independent sets in \(W\text{.}\) This process must stop at some point since any linearly independent set in \(V\) can have no more than \(\dim V\) vectors by Proposition 1.1.25. When it stops, we have obtained a finite set that spans \(W\text{,}\) which says that \(W\) is finite dimensional.
Since any basis for \(W\) is also a linearly independent subset of \(V\text{,}\) it can contain no more vectors than a basis of \(V\text{.}\) This tells us that
\begin{equation*} \dim W \leq \dim V\text{.} \end{equation*}

Proposition 1.1.31.

Any linearly independent set in a finite dimensional vector space \(V\) can be extended to a basis of \(V\text{.}\)

Proof.

Suppose that \(\vvec_1,\vvec_2,\ldots,\vvec_m\) is a linearly independent set in \(V\) and that \(\wvec_1,\wvec_2,\ldots,\wvec_n\) is a basis for \(V\text{.}\) Join the two lists together to obtain
\begin{equation*} \vvec_1,\ldots,\vvec_m,\wvec_1,\ldots,\wvec_n\text{.} \end{equation*}
We are guaranteed that the span of this set is \(V\text{.}\) If it is not a linearly independent set, then we remove the first vector that is a linear combination of the others. Since the \(\vvec\) vectors are linearly independent, the vector that is removed must be one of the \(\wvec\) vectors. Continuing in this way, we eventually obtain a basis that includes the vectors \(\vvec_1,\ldots,\vvec_m\text{.}\)
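The proof is constructive, and the procedure is easy to simulate numerically. The following Python sketch carries it out in \(\real^3\) with a hypothetical starting vector, using a rank test to decide whether a vector enlarges the span (a slight variant of the proof: we keep a vector only when it is independent of those already kept):

```python
import sympy as sp

# Start with a linearly independent set (here a single vector) and
# append a known basis of R^3.
v = [sp.Matrix([1, 1, 0])]
w = [sp.Matrix([1, 0, 0]), sp.Matrix([0, 1, 0]), sp.Matrix([0, 0, 1])]

basis = []
for vec in v + w:
    # Keep vec only if it is not a linear combination of the kept vectors.
    if sp.Matrix.hstack(*(basis + [vec])).rank() == len(basis) + 1:
        basis.append(vec)

print([list(b) for b in basis])  # a basis of R^3 containing (1, 1, 0)
```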
The following is a consequence of Proposition 1.1.29.
Some further consequences of these ideas follow.