To talk about the type of functions called linear transformations, we need first to talk about what types of things are domains and codomains of these functions. The basic space of intetrest is a vector space. In many physical contexts a vector is simply a directed magnitude, which is to say some amount of measurement pointing along an arrow. Unfortunately, that is not a good definition, because it relies on a property which is not actually necessary for all of the things we want to do with vectors.
Convention4.1.1.Notation for complex numbers.
While most mathematicians prefer the notation \(z=x+iy\) for a complex number \(z\in\Comps\text{,}\) where \(x,y\in\Reals\text{,}\) programmers, engineers, and scientists have adopted the notation \(z=x+yj\text{.}\) In the interest of being consistent with Python, we will use \(z=x+yj\) notation throughout the text.
Subsection4.1.1Vector space axioms
The conditions in the following definition are referred to as the vector space axioms.
Definition4.1.2.Vector space.
Let \(V\) be a set where addition in \(V\) is denoted by \(\vv_1+\vv_2\) when \(\vv_1,\vv_2\in V\) and where elements of \(V\) can be multiplied by complex numbers; that is we can write \(z \vv_1\) when \(z\in\Comps\) and \(\vv_1\in V\text{.}\) Moreover, we must have a well-defined notion of equality within \(V\text{.}\) Then \(V\) is a vector space over \(\Comps\) if and only if all of the following conditions are satisfied:
Additive Closure
If \(v_1,v_2\in V\) then \(\vv_1+\vv_2\in V\text{.}\)
Scalar Closure
If \(z\in\Comps\) and \(\vv\in V\) then \(z \vv\in \Comps\text{.}\)
Commutativity
If \(\vv_1,\vv_2\in V\) then \(\vv_1+\vv_2 = \vv_2+\vv_1\text{.}\)
Additive Associativity
If \(\vv_1,\vv_2,\vv_3\in V\) then \(\vv_1+(\vv_2+\vv_3) = (\vv_1+\vv_2)+\vv_3\text{.}\)
Additive Zero
There is a vector \(\vec{0}\in V\) such that \(\vv+\vec{0} = \vv\) for all \(\vv\in V\text{.}\)
Additive Inverses
For each \(\vv\in V\) there is a vector \(-\vv\in V\) such that \(\vv+(-\vv) = \vec{0}\text{.}\)
Scalar Multiplicative Associativity
If \(z_1,z_2\in\Comps\) and \(\vv\in V\) then \(z_1(z_2\vv) = (z_1z_2)\vv\text{.}\)
Distributivity across Vector Sums
If \(z\in\Comps\) and \(\vv_1,\vv_2\in V\) then \(z(\vv_1+\vv_2) = z\vv_1 + z\vv_2\text{.}\)
Distributivity across Scalar Sums
If \(z_1,z_2\in\Comps\) and \(\vv\in V\) then \((z_1+z_2)\vv = z_1\vv+z_2\vv\text{.}\)
Scalar One
If \(\vv\in V\text{,}\) then \(1\vv = \vv\text{.}\)
If \(V\) is a vector space over \(\Comps\) then \(\Comps\) is the field of scalars of \(V\text{,}\) the elements of \(V\) are vectors, and when referring to \(V\) we say that the elements of \(\Comps\) are scalars.
Those ten conditions are precisely the conditions necessary to make directed magnitude into a rigorous definition which coincides with observable physical phenomena, which is why they are the axioms! Notationally, different texts may use different symbols to represent vectors; \(\mathbf{v}\) and \(\vec{v}\) are both common.
Subsection4.1.2Column vectors
Our introductory example of a vector space is the fundamentally most important example.
Definition4.1.3.The vector space \(\CV{n}\).
Let \(n\in\Zp\text{,}\) and define \(\CV{n}\) to be the set
and \(k\in\set{0,1,\dotsc,n-1}\text{,}\) we define \(\entry{\vv}{k} = z_k\text{,}\) the kth component of \(\vv\), and we say that when \(\vv_1,\vv_2\in\CV{n}\text{,}\)\(\vv_1 = \vv_2\) precisely when \(\entry{\vv_1}{k} = \entry{\vv_2}{k}\) for every \({k\in\set{0,1,\dotsc,n-1}}\text{.}\) We further define the following operations on \(\CV{n}\text{:}\)
Addition
If \(\vv_1,\vv_2\in\CV{n}\text{,}\) then \(\vv_1+\vv_2\in\CV{n}\) is the vector satisfying \(\entry{\vv_1+\vv_2}{k} = \entry{\vv_1}{k}+\entry{\vv_2}{k}\) for each \({k\in\set{0,1,\dotsc,n-1}}\text{;}\) and
Scalar multiplication
If \(\vv\in\CV{n}\) and \(z\in\Comps\text{,}\) then \(z\cdot\vv = z\vv\in\CV{n}\) is the vector satisfying \(\entry{z\vv}{k} = z\entry{\vv}{k}\) for each \({k\in\set{0,1,\dotsc,n-1}}\text{.}\)
Theorem4.1.4.\(\CV{n}\) is a vector space.
For every \(n\in\Zp\text{,}\) the set \(\CV{n}\) with addition and multiplication as defined in 4.1.3 is a vector space over \(\Comps\text{.}\)
Proof.
To prove that a set with given operations is a vector space over \(\Comps\) requires verifying that all the axioms hold. This is a standard exercise in any linear algebra textbook and follows directly using the component notation.
The vector space \(\CV{n}\) also inherits the complex conjugate operation from its underlying field of scalars.
Definition4.1.5.Complex conjugate.
Let \(z=x+yj\in\Comps\text{,}\) where \(x,y\in\Reals\) and \(j\) is the unique number such that \(j^2 = -1\text{.}\) Then the complex conjugate of \(z\) is \(\conj{z} = x-yj\text{.}\) If instead the complex number is represented in its polar form as \(z=r\exp(\theta j)=re^{\theta j}\text{,}\) then \(\conj{z} = r\exp(-\theta j) = re^{-\theta j}\text{.}\)
In the complex vector space \(\CV{n}\) this extends naturally, and for each \(\vv\in\CV{n}\) we define the conjugate of \(\vv\) to be the unique vector \(\conj{\vv}\in\CV{n}\) satisfying
for every \({k\in\set{0,1,2,\dotsc,n-1}}\text{.}\)
Subsection4.1.3Linear transformations
Multivariate calculus is one of the first places students encounter vectors, and in calculus the functions from one real vector space to another take many interesting forms. Of course, the original interest in calculus was describing physical motion, so the functions of multivariate calculus seem like simply higher-dimensional versions of differential calculus functions.
Example4.1.6.Functions between vector spaces.
In fact, multivariate calculus starts with multi-variable functions and proceeds through an analysis of those before moving to the study of vector-valued functions, and only towards then end of the course develops functions \(f:\Reals^m\to\Reals^n\text{,}\) such as the function given by
Here one can think of the vectors in the domain \(\Reals^3\) as ordered triples and the vectors in the codomain \(\Reals^2\) as ordered pairs, following component-wise operations.
Unlike in multivariate calculus, we are not restricted to ordered tuples of input; we have defined vector spaces in full generality. So we can define our special class of functions in its full generality as well.
Definition4.1.7.Linear transformation.
Let \(V,W\) be vector spaces over \(\Comps\) and let \(f:V\to W\) be a function. Then \(f\) is a linear transformation from \(V\) to \(W\) if and only if:
\(f(\vv_1+\vv_2) = f(\vv_1) + f(\vv_2)\) for any pair of \(\vv_1,\vv_2\in V\text{,}\) and
\(f(z\cdot \vv_1) = z\cdot f(\vv_1)\) for every \(z\in\Comps\) and \(\vv_1\in V\text{.}\)
An effective way to remember this is to think that linear transformations faithfully turn scalar multiplication and addition in the domain vector space into scalar multiplication and addition in the codomain vector space, preserving the scalars. If you add in the bijectivity condition which allowed us to define permutations, you get something even better than a well-behaved mapping between vector spaces.
Checkpoint4.1.8.Some linear transformations of \(\CV{n}\).
Verify that the following are linear transformations:
A linear transformation \(f:V\to W\) which is a bijection is a vector space isomorphism from \(V\) to \(W\).
As in most settings, isomorphism of vector spaces is important as it shows that the domain and codomain are fundamentally the same structure under different representations.
Theorem4.1.10.Every complex vector space is a \(\CV{n}\).
If \(V\) is a complex vector space, then there is some \(n\in\Zp\) such that \(V\) is isomorphic to \(\CV{n}\text{.}\)
Subsection4.1.4Inner product spaces
The idea of directed magnitude used in many vector contexts is an additional property which can be attached to a complex vector space. It is important for these definitions to remember the idea of conjugation of complex numbers, so we begin there.
Definition4.1.11.Inner product.
Define a function \(\hip{\cdot}{\cdot}:\CV{n}\times \CV{n}\to \Comps\text{.}\) The function is an inner product if and only if it satisfies the following properties:
Conjugate symmetry
\(\hip{\vv_1}{\vv_2} = \conj{\hip{\vv_2}{\vv_1}}\) for every \(\vv_1,\vv_2\in\CV{n}\text{;}\)
Linearity in the first argument
\(\hip{z_1\vv_1+z_2\vv_2}{\vv_3} = z_1\hip{\vv_1}{\vv_3} + z_2\hip{\vv_2}{\vv_3}\) for all \(\vv_1,\vv_2,\vv_3\in\CV{n}\) and \(z_1,z_2\in\Comps\text{;}\) and
In a normal calculus sequence, the vector spaces used restrict the field of scalars from \(\Comps\) to \(\Reals\text{.}\) In this special case the conjugation is irrelevant (since \(\conj{x} = x\) for all \(x\in\Reals\)) and the Hermitian inner product becomes the ordinary dot product, given by
It may seem arbitrary to introduce the Hermitian inner product, but the requirement that a complex inner product must be conjugate symmetric makes it necessary: the regular dot product is symmetric, not conjugate symmetric.
Finally, the magnitude of a vector in an inner product space is closely related to that inner product.
Definition4.1.13.Magnitude in \(\CV{n}\).
The norm of a vector \(\vv\in\CV{n}\) is the scalar quantity