In many of our applications in 2d and 3d, we will encounter quantities that have both a magnitude (like a distance) and also a direction. Such quantities are called vectors. That is, a vector is a quantity which has both a direction and a magnitude, like a velocity. If you are moving, the magnitude (length) of your velocity vector is your speed (distance travelled per unit time) and the direction of your velocity vector is your direction of motion. To specify a vector in three dimensions you have to give three components, just as for a point. To draw the vector with components \(a, b, c \) you can draw an arrow from the point \((0,0,0)\) to the point \((a,b,c)\text{.}\)
Similarly, to specify a vector in two dimensions you have to give two components and to draw the vector with components \(a, b \) you can draw an arrow from the point \((0,0)\) to the point \((a,b)\text{.}\)
There are many situations in which it is preferable to draw a vector with its tail at some point other than the origin. For example, it is natural to draw the velocity vector of a moving particle with the tail of the velocity vector at the position of the particle, whether or not the particle is at the origin. The sketch below shows a moving particle and its velocity vector at two different times.
As a second example, suppose that you are analyzing the motion of a pendulum. There are three forces acting on the pendulum bob: gravity \(\vg\text{,}\) which is pulling the bob straight down, tension \(\vt\) in the rod, which is pulling the bob in the direction of the rod, and air resistance \(\vr\text{,}\) which is pulling the bob in a direction opposite to its direction of motion. All three forces are acting on the bob. So it is natural to draw all three arrows representing the forces with their tails at the bob.
In this text, we will used bold faced letters, like \(\vv\text{,}\) \(\vt\text{,}\) \(\vg\text{,}\) to designate vectors. In handwriting, it is clearer to use a small overhead arrow, as in \(\vec{v}\text{,}\) \(\vec{t}\text{,}\) \(\vec{g}\text{,}\) instead. Also, when we want to emphasise that some quantity is a number, rather than a vector, we will call the number a scalar.
Both points and vectors in 2d are specified by two numbers. Until you get used to this, it might confuse you sometimes — does a given pair of numbers represent a point or a vector? To distinguish between the components of a vector and the coordinates of the point at its head, when its tail is at some point other than the origin, we shall use angle brackets rather than round brackets around the components of a vector. For example, the figure below shows the two-dimensional vector \(\llt 2,1\rgt\) drawn in three different positions. In each case, when the tail is at the point \((u,v)\) the head is at \((2+u,1+v)\text{.}\) We warn you that, out in the real world, no one uses notation that distinguishes between components of a vector and the coordinates of its head — usually round brackets are used for both. It is up to you to keep straight which is being referred to.
By way of summary,
Definition 1.2.1.
we use
bold faced letters, like \(\vv\text{,}\) \(\vt\text{,}\) \(\vg\text{,}\) to designate vectors, and
angle brackets, like \(\llt 2,1\rgt\text{,}\) around the components of a vector, but use
round brackets, like \((2,1)\text{,}\) around the coordinates of a point, and use
“scalar” to emphasise that some quantity is a number, rather than a vector.
Subsection 1.2.1 Addition of Vectors and Multiplication of a Vector by a Scalar
Just as we have done many times in the CLP texts, when we define a new type of object, we want to understand how it interacts with the basic operations of addition and multiplication. Vectors are no different, and we shall shortly see a natural way to define addition of vectors. Multiplication will be more subtle, and we shall start with multiplication of a vector by a number (rather than with multiplication of a vector by another vector).
By way of motivation for the definitions of addition and multiplication by a number, imagine that we are out for a walk on the \(xy\)-plane.
Suppose that we take a step and, in doing so, we move \(a_1\) units parallel to the \(x\)-axis and \(a_2\) units parallel to the \(y\)-axis. Then we say that \(\llt a_1, a_2\rgt\) is the displacement vector for the step. Suppose now that we take a second step which moves us an additional \(b_1\) units parallel to the \(x\)-axis and an additional \(b_2\) units parallel to the \(y\)-axis, as in the figure on the left below. So the displacement vector for the second step is \(\llt b_1, b_2\rgt\text{.}\) All together, we have moved \(a_1+b_1\) units parallel to the \(x\)-axis and \(a_2+b_2\) units parallel to the \(y\)-axis. The displacement vector for the two steps combined is \(\llt a_1+b_1, a_2+b_2\rgt\text{.}\) We shall define the sum of \(\llt a_1, a_2\rgt\) and \(\llt b_1, b_2\rgt\text{,}\) denoted by \(\llt a_1, a_2\rgt+\llt b_1,b_2\rgt\text{,}\) to be \(\llt a_1+b_1, a_2+b_2\rgt\text{.}\)
Suppose now that, instead, we decide to step in the same direction as the first step above, but to move twice as far, as in the figure on the right below. That is, our step will move us \(2a_1\) units in the direction of the \(x\)-axis and \(2a_2\) units in the direction of the \(y\)-axis and the corresponding displacement vector will be \(\llt 2a_1, 2a_2\rgt\text{.}\) We shall define the product of the number \(2\) and the vector \(\llt a_1, a_2\rgt\text{,}\) denoted by \(2\llt a_1, a_2\rgt\text{,}\) to be \(\llt 2a_1, 2a_2\rgt\text{.}\)
Here are the formal definitions.
Definition 1.2.2. Adding Vectors and Multiplying a Vector by a Number.
These two operations have the obvious definitions
\begin{align*}
\va=\llt a_1,a_2\rgt,\ \vb =\llt b_1,b_2\rgt
\amp\amp
\implies\amp\amp \va+\vb=\llt a_1+b_1,a_2+b_2\rgt\\
\va=\llt a_1,a_2\rgt,\ s\text{ a number}
\amp\amp
\implies\amp\amp s\va=\llt sa_1,sa_2\rgt
\end{align*}
and similarly in three dimensions.
Pictorially, you add the vector \(\vb\) to the vector \(\va\) by drawing \(\vb\) with its tail at the head of \(\va\) and then drawing a vector from the tail of \(\va\) to the head of \(\vb\text{,}\) as in the figure on the left below. For a number \(s\text{,}\) we can draw the vector \(s\va\text{,}\) by just
changing the vector \(\va\)’s length by the factor \(|s|\text{,}\) and,
if \(s \lt 0\text{,}\) reversing the arrow’s direction,
as in the other two figures below.
The special case of multiplication by \(s=-1\) appears so frequently that \((-1)\va\) is given the shorter notation \(-\va\text{.}\) That is,
\begin{equation*}
-\llt a_1,a_2\rgt=\llt -a_1,-a_2\rgt
\end{equation*}
Of course \(\va+(-\va)\) is \(\vZero\text{,}\) the vector all of whose components are zero.
To subtract \(\vb\) from \(\va\) pictorially, you may add \(-\vb\) (which is drawn by reversing the direction of \(\vb\)) to \(\va\text{.}\) Alternatively, if you draw \(\va\) and \(\vb\) with their tails at a common point, then \(\va-\vb\) is the vector from the head of \(\vb\) to the head of \(\va\text{.}\) That is, \(\va-\vb\) is the vector you must add to \(\vb\) in order to get \(\va\text{.}\)
The operations of addition and multiplication by a scalar that we have just defined are quite natural and rarely cause any problems, because they inherit from the real numbers the properties of addition and multiplication that you are used to.
Theorem 1.2.3. Properties of Addition and Scalar Multiplication.
Let \(\va\text{,}\) \(\vb\) and \(\vc\) be vectors and \(s\) and \(t\) be scalars. Then
\begin{alignat*}{4}
& (1)\quad&&\va+\vb=\vb+\va \qquad\qquad\qquad
&& (2)\quad&&\va+(\vb+\vc)=(\va+\vb)+\vc\\
& (3) &&\va+\vZero =\va
&& (4) &&\va+(-\va)=\vZero\\
& (5) &&s(\va+\vb)=s\va+s\vb
&& (6) &&(s+t)\va=s\va+t\va\\
& (7) &&(st)\va = s(t\va)
&& (8) &&1\va=\va
\end{alignat*}
We have just been introduced to many definitions. Let’s see some of them in action.
Example 1.2.4.
For example, if
\begin{equation*}
\va = \llt 1,2,3\rgt\qquad
\vb = \llt 3,2,1\rgt\qquad
\vc = \llt 1,0,1\rgt
\end{equation*}
then
\begin{alignat*}{2}
2\va&=2\llt 1,2,3\rgt&&=\llt 2,4,6\rgt\\
-\vb&=-\llt 3,2,1\rgt&&=\llt -3,-2,-1\rgt\\
3\vc&=3\llt 1,0,1\rgt&&=\llt 3,0,3\rgt
\end{alignat*}
and
\begin{align*}
2\va-\vb+3\vc
&= \llt 2,4,6\rgt + \llt -3,-2,-1\rgt+\llt 3,0,3\rgt\\
&= \llt 2-3+3\,,\,4-2+0\,,\,6-1+3\rgt\\
&= \llt 2,2,8 \rgt
\end{align*}
Definition 1.2.5.
Two vectors \(\va\) and \(\vb\)
are said to be parallel if \(\ \va= s\,\vb\ \) for some nonzero real number \(s\) and
are said to have the same direction if \(\ \va=s\,\vb\ \) for some number \(s \gt 0\text{.}\)
There are some vectors that occur sufficiently commonly that they are given special names. One is the vector \(\vZero\text{.}\) Some others are the “standard basis vectors”.
Definition 1.2.6.
The standard basis vectors in two dimensions are
\begin{equation*}
\hi = \llt 1,0\rgt \qquad\qquad\hj =\llt 0,1\rgt
\end{equation*}
The standard basis vectors in three dimensions are
\begin{equation*}
\hi = \llt 1,0,0\rgt \qquad\qquad\hj =\llt 0,1,0\rgt
\qquad\qquad\hk =\llt 0,0,1\rgt
\end{equation*}
We’ll explain the little hats in the notation \(\hi\text{,}\) \(\hj\text{,}\) \(\hk\) shortly. Some people rename \(\hi\text{,}\) \(\hj\) and \(\hk\) to \(\he_1\text{,}\) \(\he_2\) and \(\he_3\) respectively. Using the above properties we have, for all vectors,
\begin{equation*}
\llt a_1,a_2\rgt =a_1\,\hi+a_2\,\hj\qquad\qquad
\llt a_1,a_2,a_3\rgt =a_1\,\hi+a_2\,\hj+a_3\,\hk
\end{equation*}
A sum of numbers times vectors, like \(a_1\hi+a_2\hj\) is called a linear combination of the vectors. Thus all vectors can be expressed as linear combinations of the standard basis vectors. This makes basis vectors very helpful in computations. The standard basis vectors are unit vectors, meaning that they are of length one, where the length of a vector \(\va\) is denoted \(|\va|\) and is defined by
Definition 1.2.7. Length of a Vector.
\begin{alignat*}{3}
&\va=\llt a_1,a_2\rgt \qquad&&\implies\qquad &&|\va|=\sqrt{a_1^2+a_2^2}\\
&\va=\llt a_1,a_2,a_3\rgt \qquad&&\implies\qquad &&|\va|=\sqrt{a_1^2+a_2^2+a_3^2}
\end{alignat*}
A unit vector is a vector of length one. We’ll sometimes use the accent \(\hat{\ }\) to emphasise that the vector \(\hat\va\) is a unit vector. That is, \(|\hat\va|=1\text{.}\)
Example 1.2.8.
Recall that multiplying a vector \(\va\) by a positive number \(s\text{,}\) changes the length of the vector by a factor \(s\) without changing the direction of the vector. So (assuming that \(|\va|\ne 0)\) \(\frac{\va}{|\va|}\) is a unit vector that has the same direction as \(\va\text{.}\) For example, \(\frac{\llt 1,1,1\rgt}{\sqrt{3}}\) is a unit vector that points in the same direction as \(\llt 1,1,1\rgt\text{.}\)
Example 1.2.9.
We go for a walk on a flat Earth. We use a coordinate system with the positive x-axis pointing due east and the positive y-axis pointing due north. We
start at the origin and
walk due east for 4 units and then
walk northeast for \(5\sqrt{2}\) units and then
head towards the point \((0,11)\text{,}\) but we only go
one third of the way.
We will now use vectors to figure out our final location.
On the first leg of our walk, we go 4 units in the positive \(x\)-direction. So our displacement vector — the vector whose tail is at our starting point and whose head is at the end point of the first leg — is \(\llt 4,0\rgt\text{.}\) As we started at \((0,0)\) we finish the first leg of the walk at \((4,0)\text{.}\)
On the second leg of our walk, our direction of motion is northeast, i.e. is
\(45^\circ\) above the direction of the positive
\(x\)-axis. Looking at the figure on the right above, we see that our displacement vector, for the second leg of the walk, has to be in the same direction as the vector
\(\llt 1,1\rgt\text{.}\) So our displacement vector is the vector of length
\(5\sqrt{2}\) with the same direction as
\(\llt 1,1\rgt\text{.}\) The vector
\(\llt 1,1\rgt\) has length
\(\sqrt{1^2+1^2}=\sqrt{2}\) and so
\(\frac{\llt 1,1\rgt}{\sqrt{2}}\) has length one and our displacement vector is
\begin{equation*}
5\sqrt{2}\ \frac{\llt 1,1\rgt}{\sqrt{2}}
=5 \llt 1,1\rgt
=\llt 5,5\rgt
\end{equation*}
If we draw this displacement vector,
\(\llt 5,5\rgt\) with its tail at
\((4,0)\text{,}\) the starting point of the second leg of the walk, then its head will be at
\((4+5, 0+5)=(9,5)\) and that is the end point of the second leg of the walk.
On the final leg of our walk, we start at
\((9,5)\) and walk towards
\((0,11)\text{.}\) The vector from
\((9,5)\) to
\((0,11)\) is
\(\llt 0-9\,,\,11-5\rgt =\llt -9,6\rgt\text{.}\) As we go only one third of the way, our final displacement vector is
\begin{equation*}
\frac{1}{3}\llt -9,6\rgt
=\llt -3,2\rgt
\end{equation*}
If we draw this displacement vector with its tail at
\((9,5)\text{,}\) the starting point of the final leg, then its head will be at
\((9-3, 5+2)=(6,7)\) and that is the end point of the final leg of the walk, and our final location.
Subsection 1.2.2 The Dot Product
Let’s get back to the arithmetic operations of addition and multiplication. We will be using both scalars and vectors. So, for each operation there are three possibilities that we need to explore:
“scalar plus scalar”, “scalar plus vector” and “vector plus vector”
“scalar times scalar”, “scalar times vector” and “vector times vector”
We have been using “scalar plus scalar” and “scalar times scalar” since childhood. “vector plus vector” and “scalar times vector” were just defined above. There is no sensible way to define “scalar plus vector”, so we won’t. This leaves “vector times vector”. There are actually two widely used such products. The first is the dot product, which is the topic of this section, and which is used to easily determine the angle \(\theta\) (or more precisely, \(\cos\theta\)) between two vectors. We’ll get to the second, the cross product, later.
Here is preview of what we will do in this dot product subsection §
1.2.2. We are going to give two formulae for the dot product,
\(\va\cdot\vb\text{,}\) of the pair of vectors
\(\va=\llt a_1,a_2,a_3\rgt\) and
\(\vb=\llt b_1,b_2,b_3\rgt\text{.}\)
The first formula is \(\va\cdot\vb = a_1b_1+a_2b_2+a_3b_3\text{.}\) We will take it as our official definition of \(\va\cdot\vb\text{.}\) This formula provides us with an easy way to compute dot products.
-
The second formula is \(\va\cdot\vb=|\va|\,|\vb|\,\cos\theta\text{,}\) where \(\theta\) is the angle between \(\va\) and \(\vb\text{.}\)
We will show, in Theorem
1.2.11 below, that this second formula always gives the same answer as the first formula. The second formula provides us with an easy way to determine the angle between two vectors. In particular, it provides us with an easy way to test whether or not two vectors are perpendicular to each other. For example, the vectors
\(\llt 1,2,3\rgt\) and
\(\llt -1,-1,1\rgt\) have dot product
\begin{equation*}
\llt 1,2,3\rgt\cdot\llt -1,-1,1\rgt = 1\times(-1)+2\times(-1)+3\times 1=0
\end{equation*}
This tell us as the angle \(\theta\) between the two vectors obeys \(\cos\theta=0\text{,}\) so that \(\theta=\frac{\pi}{2}\text{.}\) That is, the two vectors are perpendicular to each other.
After we give our official definition of the dot product in Definition
1.2.10, and give the important properties of the dot product, including the formula
\(\va\cdot\vb=|\va|\,|\vb|\,\cos\theta\text{,}\) in Theorem
1.2.11, we’ll give some examples. Finally, to see the dot product in action, we’ll define what it means to project one vector on another vector and give an example.
Definition 1.2.10. Dot Product.
The dot product of the vectors \(\va\) and \(\vb\) is denoted \(\va\cdot\vb\) and is defined by
\begin{alignat*}{3}
&\va=\llt a_1,a_2\rgt ,\quad &&\vb=\llt b_1,b_2\rgt\quad &\implies\quad
&\va\cdot\vb = a_1b_1+a_2b_2\\
&\va=\llt a_1,a_2,a_3\rgt ,\quad &&\vb=\llt b_1,b_2,b_3\rgt\quad &\implies\quad
&\va\cdot\vb = a_1b_1+a_2b_2+a_3b_3
\end{alignat*}
in two and three dimensions respectively.
The properties of the dot product are as follows:
Theorem 1.2.11. Properties of the Dot Product.
Let \(\va\text{,}\) \(\vb\) and \(\vc\) be vectors and let \(s\) be a scalar. Then
\begin{alignat*}{2}
&(0)\quad &&\va,\vb\text{ are vectors and }\va\cdot\vb
\text{ is a scalar}\\
&(1) &&\va\cdot\va=|\va|^2\\
&(2) &&\va\cdot\vb=\vb\cdot\va\\
&(3) &&\va\cdot(\vb+\vc)=\va\cdot\vb+\va\cdot\vc,\quad
(\va+\vb)\cdot\vc=\va\cdot\vc+\vb\cdot\vc\\
&(4) &&(s\va)\cdot\vb= s(\va\cdot\vb)\\
&(5) &&\vZero \cdot\va=0\\
&(6) &&\va\cdot\vb=|\va|\,|\vb|\,\cos\theta
\text{ where \(\theta\) is the angle between \(\va\) and \(\vb\)}\\
&(7) &&\va\cdot\vb=0\iff \va=\vZero \text{ or }\vb=\vZero
\text{ or } \va\perp\vb
\end{alignat*}
Proof.
Properties 0 through 5 are almost immediate consequences of the definition. For example, for property 3 (which is called the distributive law) in dimension 2,
\begin{align*}
\va\cdot(\vb+\vc)
&=\llt a_1,a_2\rgt \cdot\llt b_1+c_1,b_2+c_2\rgt\\
&=a_1(b_1+c_1)+a_2(b_2+c_2)=a_1b_1+a_1c_1+a_2b_2+a_2c_2\\
\va\cdot\vb+\va\cdot\vc
&=\llt a_1,a_2\rgt \cdot\llt b_1,b_2\rgt
+\llt a_1,a_2\rgt \cdot\llt c_1,c_2\rgt\\
&=a_1b_1+a_2b_2+a_1c_1+a_2c_2
\end{align*}
Property 6 is sufficiently important that it is often used as the definition of dot product. It is not at all an obvious consequence of the definition. To verify it, we just write \(|\va-\vb|^2\) in two different ways. The first expresses \(|\va-\vb|^2\) in terms of \(\va\cdot\vb\text{.}\) It is
\begin{align*}
|\va-\vb|^2\ &{\buildrel 1 \over =}\ (\va-\vb\,)\cdot(\va-\vb\,)\\
&{\buildrel 3 \over =}\ \va\cdot\va-\va\cdot\vb-\vb\cdot\va
+\vb\cdot\vb\\
&{\buildrel 1,2 \over =}\ |\va|^2+|\vb|^2-2\va\cdot\vb
\end{align*}
Here, \({\buildrel 1 \over =}\text{,}\) for example, means that the equality is a consequence of property 1. The second way we write \(|\va-\vb|^2\) involves \(\cos\theta\) and follows from the cosine law for triangles. Just in case you don’t remember the cosine law, we’ll derive it right now! Start by applying Pythagoras to the shaded triangle in the right hand figure of
That triangle is a right triangle whose hypotenuse has length \(|\va-\vb|\) and whose other two sides have lengths \(\big(|\vb|-|\va|\cos\theta\big)\) and \(|\va|\sin\theta\text{.}\) So Pythagoras gives
\begin{align*}
|\va-\vb|^2&=\big(|\vb|-|\va|\cos\theta\big)^2+
\big(|\va|\sin\theta\big)^2\\
&=|\vb|^2-2|\va|\,|\vb|\,\cos\theta+|\va|^2\cos^2\theta
+|\va|^2\sin^2\theta\\
&=|\vb|^2-2|\va|\,|\vb|\,\cos\theta+|\va|^2
\end{align*}
This is precisely the cosine law. Observe that, when \(\theta=\tfrac{\pi}{2}\text{,}\) this reduces to, (surprise!) Pythagoras’ theorem.
Setting our two expressions for \(|\va-\vb|^2\) equal to each other,
\begin{gather*}
|\va-\vb|^2=|\va|^2+|\vb|^2-2\va\cdot\vb
=|\vb|^2-2|\va|\,|\vb|\,\cos\theta+|\va|^2
\end{gather*}
cancelling the \(|\va|^2\) and \(|\vb|^2\) common to both sides
\begin{gather*}
-2\va\cdot\vb
=-2|\va|\,|\vb|\,\cos\theta
\end{gather*}
and dividing by \(-2\) gives
\begin{gather*}
\va\cdot\vb=|\va|\,|\vb|\,\cos\theta
\end{gather*}
which is exactly property 6.
Property 7 follows directly from property 6. First note that the dot product \(\va\cdot\vb=|\va|\,|\vb|\,\cos\theta\) is zero if and only if at least one of the three factors \(|\va|,\ |\vb|,\ \cos\theta\) is zero. The first factor is zero if and only if \(\va=\vZero \text{.}\) The second factor is zero if and only if \(\vb=\vZero \text{.}\) The third factor is zero if and only if \(\theta=\pm\tfrac{\pi}{2}+2k\pi\text{,}\) for some integer \(k\text{,}\) which in turn is true if and only if \(\va\) and \(\vb\) are mutually perpendicular.
Because of Property 7 of Theorem
1.2.11, the dot product can be used to test whether or not two vectors are perpendicular to each other. That is, whether or not the angle between the two vectors is
\(90^\circ\text{.}\) Another name for “perpendicular” is “orthogonal”. Testing for orthogonality is one of the main uses of the dot product.
Example 1.2.12.
Consider the three vectors
\begin{equation*}
\va=\llt 1,1,0 \rgt\qquad
\vb=\llt 1,0,1 \rgt \qquad
\vc=\llt -1,1,1\rgt
\end{equation*}
Their dot products
\begin{alignat*}{3}
\va\cdot\vb & = \llt 1,1,0 \rgt \cdot \llt 1,0,1 \rgt
&& = 1\times 1 +1\times 0+0\times 1
&& = 1\\
\va\cdot\vc & = \llt 1,1,0 \rgt \cdot \llt -1,1,1 \rgt
&& = 1\times(-1) +1\times 1+0\times 1
&& = 0\\
\vb\cdot\vc & = \llt 1,0,1 \rgt \cdot \llt -1,1,1 \rgt
&& = 1\times(-1) +0\times 1+1\times 1
&& = 0
\end{alignat*}
tell us that \(\vc\) is perpendicular to both \(\va\) and \(\vb\text{.}\) Since both \(|\va|=|\vb|=\sqrt{1^2+1^2+0^2}=\sqrt{2}\) the first dot product tells us that the angle, \(\theta\text{,}\) between \(\va\) and \(\vb\) obeys
\begin{gather*}
\cos\theta =\frac{\va\cdot\vb}{|\va|\,|\vb|}=\frac{1}{2}
\implies
\theta =\frac{\pi}{3}
\end{gather*}
Dot products are also used to compute projections. First, here’s the definition.
Definition 1.2.13. Projection.
Draw two vectors, \(\va\) and \(\vb\text{,}\) with their tails at a common point and drop a perpendicular from the head of \(\va\) to the line that passes through both the head and tail of \(\vb\text{.}\) By definition, the projection of the vector \(\va\) on the vector \(\vb\) is the vector from the tail of \(\vb\) to the point on the line where the perpendicular hits.
Think of the projection of \(\va\) on \(\vb\) as the part of \(\va\) that is in the direction of \(\vb\text{.}\)
Now let’s develop a formula for the projection of
\(\va\) on
\(\vb\text{.}\) Denote by
\(\theta\) the angle between
\(\va\) and
\(\vb\text{.}\) If
\(|\theta|\) is no more than
\(90^\circ\text{,}\) as in the figure on the left above, the length of the projection of
\(\va\) on
\(\vb\) is
\(|\va|\cos\theta\text{.}\) By Property 6 of Theorem
1.2.11,
\(|\va|\cos\theta=\va\cdot\vb/|\vb|\text{,}\) so the projection is a vector whose length is
\(\va\cdot\vb/|\vb|\) and whose direction is given by the unit vector
\(\vb/|\vb|\text{.}\) Hence
\begin{gather*}
\text{projection of \(\va\) on \(\vb\)}={\rm proj}_{\vb}\,\va
=\frac{\va\cdot\vb}{|\vb|}\frac{\vb}{|\vb|}
=\frac{\va\cdot\vb}{|\vb|^2}\,\vb
\end{gather*}
If \(|\theta|\) is larger than \(90^\circ\text{,}\) as in the figure on the right above, the projection has length \(|\va|\,\cos(\pi-\theta)=-|\va|\cos\theta=-\va\cdot\vb/|\vb|\) and direction \(-\vb/|\vb|\text{.}\) In this case
\begin{gather*}
{\rm proj}_{\vb}\,\va
=-\frac{\va\cdot\vb}{|\vb|}\ \frac{-\vb}{|\vb|}
=\frac{\va\cdot\vb}{|\vb|^2}\ \vb
\end{gather*}
too. So the formula
Equation 1.2.14.
\begin{equation*}
{\rm proj}_{\vb}\,\va=\frac{\va\cdot\vb}{|\vb|^2}\,\vb
\end{equation*}
is applicable whenever \(\vb\ne\vZero \text{.}\) We may rewrite \({\rm proj}_{\vb}\,\va=\frac{\va\cdot\vb}{|\vb|}\,\frac{\vb}{|\vb|}\text{.}\) The coefficient, \(\frac{\va\cdot\vb}{|\vb|}\text{,}\) of the unit vector \(\frac{\vb}{|\vb|}\text{,}\) is called the component of \(\va\) in the direction \(\vb\text{.}\) As a special case, if \(\vb\) happens to be a unit vector, which, for emphasis, we’ll now write has \(\hat\vb\text{,}\) the projection formula simplifies to
Equation 1.2.15.
\begin{equation*}
{\rm proj}_{\hat\vb}\,\va
= (\va\cdot\hat\vb)\,\hat\vb
\end{equation*}
Example 1.2.16.
In this example, we will find the projection of the vector \(\llt 0,3\rgt\) on the vector \(\llt 1,1\rgt\text{,}\) as in the figure
By Equation
1.2.14 with
\(\va=\llt 0,3\rgt\) and
\(\vb=\llt 1,1\rgt\text{,}\) that projection is
\begin{align*}
{\rm proj}_{\llt 1,1\rgt}\,\llt 0,3\rgt
&=\frac{\llt 0,3\rgt\cdot\llt 1,1\rgt}{|\llt 1,1\rgt|^2}\,\llt 1,1\rgt\\
&=\frac{0\times1+3\times 1}{1^2+1^2}\,\llt 1,1\rgt
=\llt \frac{3}{2},\frac{3}{2}\rgt
\end{align*}
One use of projections is to “resolve forces”. There is an example in the next (optional) section.
Subsection 1.2.3 (Optional) Using Dot Products to Resolve Forces — The Pendulum
Model a pendulum by a mass \(m\) that is connected to a hinge by an idealized rod that is massless and of fixed length \(\ell\text{.}\) Denote by \(\theta\) the angle between the rod and vertical. The forces acting on the mass are
gravity, which has magnitude \(mg\) and direction \(\llt 0,-1\rgt\text{,}\)
tension in the rod, whose magnitude \(\tau(t)\) automatically adjusts itself so that the distance between the mass and the hinge is fixed at \(\ell\) (so that the rod does not stretch or contract) and whose direction is always parallel to the rod,
and possibly some frictional forces, like friction in the hinge and air resistance. Assume that the total frictional force has magnitude proportional to the speed of the mass and has direction opposite to the direction of motion of the mass. We’ll call the constant of proportionality \(\beta\text{.}\)
If we use a coordinate system centered on the hinge, the \((x,y)\) coordinates of the mass at time \(t\) are
\begin{align*}
x(t)&=\ell\sin\theta(t)\\
y(t)&=-\ell\cos\theta(t)
\end{align*}
where \(\theta(t)\) is the angle between the rod and vertical at time \(t\text{.}\) We are now going to use Newton’s law of motion
\begin{gather*}
\text{mass}\times\text{acceleration}=\text{total applied force}
\end{gather*}
to determine now \(\theta\) evolves in time. By definition, the velocity and acceleration vectors for the position vector \(\llt x(t),y(t)\rgt\) are
\begin{align*}
\diff{}{t}\llt x(t),y(t)\rgt
&= \llt \diff{x}{t}(t),\diff{y}{t}(t)\rgt\\
\ddiff{2}{}{t}\llt x(t),y(t)\rgt
&=\llt \ddiff{2}{x}{t}(t),\ddiff{2}{y}{t}(t)\rgt
\end{align*}
So, the velocity and acceleration vectors of our mass are
\begin{alignat*}{1}
\vv(t)&=\diff{}{t}\llt x(t),y(t)\rgt\\
&=\llt \ell\diff{}{t}\sin\theta(t),-\ell\diff{}{t}\cos\theta(t)\rgt\\
&=\llt \ell\cos\theta(t)\,\diff{\theta}{t}(t)\,,\,
\ell\sin\theta(t)\,\diff{\theta}{t}(t)\rgt\\
&=\ell\,\diff{\theta}{t}(t)\,\llt \cos\theta(t),\sin\theta(t)\rgt\\
\va(t)&=\ddiff{2}{}{t}\llt x(t),y(t)\rgt\\
&=\diff{}{t}
\left\{\ell\,\diff{\theta}{t}(t)\,
\llt \cos\theta(t),\sin\theta(t)\rgt \right\}\\
&=\ell\,\ddiff{2}{\theta}{t}(t)\,\llt \cos\theta(t),\sin\theta(t)\rgt
+\ell\,\diff{\theta}{t}(t)
\llt \diff{}{t}\cos\theta(t),\diff{}{t}\sin\theta(t)\rgt\\
&= \ell\,\ddiff{2}{\theta}{t}(t)\llt \cos\theta(t),\sin\theta(t)\rgt
+\ell \Big(\diff{\theta}{t}(t)\Big)^2
\llt -\sin\theta(t),\cos\theta(t)\rgt
\end{alignat*}
The negative of the velocity vector is \(- \ell\, \diff{\theta}{t}\llt \cos\theta,\sin\theta\rgt\text{,}\) so the total frictional force is
\begin{equation*}
-\beta\ell\, \diff{\theta}{t}\llt \cos\theta,\sin\theta\rgt
\end{equation*}
with \(\beta\) our constant of proportionality.
The vector
\begin{equation*}
\tau(t) \llt -\sin\theta(t),\cos\theta(t)\rgt
\end{equation*}
has magnitude \(\tau(t)\) and direction parallel to the rod pointing from the mass towards the hinge and so is the force due to tension in the rod.
Hence, for this physical system, Newton’s law of motion is
\begin{align*}
&\overbrace{m\ell\,\ddiff{2}{\theta}{t}\llt \cos\theta,\sin\theta\rgt
+m\ell\,\Big(\diff{\theta}{t}\Big)^2\llt -\sin\theta,\cos\theta\rgt}^
{\text{mass}\times\text{acceleration}}\\
&\hskip1in=
\overbrace{mg\llt 0,-1\rgt}^{\rm gravity}
+\overbrace{\tau \llt -\sin\theta,\cos\theta\rgt}^{\rm tension}
-\overbrace{\beta\ell\,\diff{\theta}{t}\llt \cos\theta,\sin\theta\rgt}^{\rm friction}
\tag{\(*\)}
\end{align*}
This is a rather complicated looking equation. Writing out its
\(x\)- and
\(y\)-components doesn’t help. They also look complicated. Instead, the equation can be considerably simplified (and consequently better understood) by “taking its components parallel to and perpendicular to the direction of motion”. From the velocity vector
\(\vv(t)\text{,}\) we see that
\(\llt \cos\theta(t),\sin\theta(t)\rgt \) is a unit vector parallel to the direction of motion at time
\(t\text{.}\) Recall, from
1.2.15, that the projection of any vector
\(\vb\) on any unit vector
\(\hat\vd\) (with the “hat” on
\(\hat\vd\) reminding ourselves that the vector is a unit vector) is
\begin{gather*}
\big(\vb\cdot \hat\vd\big)\,\hat\vd
\end{gather*}
The coefficient \(\vb\cdot \hat\vd\) is, by definition, the component of \(\vb\) in the direction \(\hat\vd\text{.}\) So, by dotting both sides of the equation of motion \((*)\) with \(\hat\vd=\llt \cos\theta(t),\sin\theta(t)\rgt \text{,}\) we extract the component parallel to the direction of motion. Since
\begin{align*}
\llt \cos\theta,\sin\theta\rgt \cdot\llt \cos\theta,\sin\theta\rgt &=1\\
\llt \cos\theta,\sin\theta\rgt \cdot\llt -\sin\theta,\cos\theta\rgt &=0\\
\llt \cos\theta,\sin\theta\rgt \cdot\llt 0,-1\rgt &=-\sin\theta
\end{align*}
this gives
\begin{gather*}
m\ell\ddiff{2}{\theta}{t}=-mg\sin\theta-\beta\ell\diff{\theta}{t}
\end{gather*}
which is much cleaner than \((*)\text{!}\) When \(\theta\) is small, we can approximate \(\sin\theta\approx\theta\) and get the equation
\begin{gather*}
\ddiff{2}{\theta}{t}+\frac{\beta}{m}\diff{\theta}{t}+\frac{g}{\ell}\theta=0
\end{gather*}
which is easily solved. There are systematic procedures for finding the solution, but we’ll just guess.
When there is no friction (so that \(\beta=0\)), we would expect the pendulum to just oscillate. So it is natural to guess
\begin{equation*}
\theta(t)=A\sin(\omega t-\delta)
\end{equation*}
which is an oscillation with (unknown) amplitude \(A\text{,}\) frequency \(\omega\) (radians per unit time) and phase \(\delta\text{.}\) Substituting this guess into the left hand side, \(\theta'' + \tfrac{g}{\ell}\theta\text{,}\) yields
\begin{equation*}
-A\omega^2\sin(\omega t-\delta)+A\tfrac{g}{\ell}\sin(\omega t-\delta)
\end{equation*}
which is zero if \(\omega=\sqrt{g/\ell}\text{.}\) So \(\ \theta(t)=A\sin(\omega t-\delta)\ \) is a solution for any amplitude \(A\) and phase \(\delta\text{,}\) provided the frequency \(\omega=\sqrt{g/\ell}\text{.}\)
When there is some, but not too much, friction, so that \(\beta \gt 0\) is relatively small, we would expect “oscillation with decaying amplitude”. So we guess
\begin{equation*}
\theta(t)=Ae^{-\gamma t}\sin(\omega t-\delta)
\end{equation*}
for some constant decay rate \(\gamma\text{,}\) to be determined. With this guess,
\begin{alignat*}{3}
\theta(t)&=\phantom{- -\gamma\omega^{2})} Ae^{-\gamma t}
&\sin(\omega t-\delta)\\
\theta'(t)&=\phantom{-\omega^{2})}-\gamma Ae^{-\gamma t}
&\sin(\omega t-\delta)
&+\phantom{2\gamma }\omega A e^{-\gamma t}&\cos(\omega t-\delta)\\
\theta''(t)&=(\gamma^2-\omega^2)Ae^{-\gamma t}&\sin(\omega t-\delta)
&-2\gamma\omega A e^{-\gamma t}&\cos(\omega t-\delta)
\end{alignat*}
and the left hand side
\begin{align*}
\ddiff{2}{\theta}{t}+\frac{\beta}{m}\diff{\theta}{t}+\frac{g}{\ell}\theta
&
=\left[\gamma^2-\omega^2-\frac{\beta}{m}\gamma+\frac{g}{\ell}\right]
Ae^{-\gamma t}\sin(\omega t-\delta)\\
&\hskip0.5in+\left[-2\gamma\omega+\frac{\beta}{m}\omega\right]
Ae^{-\gamma t}\cos(\omega t-\delta)
\end{align*}
vanishes if \(\gamma^2-\omega^2-\frac{\beta}{m}\gamma+\tfrac{g}{\ell}=0\) and \(-2\gamma\omega+\frac{\beta}{m}\omega=0.\) The second equation tells us the decay rate \(\gamma=\tfrac{\beta}{2m}\) and then the first tells us the frequency
\begin{gather*}
\omega=\sqrt{\gamma^2-\tfrac{\beta}{m}\gamma+\tfrac{g}{\ell}}
=\sqrt{\tfrac{g}{\ell}-\tfrac{\beta^2}{4m^2}}
\end{gather*}
When there is a lot of friction (namely when \(\tfrac{\beta^2}{4m^2} \gt \tfrac{g}{\ell}\text{,}\) so that the frequency \(\omega\) is not a real number), we would expect damping without oscillation and so would guess \(\theta(t)=Ae^{-\gamma t}\text{.}\) You can determine the allowed values of \(\gamma\) by substituting this guess in.
To extract the components perpendicular to the direction of motion, we dot with \(\llt -\sin\theta,\cos\theta\rgt \) rather than \(\llt \cos\theta,\sin\theta\rgt \text{.}\) Note that, because
\begin{equation*}
\llt -\sin\theta,\cos\theta\rgt \cdot \llt \cos\theta,\sin\theta\rgt =0,
\end{equation*}
the vector \(\llt -\sin\theta,\cos\theta\rgt \) really is perpendicular to the direction of motion. Since
\begin{align*}
\llt -\sin\theta,\cos\theta\rgt \cdot\llt \cos\theta,\sin\theta\rgt &=0\\
\llt -\sin\theta,\cos\theta\rgt \cdot\llt -\sin\theta,cos\theta\rgt &=1\\
\llt -\sin\theta,\cos\theta\rgt \cdot\llt 0,-1\rgt &=-\cos\theta
\end{align*}
dotting both sides of the equation of motion \((*)\) with \(\llt -\sin\theta,\cos\theta\rgt \) gives
\begin{gather*}
m\ell\Big(\diff{\theta}{t}\Big)^2=-mg\cos\theta+\tau
\end{gather*}
This equation just determines the tension
\begin{equation*}
\tau=m\ell\big(\diff{\theta}{t}\big)^2+mg\cos\theta
\end{equation*}
in the rod, once you know \(\theta(t)\text{.}\)
Subsection 1.2.4 (Optional) Areas of Parallelograms
A parallelogram is naturally determined by the two vectors that define its sides. We’ll now develop a formula for the area of a parallelogram in terms of these two vectors.
Construct a parallelogram as follows. Pick two vectors \(\llt a,b\rgt \) and \(\llt c,d\rgt \text{.}\) Draw them with their tails at a common point. Then draw \(\llt a,b\rgt \) a second time with its tail at the head of \(\llt c,d\rgt \) and draw \(\llt c,d\rgt \) a second time with its tail at the head of \(\llt a,b\rgt \text{.}\) If the common point is the origin, you get a picture like the figure below.
Any parallelogram can be constructed like this if you pick the common point and two vectors appropriately. Let’s compute the area of the parallelogram. The area of the large rectangle with vertices \((0,0),\ (0, b+d),\ (a+c,0)\) and \((a+c,b+d)\) is \((a+c)(b+d)\text{.}\) The parallelogram we want can be extracted from the large rectangle by deleting the two small rectangles (each of area \(bc\)), and the two lightly shaded triangles (each of area \(\half cd\)), and the two darkly shaded triangles (each of area \(\half ab\)). So the desired
\begin{gather*}
{\rm area} = (a+c)(b+d) - (2\times bc) -\big(2\times \half cd\big)
-\big(2\times\half ab\big)
=ad-bc
\end{gather*}
In the above figure, we have implicitly assumed that \(a,\ b,\ c,\ d\ge 0\) and \(d/c\ge b/a\text{.}\) In words, we have assumed that both vectors \(\llt a,b\rgt ,\ \llt c,d\rgt \) lie in the first quadrant and that \(\llt c,d\rgt \) lies above \(\llt a,b\rgt \text{.}\) By simply interchanging \(a\leftrightarrow c\) and \(b\leftrightarrow d\) in the picture and throughout the argument, we see that when \(a,\ b,\ c,\ d\ge 0\) and \(b/a\ge d/c\text{,}\) so that the vector \(\llt c,d\rgt \) lies below \(\llt a,b\rgt \text{,}\) the area of the parallelogram is \(bc-ad\text{.}\) In fact, all cases are covered by the formula
Equation 1.2.17.
\begin{gather*}
\text{area of parallelogram with sides }
\llt a,b\rgt
\text{ and }
\llt c,d\rgt =|ad-bc|
\end{gather*}
Given two vectors \(\llt a,b\rgt \) and \(\llt c,d\rgt \text{,}\) the expression \(ad-bc\) is generally written
\begin{align*}
\det\left[\begin{matrix}a&b\\
c&d\end{matrix}\right]=ad-bc
\end{align*}
and is called the determinant of the matrix
\begin{align*}
\left[\begin{matrix}a&b \\ c&d\end{matrix}\right]
\end{align*}
with rows \(\llt a,b\rgt \) and \(\llt c,d\rgt \text{.}\) The determinant of a \(2\times 2\) matrix is the product of the diagonal entries minus the product of the off-diagonal entries.
There is a similar formula in three dimensions. Any three vectors \(\va=\llt a_1,a_2,a_3\rgt ,\ \vb=\llt b_1,b_2,b_3\rgt \) and \(\vc=\llt c_1,c_2,c_3\rgt \) in three dimensions
determine a parallelepiped (three dimensional parallelogram). Its volume is given by the formula
Equation 1.2.18.
\begin{align*}
\text{volume of parallelepiped with edges }
\va,\ \vb,\ \vc\
=\ \left|
\det\left[\begin{matrix}a_1&a_2&a_3 \\ b_1&b_2&b_3\\ c_1&c_2&c_3\end{matrix}\right]
\right|
\end{align*}
The determinant of a \(3\times 3\) matrix can be defined in terms of some \(2\times 2\) determinants by
This formula is called “expansion along the top row”. There is one term in the formula for each entry in the top row of the \(3\times 3\) matrix. The term is a sign times the entry itself times the determinant of the \(2\times 2\) matrix gotten by deleting the row and column that contains the entry. The sign alternates, starting with a “\(+\)”.
We shall not prove this formula completely here. It gets a little tedious. But, there is one case in which we can easily verify that the volume of the parallelepiped is really given by the absolute value of the claimed determinant. If the vectors \(\vb\) and \(\vc\) happen to lie in the \(xy\) plane, so that \(b_3=c_3=0\text{,}\) then
\begin{align*}
\det\left[\begin{matrix}
a_1&a_2&a_3\\ b_1&b_2&0\\ c_1&c_2&0
\end{matrix}\right]
&=a_1(b_20-0c_2) -a_2(b_10-0c_1) +a_3(b_1c_2-b_2c_1)\\
&=a_3(b_1c_2-b_2c_1)
\end{align*}
The first factor, \(a_3\text{,}\) is the \(z\)-coordinate of the one vector not contained in the \(xy\)-plane. It is (up to a sign) the height of the parallelepiped. The second factor is, up to a sign, the area of the parallelogram determined by \(\vb\) and \(\vc\text{.}\) This parallelogram forms the base of the parallelepiped. The product is indeed, up to a sign, the volume of the parallelepiped. That the formula is true in general is a consequence of the fact (that we will not prove) that the value of a determinant does not change when one rotates the coordinate system and that one can always rotate our coordinate axes around so that \(\vb\) and \(\vc\) both lie in the \(xy\)-plane.
Subsection 1.2.5 The Cross Product
We have already seen two different products involving vectors — the multiplication of a vector by a scalar and the dot product of two vectors. The dot product of two vectors yields a scalar. We now introduce another product of two vectors, called the cross product. The cross product of two vectors will give a vector. There are applications which have two vectors as inputs and produce one vector as an output, and which are related to the cross product. Here is a very brief mention of two such applications. We will look at them in much more detail later.
Finally, here is the definition of the cross product. Note that it applies only to vectors in three dimensions.
Definition 1.2.19. Cross Product.
The cross product of the vectors \(\va=\llt a_1,a_2,a_3\rgt\) and \(\vb=\llt b_1,b_2,b_3\rgt\) is denoted \(\va\times\vb\) and is defined by
\begin{gather*}
\va\times\vb = \llt a_2b_3-a_3b_2\,,\, a_3b_1-a_1b_3\,,\, a_1b_2-a_2b_1\rgt
\end{gather*}
Note that each component has the form \(a_ib_j-a_jb_i\text{.}\) The index \(i\) of the first \(a\) in component number \(k\) of \(\va\times\vb\) is just after \(k\) in the list \(1,2,3,1,2,3,1,2,3,\cdots\text{.}\) The index \(j\) of the first \(b\) is just before \(k\) in the list.
\begin{gather*}
(\va\times\vb)_k
=a_{{\rm just\ after\ }k}\ b_{{\rm just\ before\ }k}
-a_{{\rm just\ before\ }k}\ b_{{\rm just\ after\ }k}
\end{gather*}
For example, for component number \(k=3\text{,}\)
\begin{gather*}
\left.{\text{''just after 3'' is 1}}\atop{\text{''just before 3'' is 2}}\right\}
\implies (\va\times\vb)_3= a_1b_2-a_2b_1
\end{gather*}
There is a much better way to remember this definition. Recall that a \(2\times 2\) matrix is an array of numbers having two rows and two columns and that the determinant of a \(2\times 2\) matrix is defined by
\begin{align*}
\det \left[\begin{matrix}a& b \\ c&d\end{matrix}\right]=ad-bc
\end{align*}
It is the product of the entries on the diagonal minus the product of the entries not on the diagonal.
A \(3\times 3\) matrix is an array of numbers having three rows and three columns.
\begin{align*}
\left[\begin{matrix}
i& j &k
\\
a_1&a_2&a_3
\\
b_1&b_2&b_3\end{matrix}\right]
\end{align*}
You will shortly see why the entries in the top row have been given the rather peculiar names \(i\text{,}\) \(j\) and \(k\text{.}\) The determinant of a \(3\times 3\) matrix can be defined in terms of some \(2\times 2\) determinants by
This formula is called “expansion of the determinant along the top row”. There is one term in the formula for each entry in the top row. The term is a sign times the entry itself times the determinant of the
\(2\times 2\) matrix gotten by deleting the row and column that contains the entry. The sign alternates, starting with a
\(+\text{.}\) If we now replace
\(i\) by
\(\hi\text{,}\) \(j\) by
\(\hj\) and
\(k\) by
\(\hk\text{,}\) we get exactly the formula for
\(\va\times \vb\) of Definition
1.2.19. That is the reason for the peculiar choice of names for the matrix entries. So
\begin{align*}
\va\times\vb
&=\det\left[\begin{matrix}\hi& \hj &\hk
\\
a_1&a_2&a_3
\\
b_1&b_2&b_3\end{matrix}\right]\\
&=\hi\big(a_2b_3-a_3b_2) -\hj(a_1b_3-a_3b_1) +\hk(a_1b_2-a_2b_1)
\end{align*}
is a mnemonic device for remembering the definition of \(\va\times\vb\text{.}\) It is also good from the point of view of evaluating \(\va\times\vb\text{.}\) Here are several examples in which we use the determinant mnemonic device to evaluate cross products.
Example 1.2.20.
In this example, we’ll use the mnemonic device to compute two very simple cross products. First
Second
Note that, unlike most (or maybe even all) products that you have seen before, \(\hi\times\hj\) is not the same as \(\hj\times\hi\text{!}\)
Example 1.2.21.
In this example, we’ll use the mnemonic device to compute two more complicated cross products. Let \(\va=\llt 1,2,3 \rgt\) and \(\vb=\llt 1,-1,2 \rgt\text{.}\) First
Second
Here are some important observations.
The vectors
\(\va\times\vb\) and
\(\vb\times\va\) are not the same! In fact
\(\vb\times\va=-\va\times\vb\text{.}\) We shall see in Theorem
1.2.23 below that this was not a fluke.
The vector
\(\va\times\vb\) has dot product zero with both
\(\va\) and
\(\vb\text{.}\) So the vector
\(\va\times\vb\) is perpendicular to both
\(\va\) and
\(\vb\text{.}\) We shall see in Theorem
1.2.23 below that this was also not a fluke.
Example 1.2.22.
Yet again we use the mnemonic device to compute a more complicated cross product. This time let \(\va=\llt 3,2,1 \rgt\) and \(\vb=\llt 6,4,2 \rgt\text{.}\) Then
We shall see in Theorem
1.2.23 below that it is not a fluke that the cross product is
\(\vZero\text{.}\) It is a consequence of the fact that
\(\va\) and
\(\vb=2\va\) are parallel.
We now move on to learning about the properties of the cross product. Our first properties lead up to a more intuitive geometric definition of \(\va\times\vb\text{,}\) which is better for interpreting \(\va\times\vb\text{.}\) These properties of the cross product, which state that \(\va\times\vb\) is a vector and then determine its direction and length, are as follows. We will collect these properties, and a few others, into a theorem shortly.
- (0)
\(\va,\vb\) are vectors in three dimensions and \(\va\times\vb\) is a vector in three dimensions.
- (1)
\(\va\times\vb\) is perpendicular to both \(\va\) and \(\vb\text{.}\)
Proof.
To check that \(\va\) and \(\va\times \vb\) are perpendicular, one just has to check that the dot product \(\va\cdot(\va\times \vb)=0\text{.}\) The six terms in
\begin{equation*}
\va\cdot(\va\times \vb)=
a_1(a_2b_3-a_3b_2)+a_2(a_3b_1-a_1b_3)+a_3(a_1b_2-a_2b_1)
\end{equation*}
cancel pairwise. The computation showing that \(\vb\cdot(\va\times \vb)=0\) is similar.
- (2)
-
\begin{align*}
|\va\times\vb| \amp = |\va|\,|\vb|\sin\theta
\text{ where } 0\le\theta\le\pi \text{ is the angle between } \va,\vb\\
\amp = \text{the area of the parallelogram with sides } \va, \vb
\end{align*}
Proof.
The formula \(|\va\times\vb|=|\va|\,|\vb|\sin\theta\) is gotten by verifying that
\begin{align*}
|\va\times\vb|^2
&=\big(\va\times\vb\big)\cdot\big(\va\times\vb\big)\\
&=(a_2b_3-a_3b_2)^2+(a_3b_1-a_1b_3)^2+ (a_1b_2-a_2b_1)^2\\
&=a_2^2b_3^2-2a_2b_3a_3b_2+a_3^2b_2^2
+a_3^2b_1^2-2a_3b_1a_1b_3+a_1^2b_3^2\\
&\hskip0.5in
+a_1^2b_2^2-2a_1b_2a_2b_1+a_2^2b_1^2\\
\end{align*}
is equal to
\begin{align*}
|\va|^2\,|\vb|^2\sin^2\theta
&=|\va|^2|\vb|^2(1-\cos^2\theta)\\
&=|\va|^2|\vb|^2-(\va\cdot\vb)^2\\
&=\big(a_1^2+a_2^2+a_3^2\big)\big(b_1^2+b_2^2+b_3^2\big)
-\big(a_1b_1+a_2b_2+a_3b_3\big)^2\\
&=a_1^2b_2^2+a_1^2b_3^2+a_2^2b_1^2+a_2^2b_3^2+a_3^2b_1^2+a_3^2b_2^2\\
&\hskip0.5in
-\big(2a_1b_1a_2b_2+2a_1b_1a_3b_3+2a_2b_2a_3b_3\big)
\end{align*}
To see that \(|\va|\,|\vb|\sin\theta\) is the area of the parallelogram with sides \(\va\) and \(\vb\text{,}\) just recall that the area of any parallelogram is given by the length of its base times its height. Think of \(\va\) as the base of the parallelogram. Then \(|\va|\) is the length of the base and \(|\vb|\sin\theta\) is the height.
These properties almost determine \(\va\times\vb\text{.}\) Property 1 forces the vector \(\va\times\vb\) to lie on the line perpendicular to the plane containing \(\va\) and \(\vb\text{.}\) There are precisely two vectors on this line that have the length given by property 2. In the left hand figure of
the two vectors are labeled \(\vc\) and \(\vd\text{.}\) Which of these two candidates is correct is determined by the right hand rule, which says that if you form your right hand into a fist with your fingers curling from \(\va\) to \(\vb\text{,}\) then when you stick your thumb straight out from the fist, it points in the direction of \(\va\times\vb\text{.}\) This is illustrated in the figure on the right above. The important special cases
- (3)
-
\begin{equation*}
\hi\times\hj=\phantom{-}\hk,\ \ \ \hj\times\hk=\phantom{-}\hi,\ \ \ \hk\times\hi=\phantom{-}\hj
\end{equation*}
\begin{equation*}
\hj\times\hi=-\hk,\ \ \ \hk\times\hj=-\hi,\ \ \ \hi\times\hk=-\hj
\end{equation*}
all follow directly from the definition of the cross product (see, for example, Example
1.2.20) and all obey the right hand rule. Combining properties 1, 2 and the right hand rule give the geometric definition of
\(\va\times\vb\text{.}\) To remember these three special cases, just remember this figure.
The product of any two standard basis vectors, taken in the order of the arrows in the figure, is the third standard basis vector. Going against the arrows introduces a minus sign.
- (4)
\begin{equation*}
\va\times\vb=|\va|\,|\vb|\sin\theta\ \hn
\end{equation*}
where \(\theta\) is the angle between \(\va, \vb\text{,}\) \(|\hn|=1,\ \hn\perp\va,\vb\text{,}\) and \((\va,\vb,\hn)\) obey the right hand rule.
Outline of Proof.
We have already seen that the right hand side has the correct length and, except possibly for a sign, direction. To check that the right hand rule holds in general, rotate your coordinate system around so that \(\va\) points along the positive \(x\) axis and \(\vb\) lies in the \(xy\)-plane with positive \(y\) component. That is \(\va=\alpha\hi\) and \(\vb=\beta\hi+\gamma\hj\) with \(\alpha,\gamma\ge 0\text{.}\) Then
\begin{equation*}
\va\times\vb=\alpha\hi\times(\beta\hi+\gamma\hj) =\alpha\beta\,\hi\times\hi+\alpha\gamma\,\hi\times\hj.
\end{equation*}
The first term vanishes by property 2, because the angle \(\theta\) between \(\hi\) and \(\hi\) is zero. So, by property 3, \(\va\times\vb= \alpha\gamma\hk\) points along the positive \(z\) axis, which is consistent with the right hand rule.
The analog of property 7 of the dot product (which says that \(\va\cdot\vb\) is zero if and only if \(\va=\vZero\) or \(\vb=\vZero\) or \(\va\perp\vb\)) follows immediately from property 2.
- (5)
\begin{equation*}
\va\times\vb=\vZero\iff \va=\vZero \text{ or }
\vb=\vZero
\text{ or }
\va\parallel\vb.
\end{equation*}
The remaining properties are all tools for helping do computations with cross products. Here is a theorem which summarizes the properties of the cross product. We have already seen the first five. The other properties are all tools for helping do computations with cross products.
Theorem 1.2.23. Properties of the Cross Product.
- (0)
\(\va,\vb\) are vectors in three dimensions and \(\va\times\vb\) is a vector in three dimensions.
- (1)
\(\va\times\vb\) is perpendicular to both \(\va\) and \(\vb\text{.}\)
- (2)
-
\begin{align*}
|\va\times\vb| \amp = |\va|\,|\vb|\sin\theta
\text{ where } 0\le\theta\le\pi \text{ is the angle between } \va,\vb\\
\amp = \text{the area of the parallelogram with sides } \va, \vb
\end{align*}
- (3)
\begin{equation*}
\hi\times\hj=\phantom{-}\hk,\ \ \ \hj\times\hk=\phantom{-}\hi,\ \ \ \hk\times\hi=\phantom{-}\hj
\end{equation*}
- (4)
\begin{equation*}
\va\times\vb=|\va|\,|\vb|\sin\theta\ \hn
\end{equation*}
where \(\theta\) is the angle between \(\va, \vb\text{,}\) \(|\hn|=1,\ \hn\perp\va,\vb\text{,}\) and \((\va,\vb,\hn)\) obey the right hand rule.
- (5)
\begin{equation*}
\va\times\vb=\vZero\iff \va=\vZero \text{ or }
\vb=\vZero
\text{ or }
\va\parallel\vb.
\end{equation*}
- (6)
\begin{equation*}
\va\times \vb=-\vb\times\va
\end{equation*}
- (7)
\begin{equation*}
(s\va)\times \vb=\va\times(s\vb)=s(\va\times \vb)
\end{equation*}
for any scalar (i.e. number) \(s\text{.}\)
- (8)
\begin{equation*}
\va\times(\vb+\vc)=\va\times\vb+\va\times \vc
\end{equation*}
- (9)
\begin{equation*}
\va\cdot(\vb\times\vc)=(\va\times\vb)\cdot\vc
\end{equation*}
- (10)
\begin{equation*}
\va\times(\vb\times\vc)=(\vc\cdot\va)\vb -(\vb\cdot\va)\vc
\end{equation*}
Proof.
We have already seen the proofs up to number 5. Numbers 6, 7 and 8 follow immediately from the definition, using a little algebra. To prove numbers 9 and 10 we just write out the definitions of the left hand sides and the right hand sides and observe that they are equal.
- (9)
The left hand side is
\begin{align*}
\va\cdot(\vb\times\vc)
&=\llt a_1,a_2,a_3\rgt\cdot
\llt b_2c_3\!-\!b_3c_2\,,\, b_3c_1\!-\!b_1c_3\,,\, b_1c_2\!-\!b_2c_1\rgt\\
&={\color{blue}{a_1b_2c_3}} \!-\! {\color{blue}{a_1b_3c_2}} \!+\! {\color{orange}{a_2b_3c_1}} \!-\! {\color{orange}{a_2b_1c_3}} \!+\! a_3b_1c_2 \!-\! a_3b_2c_1
\end{align*}
The right hand side is
\begin{align*}
(\va\times\vb)\cdot\vc
&=\llt a_2b_3\!-\!a_3b_2\,,\, a_3b_1\!-\!a_1b_3\,,\, a_1b_2\!-\!a_2b_1\rgt\cdot
\llt c_1,c_2,c_3\rgt\\
&={\color{orange}{a_2b_3c_1}} \!-\! a_3b_2c_1 \!+\! a_3b_1c_2 \!-\! {\color{blue}{a_1b_3c_2}} \!+\! {\color{blue}{a_1b_2c_3}} \!-\! {\color{orange}{a_2b_1c_3}}
\end{align*}
The left and right hand sides are the same.
- (10)
We will give the straightforward, but slightly tedious, computations in (the optional) §
1.2.6.
Example 1.2.25.
As an illustration of the properties of the dot and cross product, we now derive the formula for the volume of the parallelepiped with edges
\(\va=\llt a_1,a_2,a_3\rgt \text{,}\) \(\vb = \llt b_1,b_2,b_3\rgt \text{,}\) \(\vc = \llt c_1,c_2,c_3\rgt \) that was mentioned in §
1.2.4.
The volume of the parallelepiped is the area of its base times its height. The base is the parallelogram with sides \(\vb\) and \(\vc\text{.}\) Its area is the length of its base, which is \(|\vb|\text{,}\) times its height, which is \(|\vc|\,\sin\theta\text{.}\) (Drop a perpendicular from the head of \(\vc\) to the line containing \(\vb\)). Here \(\theta\) is the angle between \(\vb\) and \(\vc\text{.}\) So the area of the base is \(|\vb|\,|\vc|\,\sin\theta= |\vb\times\vc|\text{,}\) by property 2 of the cross product.
To get the height of the parallelepiped, we drop a perpendicular from the head of \(\va\) to the line that passes through the tail of \(\va\) and is perpendicular to the base of the parallelepiped. In other words, from the head of \(\va\) to the line that contains both the head and the tail of \(\vb\times\vc\text{.}\) So the height of the parallelepiped is \(|\va|\,|\cos\varphi|\text{.}\) (The absolute values have been included because if the angle between \(\vb\times\vc\) and \(\va\) happens to be greater than \(90^\circ\text{,}\) the \(\cos\varphi\) produced by taking the dot product of \(\va\) and \((\vb\times\vc\)) will be negative.)
All together
\begin{align*}
\text{volume of parallelepiped}
&=(\text{area of base})\,(\text{height})\\
&=|\vb\times\vc|\ |\va|\ |\cos\varphi|\\
&=\big|\va\cdot(\vb\times\vc)\big|\\
&=\left|a_1 (\vb\times\vc)_1 +a_2 (\vb\times\vc)_2
+a_3 (\vb\times\vc)_3\right|\\
&=\left|a_1\det\left[\begin{matrix} b_2&b_3 \\ c_2&c_3\end{matrix}\right]
-a_2 \det\left[\begin{matrix}b_1&b_3 \\ c_1&c_3\end{matrix}\right]
+a_3\det\left[\begin{matrix}b_1&b_2 \\ c_1&c_2\end{matrix}\right]\right|\\
&=\left|\det\left[\begin{matrix}a_1&a_2&a_3\\ b_1&b_2&b_3\\c_1&c_2&c_3\end{matrix}\right]\right|
\end{align*}
Example 1.2.26.
As a concrete example of the computation of the volume of a parallelepiped, we consider the parallelepiped with edges
\begin{align*}
\va &= \llt 0,1,2\rgt\\
\vb &= \llt 1,1,0\rgt\\
\vc &= \llt 0,1,0\rgt
\end{align*}
Here is a sketch.
The base of the parallelepiped is the parallelogram with sides \(\vb\) and \(\vc\text{.}\) It is the shaded parallelogram in the sketch above. As
\begin{align*}
\vb\times\vc
&=\det\left[\begin{matrix}
\hi& \hj &\hk \\ 1&1&0 \\ 0&1&0\end{matrix}\right]\\
&=\hi\det\left[\begin{matrix} 1&0 \\ 1&0\end{matrix}\right]
-\hj\det\left[\begin{matrix}1&0 \\ 0&0\end{matrix}\right]
+\hk\det\left[\begin{matrix} 1&1 \\ 0&1\end{matrix}\right]\\
&=\hi\big(1\times 0-0\times 1) -\hj(1\times 0-0\times 0)
+\hk(1\times 1-1\times 0)\\
&=\hk
\end{align*}
We should not be surprised that \(\vb\times\vc\) has direction \(\hk\text{.}\)
\(\vb\times\vc\) has to be perpendicular to both \(\vb\) and \(\vc\) and
both \(\vb\) and \(\vc\) lie in the \(xy\)-plane,
so that \(\vb\times\vc\) has to the perpendicular to the \(xy\)-plane,
so that \(\vb\times\vc\) has to the parallel to the \(z\)-axis.
The area of the base, i.e. of the shaded parallelogram in the figure above, is
\begin{equation*}
|\vb\times\vc| = |\hk| =1
\end{equation*}
and the volume of the parallelepiped is
\begin{equation*}
|\va\cdot (\vb\times\vc)|
= |\llt 0,1,2\rgt\cdot\llt 0,0,1\rgt|=2
\end{equation*}
Subsection 1.2.7 (Optional) Application of Cross Products to Rotational Motion
In most computations involving rotational motion, the cross product shows up in one form or another. This is one of the main applications of the cross product. Consider, for example, a rigid body which is rotating at a constant rate of \(\Omega\) radians per second about an axis whose direction is given by the unit vector \(\hat\va\text{.}\) Let \(P\) be any point on the body. Let’s figure out its velocity. Pick any point on the axis of rotation and designate it as the origin of our coordinate system. Denote by \(\vr\) the vector from the origin to the point \(P\text{.}\) Let \(\theta\) denote the angle between \(\hat\va\) and \(\vr\text{.}\) As time progresses the point \(P\) sweeps out a circle of radius \(R=|\vr\,|\sin\theta\text{.}\)
In one second \(P\) travels along an arc that subtends an angle of \(\Omega\) radians, which is the fraction \(\tfrac{\Omega}{2\pi}\) of a full circle. The length of this arc is \(\tfrac{\Omega}{2\pi}\times 2\pi R=\Omega R=\Omega|\vr\,|\sin\theta\) so \(P\) travels the distance \(\Omega|\vr\,|\sin\theta\) in one second and its speed, which is also the length of its velocity vector, is \(\Omega|\vr\,|\sin\theta\text{.}\)
Now we just need to figure out the direction of the velocity vector. That is, the direction of motion of the point \(P\text{.}\) Imagine that both \(\hat\va\) and \(\vr\) lie in the plane of a piece of paper, as in the figure above. Then \(\vv\) points either straight into or straight out of the page and consequently is perpendicular to both \(\hat\va\) and \(\vr\text{.}\) To distinguish between the “into the page” and “out of the page” cases, let’s impose the conventions that \(\Omega \gt 0\) and the axis of rotation \(\hat\va\) is chosen to obey the right hand rule, meaning that if the thumb of your right hand is pointing in the direction \(\hat\va\text{,}\) then your fingers are pointing in the direction of motion of the rigid body. Under these conventions, the velocity vector \(\vv\) obeys
\(\displaystyle |\vv|=\Omega|\vr||\hat\va|\sin\theta\)
\(\displaystyle \vv\perp\hat\va,\vr\)
\((\hat\va,\vr,\vv)\) obey the right hand rule
That is, \(\vv\) is exactly \(\Omega\hat\va\times\vr\text{.}\) It is conventional to define the “angular velocity” of a rigid body to be vector \(\mathbf{\Omega}=\Omega\hat\va\text{.}\) That is, the vector with length given by the rate of rotation and direction given by the axis of rotation of the rigid body. In particular, the bigger the rate of rotation, the longer the angular velocity vector. In terms of this angular velocity vector, the velocity of the point \(P\) is
\begin{gather*}
\vv=\mathbf{\Omega}\times\vr
\end{gather*}
Subsection 1.2.8 (Optional) Application of Cross Products to Rotating Reference Frames
Imagine a moving particle that is being tracked by two observers.
One observer is fixed (out in space) and measures the position of the particle to be \(\big(X(t), Y(t), Z(t)\big)\text{.}\)
The other observer is tied to a merry-go-round (the Earth) and measures the position of the particle to be \(\big(x(t),y(t),z(t)\big)\text{.}\)
The merry-go-round is sketched in the figure on the left below. It is rotating about the \(Z\)-axis at a (constant) rate of \(\Omega\) radians per second. The vector \(\vOmega = \Omega \hk\text{,}\) whose length is the rate of rotation and whose direction is the axis of rotation, is called the angular velocity.
The \(x\)- and \(y\)-axes of the moving observer are painted in red on the merry-go-round. The figure on the right above shows a top view of the merry-go-round. The \(x\)- and \(y\)-axes of the moving observer are again red. The \(X\)- and \(Y\)-axes of the fixed observer are blue. We are assuming that at time \(0\text{,}\) the \(x\)-axis of the moving observer and the \(X\)-axis of the fixed observer coincide. As the merry-go-round is rotating at \(\Om\) radians per second, the angle between the \(X\)-axis and \(x\)-axis after \(t\) seconds is \(\Omega t\text{.}\)
As an example, suppose that the moving particle is tied to the tip of the moving observer’s unit \(x\) vector. Then
\begin{align*}
x(t) & = 1 & y(t) &= 0 & z(t) &= 0\\
X(t) & = \cos(\Om t) & Y(t) &=\sin(\Om t) & Z(t) &= 0
\end{align*}
or, if we write \(\vr(t) = \big(x(t), y(t), z(t)\big)\) and \(\vR(t) = \big(X(t), Y(t), Z(t)\big)\text{,}\) then
\begin{gather*}
\vr(t) = (1\,,\,0\,,\,0) \qquad
\vR(t)= \big(\cos(\Om t)\,,\,\sin(\Om t)\,,\,0\Big)
\end{gather*}
In general, denote by \(\hi(t)\) the coordinates of the unit \(x\)-vector of the moving observer at time \(t\text{,}\) as measured by the fixed observer. Similarly \(\hj(t)\) for the unit \(y\)-vector, and \(\hk(t)\) for the unit \(z\)-vector. As the merry-go-round is rotating about the \(Z\)-axis at a rate of \(\Omega\) radians per second, the angle between the \(X\)-axis and \(x\)-axis after \(t\) seconds is \(\Omega t\text{,}\) and
\begin{align*}
\hi(t) &= \big(\cos(\Om t)\,,\,\sin(\Om t)\,,\,0\Big)\\
\hj(t) &= \big(-\sin(\Om t)\,,\,\cos(\Om t)\,,\,0\Big)\\
\hk(t) &= \big(0\,,\,0\,,\,1\Big)
\end{align*}
The position of the moving particle, as seen by the fixed observer is
\begin{align*}
\vR(t) &= x(t)\,\hi(t) + y(t)\,\hj(t) +z(t)\,\hk(t)
\end{align*}
Differentiating, the velocity of the moving particle, as measured by the fixed observer is
\begin{align*}
\vV(t)=\diff{\vR}{t} &= \phantom{+}\diff{x}{t}\!(t)\ \hi(t)
+ \diff{y}{t}\!(t)\ \hj(t)
+ \diff{z}{t}\!(t)\ \hk(t)\\
&\phantom{=} +x(t)\diff{}{t} \hi(t)
+y(t)\diff{}{t} \hj(t)
+z(t)\diff{}{t} \hk(t)
\end{align*}
We saw, in the last (optional) §
1.2.7, that
\begin{gather*}
\diff{}{t} \hi(t)=\vOmega\times\hi(t)\quad
\diff{}{t} \hj(t)=\vOmega\times\hj(t)\quad
\diff{}{t} \hk(t)=\vOmega\times\hk(t)
\end{gather*}
(You could also verify that these are correct by putting in \(\vOmega = (0,0,\Om)\) and explicitly computing the cross products.) So
\begin{align*}
\vV(t)&= \Big(\diff{x}{t}\!(t)\ \hi(t)
+ \diff{y}{t}\!(t)\ \hj(t)
+ \diff{z}{t}\!(t)\ \hk(t)\Big)\\
&\hskip0.5in +\vOmega\times\Big(x(t)\ \hi(t)
+y(t)\ \hj(t)
+z(t)\ \hk(t)\Big)
\end{align*}
Differentiating a second time, the acceleration of the moving particle (which is also \(\frac{\vF}{m}\text{,}\) where \(\vF\) is the net force being applied to the particle and \(m\) is the mass of the particle) as measured by the fixed observer is
\begin{align*}
\frac{\vF}{m}
=\vA(t)
=& \Big(\difftwo{x}{t}\!(t)\ \hi(t)
+ \difftwo{y}{t}\!(t)\ \hj(t)
+ \difftwo{z}{t}\!(t)\ \hk(t)\Big)\\
& +2\vOmega\times\Big(\diff{x}{t}\!(t)\ \hi(t)
+\diff{y}{t}\!(t)\ \hj(t)
+\diff{z}{t}\!(t)\ \hk(t)\Big)\\
&+\vOmega\times\Big(\vOmega\times\big[
x(t)\ \hi(t)
+y(t)\ \hj(t)
+z(t)\ \hk(t)\big]\Big)
\end{align*}
Recall that the angular velocity \(\vOmega=(0,0,\Omega)\) does not depend on time. The rotating observer sees \(\hi(t)\) as \(\hi=(1,0,0)\text{,}\) sees \(\hj(t)\) as \(\hj=(0,1,0)\text{,}\) and sees \(\hk(t)\) as \(\hk=(0,0,1)\) and so sees
\begin{align*}
\frac{\vF}{m}
&=\va(t) + 2\vOmega\times\vv(t) +\vOmega\times\big[\vOmega\times\vr(t)\big]
\end{align*}
where, as usual,
\begin{alignat*}{2}
\vv(t) &= \diff{}{t}\vr(t)
&&= \Big(\diff{x}{t}(t)\,,\,\diff{y}{t}(t)\,,\,\diff{z}{t}(t)\Big)\\
\va(t) &= \difftwo{}{t}\vr(t)
&&= \Big(\difftwo{x}{t}(t)\,,\,\difftwo{y}{t}(t)\,,\,\difftwo{z}{t}(t)\Big)
\end{alignat*}
So the acceleration of the particle seen by the moving observer is
\begin{gather*}
\va(t) = \frac{\vF}{m} - 2\vOmega\times\vv(t)
- \vOmega\times\big[\vOmega\times\vr(t)\big]
\end{gather*}
Here
\(\vF\) is the sum of all external forces acting on the moving particle,
\(\vF_{{\rm cor}}=-2 \vOmega\times\vv(t)\) is called the Coriolis force and
\(- \vOmega\times\big[\vOmega\times\vr(t)\big]\) is called the centrifugal force.
As an example, suppose that you are the moving particle and that you are at the edge of the merry-go-round. Let’s say \(t=0\) and you are at \(\hi\text{.}\) Then \(\vF\) is the friction that the surface of the merry-go-round applies to the soles of your shoes. If you are just standing there, \(\vv(t)=\vZero\text{,}\) so that \(\vF_{{\rm cor}}=\vZero\text{,}\) and the friction \(\vF\) exactly cancels the centrifugal force \(-\vOmega\times\big[\vOmega\times\vr(t)\big]\) so that you remain at \(\hi(t)\text{.}\) Assume that \(\Om \gt 0\text{.}\) Now suppose that you start walking around the edge of the merry-go-round. Then, at \(t=0\text{,}\) \(\vr=\hi\) and
if you walk in the direction of rotation (with speed one), as in the figure on the left below, \(\vv=\hj\) and the Coriolis force \(\vF_{{\rm cor}}=
-2\Om\hk\times\hj = 2\Om\,\hi\) tries to push you off of the merry-go-round, while
if you walk opposite to the direction of rotation (with speed one), as in the figure on the right below, \(\vv=-\hj\) so that the Coriolis force \(\vF_{{\rm cor}}=-2\Om\hk\times(-\hj) = -2\Om\,\hi\) tries to pull you into the centre of the merry-go-round.
On a rotating ball, such as the Earth, the Coriolis force deflects wind to the right (counterclockwise) in the northern hemisphere and to the left (clockwise) is the southern hemisphere. In particular, hurricanes/cyclones/typhoons rotate counterclockwise in the northern hemisphere and clockwise in the southern hemisphere. On the other hand, when it comes to water draining out of, for example, a toilet, Coriolis force effects are dominated by other factors like asymmetry of the toilet.