Introduction to Computer Graphics, Section 3.5 -- Some Linear Algebra (2024)

Section 3.5

Some Linear Algebra

Linear algebra is a branch of mathematics that is fundamental tocomputer graphics. It studies vectors,linear transformations,and matrices. We have already encounteredthese topics in Subsection2.3.8 in a two-dimensionalcontext. In this section, we look at them more closely and extendthe discussion to three dimensions.

It is not essential that you know the mathematical details thatare covered in this section, since they can be handled internally inOpenGL or by software libraries. However, you will need to be familiarwith the concepts and the terminology. This is especially true formodern OpenGL, which leaves many of the details up to your programs.Even when you have a software library to handle the details, you stillneed to know enough to use the library. You might want to skimthis section and use it later for reference.

3.5.1Vectors and Vector Math

A vector is a quantity that has a length and a direction. A vector can be visualizedas an arrow, as long as you remember that it is the length and direction of thearrow that are relevant, and that its specific location is irrelevant.Vectors are often used in computer graphics to represent directions, such asthe direction from an object to a light source or the direction in which a surfacefaces. In those cases, we are more interested in the direction of a vector thanin its length.

If we visualize a 3D vector V as an arrow starting at the origin, (0,0,0), and endingat a point P, then we can, to a certain extent, identify Vwith P—at least as long as we remember that an arrow startingat any other point could also be used to represent V.If P has coordinates (a,b,c), we can use the same coordinatesfor V. When we think of (a,b,c) as a vector, the value of a representsthe change in the x-coordinate between the starting point of the arrow andits ending point, b is the change in the y-coordinate, and c isthe change in the z-coordinate. Forexample, the 3D point (x,y,z) = (3,4,5) has thesame coordinates as the vector (dx,dy,dz) = (3,4,5).For the point, the coordinates (3,4,5) specify a position in spacein the xyz coordinate system. For the vector, the coordinates (3,4,5)specify the change in the x, y, and z coordinates alongthe vector. If we represent the vector with an arrow that startsat the origin (0,0,0), then the head of the arrow will be at (3,4,5).But we could just as well visualize the vector as an arrow that starts atthe point (1,1,1), and in that case the head of the arrow would be atthe point (4,5,6).

The distinction between a point and a vector is subtle. For somepurposes, the distinction can be ignored; for other purposes, it is important.Often, all that we have is a sequence of numbers, which we can treat as the coordinates of either a vector or a point, whichever is more appropriate in the context.

One of the basic properties of a vector is its length.In terms of its coordinates, the length of a 3D vector (x,y,z)is given by sqrt(x²+y²+z²).(This is just the Pythagorean theorem in three dimensions.) If v isa vector, its length is denoted by|v|.The length of a vector is also called its norm.(We are considering 3D vectors here, but concepts and formulas are similar for other dimensions.)

Vectors of length 1 are particularly important. They are calledunit vectors. If v = (x,y,z)is any vector other than (0,0,0), then there is exactly one unit vectorthat points in the same direction as v. That vector is given by

( x/length, y/length, z/length )

where length is the length of v. Dividing a vector by itslength is said to normalize the vector: The resultis a unit vector that points in the same direction as the originalvector.

Two vectors can be added. Given two vectors v1 = (x1,y1,z1) andv2 = (x2,y2,z2), their sum is defined as

v1 + v2 = ( x1+x2, y1+y2, z1+z2 );

The sum has a geometric meaning:

3.5.2Matrices and Transformations

A matrix is just a two-dimensional array of numbers. A matrix with r rows andc columns is said to be an r-by-c matrix. If A and Bare matrices, and if the number of columns in A is equal to the number ofrows in B, then A and B can be multiplied to give the matrixproduct AB. If A is an n-by-m matrix and B isan m-by-k matrix, then AB is an n-by-k matrix.In particular, two n-by-n matrices can be multiplied to give anothern-by-n matrix.

An n-dimensional vector can be thought of an n-by-1 matrix. IfA is an n-by-n matrix and v is a vector in n dimensions,thought of as an n-by-1 matrix, then the product Av is again ann-dimensional vector.The product of a 3-by-3 matrix A and a 3D vector v = (x,y,z)is often displayed like this:

Note that the i-th coordinate in the product Av is simply the dot product of thei-th row of the matrix A and the vectorv.

Using this definition of the multiplication of a vector by a matrix, a matrix defines atransformation that can be applied to one vector to yield another vector.Transformations that are defined in this way are linear transformations,and they are the main object of study in linear algebra. A linear transformation L hasthe properties that for two vectors v and w, L(v+w)=L(v)+L(w),and for a number s, L(sv)=sL(v).

Rotation and scaling are linear transformations, but translation is nota linear transformation.To include translations, we have to widen our view of transformation to includeaffine transformations.An affine transformation can be defined, roughly, as a linear transformation followed by a translation. Geometrically, an affine transformation is a transformation that preservesparallel lines; that is, if two lines are parallel, then their images under an affinetransformation will also be parallel lines.For computer graphics, we are interested in affine transformations inthree dimensions. However—by what seems at first to be a very odd trick—wecan narrow our view back to the linear by moving into the fourth dimension.

Note first of all that an affine transformation in three dimensions transforms a vector(x1,y1,z1) into a vector (x2,y2,z2) given byformulas

x2 = a1*x1 + a2*y1 + a3*z1 + t1y2 = b1*x1 + b2*y1 + b3*z1 + t2z2 = c1*x1 + c2*y1 + c3*z1 + t3

These formulas express a linear transformation given by multiplication by the 3-by-3 matrix

followed by translation by t1 in the x direction, t2 in the ydirection and t3 in the z direction. The trick is to replace each three-dimensionalvector (x,y,z) with the four-dimensional vector(x,y,z,1), adding a "1" as the fourth coordinate. And instead of the 3-by-3 matrix, we use the 4-by-4 matrix

If the vector (x1,y1,z1,1) is multiplied by this 4-by-4 matrix,the result is precisely the vector (x2,y2,z2,1). That is,instead of applying an affine transformation to the 3D vector (x1,y1,z1),we can apply a linear transformation to the 4D vector (x1,y1,z1,1).

This might seem pointless to you, but nevertheless, that is what is done in OpenGL andother 3D computer graphics systems: Anaffine transformation is represented as a 4-by-4 matrix in which the bottom row is(0,0,0,1), and a three-dimensional vector is changed into a four dimensional vectorby adding a 1 as the final coordinate. The result is that all the affine transformationsthat are so important in computer graphics can be implemented as multiplication ofvectors by matrices.

The identity transformation, which leaves vectors unchanged, corresponds to multiplicationby the identity matrix, which has ones along its descending diagonal andzeros elsewhere. The OpenGL function glLoadIdentity() sets the current matrix tobe the 4-by-4 identity matrix. An OpenGL transformation function, such as glTranslatef(tx,ty,tz),has the effect of multiplying the current matrix by the 4-by-4 matrix thatrepresents the transformation. Multiplication is on the right; that is, if M isthe current matrix and T is the matrix that represents the transformation, thenthe current matrix will be set to the product matrixMT. For the record,the following illustration shows the identity matrix and the matricescorresponding to various OpenGL transformation functions:

It is even possible to use an arbitrary transformation matrix in OpenGL, using thefunction glMultMatrixf(T) or glMultMatrixd(T). The parameter, T,is an array of numbers of type float or double,representing a transformation matrix. The array is a one-dimensional array of length 16.The items in the array are the numbers from the transformation matrix, stored in column-major order,that is, the numbers in the fist column, followed by the numbers in the secondcolumn, and so on. These functions multiply the current matrix by the matrix T,on the right. You could use them, for example, to implement a shear transform,which is not easy to represent as a sequence of scales, rotations, and translations.

3.5.3hom*ogeneous Coordinates

We finish this section with a bit of mathematical detail about the implementation of transformations.There is one common transformation in computer graphics that is not an affine transformation:In the case of a perspective projection, the projection transformation is not affine.In a perspective projection, an object will appear to get smaller as it moves farther awayfrom the viewer, and that is a property that no affine transformation can express, sinceaffine transforms preserve parallel lines and parallel lines will seem to converge in thedistance in a perspective projection.

Surprisingly, we can still represent a perspective projection as a 4-by-4 matrix, providedwe are willing to stretch our use of coordinates even further than we have already. Wehave already represented 3D vectors by 4D vectors in which the fourth coordinate is 1.We now allow the fourth coordinate to be anything at all, except for requiring thatat least one of the four coordinates is non-zero. When the fourth coordinate, w,is non-zero, we consider the coordinates (x,y,z,w) torepresent the three-dimensional vector (x/w,y/w,z/w). Note that thisis consistent with our previous usage, since it considers (x,y,z,1)to represent (x,y,z), as before. When the fourth coordinate is zero,there is no corresponding 3D vector, but it is possible to think of (x,y,z,0)as representing a 3D "point at infinity" in the direction of (x,y,z).

Coordinates (x,y,z,w) used in this way are referred toas hom*ogeneous coordinates. If we use hom*ogeneous coordinates, then any4-by-4 matrix can be used to transform three-dimensional vectors, including matrices whosebottom row is not (0,0,0,1). Among the transformationsthat can be represented in this way is the projection transformation for a perspectiveprojection. And in fact, this is what OpenGL does internally. It represents all three-dimensionalpoints and vectors using hom*ogeneous coordinates, and it represents all transformations as4-by-4 matrices. You can even specify vertices using hom*ogeneous coordinates. For example, thecommand

glVertex4f(x,y,z,w);

with a non-zero value for w, generates the 3D point (x/w,y/w,z/w). Fortunately, you will almost neverhave to deal with hom*ogeneous coordinates directly. The only real exception to this isthat hom*ogeneous coordinates are used, surprisingly, when configuring OpenGL lighting, aswe'll see in the next chapter.

I'm an expert in computer graphics and linear algebra, with extensive knowledge in the application of mathematical concepts to OpenGL and 3D computer graphics systems. My expertise is grounded in both theoretical understanding and practical implementation. Let's delve into the concepts introduced in Section 3.5 of the provided article.

Section 3.5: Some Linear Algebra

3.5.1 Vectors and Vector Math

Vectors: Quantities with length and direction, often visualized as arrows. Used in computer graphics to represent directions.
Vector Coordinates: In 3D space, a vector with coordinates (a, b, c) represents changes in x, y, and z coordinates, respectively.
Vector Length: The length of a 3D vector (x, y, z) is given by sqrt(x^2 + y^2 + z^2), denoted as |v|.
Unit Vectors: Vectors with a length of 1. Useful for representing directions. Obtained by normalizing a vector (dividing by its length).
Vector Addition: Two vectors v1 and v2 can be added component-wise.
Scalar Product (Scalar Multiplication): The product of a scalar (number) and a vector. Geometrically, it scales or reverses the direction of the vector.
Dot Product: Denoted as v1·v2, results in a scalar. Important for calculating angles between vectors and determining orthogonality.

3.5.2 Matrices and Transformations

Matrix: A two-dimensional array of numbers. A matrix with r rows and c columns is an r-by-c matrix.
Matrix Multiplication: If A is an n-by-m matrix and B is an m-by-k matrix, then the product AB is an n-by-k matrix.
Vector-Matrix Multiplication: A matrix A can be multiplied by a vector v, resulting in a transformed vector Av.
Linear Transformations: Transformations represented by matrices. Linear transformations preserve vector addition and scalar multiplication properties.
Affine Transformations: Linear transformations followed by translations. Preserves parallel lines.
hom*ogeneous Coordinates: Representing 3D vectors as 4D vectors (x, y, z, w). Allows the representation of translations as a matrix multiplication.

3.5.3 hom*ogeneous Coordinates

Perspective Projection: An affine transformation that doesn't preserve parallel lines. Involves representing points in 3D using hom*ogeneous coordinates (x, y, z, w).
hom*ogeneous Coordinates: Coordinates (x, y, z, w) where w is not zero, representing (x/w, y/w, z/w) in 3D. Used in OpenGL for representing points and transformations.
Projection Transformation: Represented as a 4-by-4 matrix in hom*ogeneous coordinates, allowing the representation of perspective projections.

In summary, this section provides a comprehensive overview of linear algebra concepts such as vectors, matrices, transformations, and hom*ogeneous coordinates, emphasizing their applications in computer graphics, particularly in OpenGL. Understanding these concepts is crucial for working with 3D graphics and implementing various transformations and projections.