Course Homepage: 15-462/662 Fall 2020
For assignments implementations: tbbbk/Scotty3D_Benky
Author: Bingkui Tong

1. Basic Math Review1.1 Linear Algebra Review1.1.1 Norm1.1.2 Linear Map1.1.3 Gram-Schmidt1.2 Vector Calculus1.2.1 Matrix Representation of Cross Product1.2.2 Determinant of a Linear Map1.2.3 Derivative as Best Linear Approximation1.2.3 Gradients of Matrix-Valued Expressions1.2.4 Vector Fields1.2.5 Divergence 散度1.2.6 Curl 旋度1.2.7 Hessian in Coordinates2. Rasterization2.1 Rasterization 101: Drawing a Triangle2.1.1 Pinhole camera2.1.2 Computing triangle coverage2.1.3 Aliasing2.1.4 SuperSampling2.2 Spatial Transformations2.2.1 Rotation2.2.2 Scaling2.2.3 Shear2.2.4 Composition2.2.5 Polar & Singular Value Decomposition2.2.6 Interpolating Transformations—Polar2.2.7 Translation!2.2.8 Homogeneous Coordinates2.2.9 Transformation Composition Order2.2.10 Scene Graph2.3 3D Rotations and Complex Representations2.3.1 Degree of Freedom2.3.2 Commutativity of Rotation—3D2.3.3 Euler Angles—Gimbal Lock2.3.4 Complex Analysis2.3.5 Rotation: Matrices vs.Complex2.3.6 Quaternions 四元数2.3.7 3D Rotations via Quaternions2.4 Perspective Projection2.4.1 View Frustum2.4.2 Clipping2.4.3 Mapping Frustum to Unit Cube2.5 Texture Mapping2.5.1 Barycentric Coordinates2.5.2 Perspective Correct Interpolation2.5.3 Texture Coordinates2.5.4 Magnification vs. Minification2.5.5 Bilinear Interpolation (Magnification)2.5.6 MIP Map (Minification)2.5.7 Trilinear Interpolation for MIP Map2.5.8 Anisotropic Filtering2.6 Depth and Transparency2.6.1 Depth-Buffer2.6.2 Compositing2.6.3 Fringing2.6.4 (Non-) Premultiplied Alpha2.7 Rasterization Pipeline Summary3. Geometry3.1 Encode Geometry3.1.1 Implicit Representations3.1.2 Explicit Representations3.1.3 Summary3.2 Meshes and Manifolds3.2.1 Manifold Assumption3.2.2 Why Do We Need Manifold?3.2.3 Halfedge Data Structure3.2.4 Halfedge Meshes Edition3.3 Digital Geometry Processing3.3.1 Remeshing as Resampling3.3.2 Upsampling via (Catmull-Clark/Loop) Subdivision3.3.3 Simplification via Edge Collapse3.3.4 Isotropic Remeshing Algorithm3.5 Geometric Queries3.5.1 Ray Equation3.5.2 Intersection3.6 Spatial Acceleration Data Structures3.6.1 Affine Map for Triangle3.6.2 Bounding Box3.6.3 Bounding Volume Hierarchy (BVH)4. Ray Tracing4.1 Color4.1.1 Intensity or Absorption4.1.2 Emission and Reflection4.1.3 Color Models4.1.4 Y’CbCr4.2 Radiometry4.2.1 Photon4.2.2 Radiant Energy $Q$ 4.2.3 Radiant Flux $\Phi$ (Power) 4.2.4 Irradiance $E$ 4.2.5 Lambert's Law4.2.6 Irradiance Falloff with Distance4.2.7 Solid Angles4.2.8 Radiance $L$ 4.2.9 Spectral Radiance4.2.10 Ambient Occlusion4.2.11 Radiant Intensity $I$ 4.3 The Rendering Equation4.3.1 Recursive Raytracing4.3.2 Reflection4.3.3 Models of Scattering4.3.4 BRDF (Bidirectional Reflectance Distribution Function)4.3.5 Subsurface scattering5. Optimization for Ray Tracing5.1 Numerical Integration5.1.1 Gauss Quadrature5.1.2 Trapezoid rule5.1.3 Curse of Dimensionality for Trapezoid Rule5.1.4 Sampling from Discrete Probability Distributions5.1.5 Sampling Continuous Random Variables using the Inversion Method5.1.6 Rejection Sampling5.2 Monte Carlo Rendering5.2.1 Expected Value and Variance5.2.2 Law of Large Number5.2.3 Biasing5.2.4 Importance Sampling5.2.5 Monte Carlo Integration5.2.6 Russian Roulette5.3 Variance Reduction5.3.1 Estimator5.3.2 Bias & Consistency5.3.3 Bidirectional Path Tracing5.3.4 Metropolis-Hastings Algorithm (MH)5.3.5 Multiple Importance Sampling5.3.6 Sampling Patterns & Variance Reduction6. Animation6.1 Keyframe6.1.1 Keyframing6.1.2 Spline Interpolation6.1.3 Natural Splines6.1.4 Hermite/Bézier Spline6.1.5 Catmull-Rom Spline6.1.6 Evaluation of Spline6.2 Dynamics6.2.1 Inverse Kinematics6.2.2 Animation EquationF6.2.3 Generalized Config6.2.3 Ordinary Differential Equation (ODE)6.2.4 Lagrangian Mechanics6.2.5 Particle Systems6.2.6 Forward/Backward Euler6.2.7 Symplectic Euler6.2.8 Automatic Differentiation6.2.9 Symbolic Differentiation6.2.10 Geometric Differentiation6.3 Partial Differential Equation (PDE)6.3.1 Definition of a PDE6.3.2 Lagrangian vs. Eulerian6.3.3 The Laplace Operator

1. Basic Math Review

I didn't write everything in detail for this section.

1.1 Linear Algebra Review

1.1.1 Norm

Which measures total size, length, volume, intensity, etc.

Warning: L2 Norm does not encode geometric length unless vectors are encoded in an orthonormal basis.

1.1.2 Linear Map

Key Idea: linear maps take lines to lines while keeps the origin fixed

It doesn't matter whether we add the vectors or apply the linear map first.

1.1.3 Gram-Schmidt

Warning: for large number of vectors / nearly parallel vectors, this is not the best algorithm

1.2 Vector Calculus

1.2.1 Matrix Representation of Cross Product

\begin{matrix} u := (u_{1}, u_{2}, u_{3}) \to \hat{u} := [\begin{matrix} 0 & - u_{3} & u_{2} \\ u_{3} & 0 & - u_{1} \\ - u_{2} & u_{1} & 0 \end{matrix}] \\ u \times v = \hat{u} v = [\begin{matrix} 0 & - u_{3} & u_{2} \\ u_{3} & 0 & - u_{1} \\ - u_{2} & u_{1} & 0 \end{matrix}] [\begin{matrix} v_{1} \\ v_{2} \\ v_{3} \end{matrix}] \end{matrix}

1.2.2 Determinant of a Linear Map

1.2.3 Derivative as Best Linear Approximation

Taylor series:

Replacing complicated functions with a linear (and sometimes quadratic) approximation is a powerful trick in graphics algorithms.

1.2.3 Gradients of Matrix-Valued Expressions

resource: matrixcookbook.pdf

1.2.4 Vector Fields

In general, a vector filed assigns a vector to each point in space, for example, we saw a gradient field:

\begin{matrix} f (x, y) = x^{2} + y^{2} \\ vector filed: \nabla f (x, y) = (2 x, 2 y) \end{matrix}

1.2.5 Divergence 散度

$\nabla · X$ .

\begin{matrix} \nabla = (\frac{\partial}{\partial u_{1}}, \dots, \frac{\partial}{\partial u_{n}}) \\ X (u) = (X_{1} (u), \dots, X_{n} (u)) \\ \nabla \cdot X := \sum_{i = 1}^{n} \frac{\partial X_{i}}{\partial u_{i}} \\ R^{n} \to R \end{matrix}

1.2.6 Curl 旋度

$\nabla \times X$

\begin{matrix} \nabla = (\frac{\partial}{\partial u_{1}}, \frac{\partial}{\partial u_{2}}, \frac{\partial}{\partial u_{3}}) \\ X (u) = (X_{1} (u), X_{2} (u), X_{3} (u)) \\ \nabla \times X := [\begin{matrix} \frac{\partial X_{3}}{\partial u_{2}} - \frac{\partial X_{2}}{\partial u_{3}} \\ \frac{\partial X_{1}}{\partial u_{3}} - \frac{\partial X_{3}}{\partial u_{1}} \\ \frac{\partial X_{2}}{\partial u_{1}} - \frac{\partial X_{1}}{\partial u_{2}} \end{matrix}] \\ If we only consider two dimensions: \\ \nabla \times X := \frac{\partial X_{2}}{\partial u_{1}} - \frac{\partial X_{1}}{\partial u_{2}} \end{matrix}

$X$ $X$

1.2.7 Hessian in Coordinates

Hessian is operator that gives us partial derivatives of the gradient.

\begin{matrix} f (x) : R^{n} \to R \\ \nabla^{2} f := [\begin{matrix} \frac{\partial^{2} f}{\partial x_{1} \partial x_{1}} & \dots & \frac{\partial^{2} f}{\partial x_{1} \partial x_{n}} \\ ⋮ & ⋱ & ⋮ \\ \frac{\partial^{2} f}{\partial x_{n} \partial x_{1}} & \dots & \frac{\partial^{2} f}{\partial x_{n} \partial x_{n}} \end{matrix}] \end{matrix}

2. Rasterization

2.1 Rasterization 101: Drawing a Triangle

Why triangle?

Can approximate any shape
always planar, well-defined normal
easy to interpolate data at corners
barycentric coordinates

Key reason: once everything is reduced to triangles, can focus on making an extremely well-optimized pipeline for drawing them

2.1.1 Pinhole camera

2.1.2 Computing triangle coverage

Key question: Which pixels does the triangle overlap?

2.1.3 Aliasing

The sampling process lose some information, leading the reconstruction result not exactly accurate. Or we say undersampling high-frequency signals results in aliasing.

2.1.4 SuperSampling

$N \times N$ $2\times2$ $4\times4$ ), or generate multiple random sampling points in each pixel. For each sampling points, execute the a complete pipeline and average these points to rasterize the pixel.

2.2 Spatial Transformations

2.2.1 Rotation

Properties:

Matrix Representations:

For rotation matrix, the transpose matrix equals inverse matrix.

R^{T} R = I

Orthogonal Transformations
$R^TR=I$ does not mean it is a rotation.
$R^TR=I$ ,
preserve $det(R)>0$
Reflections $det(R)<0$
$\begin{matrix} R = [\begin{matrix} - 1 & 0 \\ 0 & 1 \end{matrix}] \\ R^{⊤} R = [\begin{matrix} (- 1)^{2} & 0 \\ 0 & 1 \end{matrix}] = I \end{matrix}$

2.2.2 Scaling

f (u) = a u

Matrix Representations:

\begin{matrix} D = [\begin{matrix} a & 0 & 0 \\ 0 & a & 0 \\ 0 & 0 & a \end{matrix}], u = [\begin{matrix} u_{1} \\ u_{2} \\ u_{3} \end{matrix}] \\ D u = [\begin{matrix} a & 0 & 0 \\ 0 & a & 0 \\ 0 & 0 & a \end{matrix}] [\begin{matrix} u_{1} \\ u_{2} \\ u_{3} \end{matrix}] = [\begin{matrix} a u_{1} \\ a u_{2} \\ a u_{3} \end{matrix}] = a u \end{matrix}

Spectral Theorem:
$A=A^T$ has
$e_1,...,e_n\in\mathbb{R}^n$
$\lambda_1,...,\lambda_n\in\mathbb{R}$
$A e_{i} = λ_{i} e_{i}$
Hence, every symmetric matrix performs a non-uniform scaling along some set of orthogonal axes.
$A$ $\lambda_i>0$ ), this scaling is positive

2.2.3 Shear

f_{u, v} = x + ⟨ v, x ⟩ u

2.2.4 Composition

We can composite transformations:

How do we decompose a linear transformation into pieces?

2.2.5 Polar & Singular Value Decomposition

$A$ $Q$ $P$ (scaling).

A = Q P

$P$ $P=VDV^T$ $V$ $D$ diagonal (eigenvalue)):

\begin{matrix} A & = Q V D V^{T} \\ = U D V^{T} \end{matrix}

$U$ $V^T$ $D$ axis-aligned scaling matrix.

2.2.6 Interpolating Transformations—Polar

2.2.7 Translation!

f_{u} (x) = x + u

This transformation is NOT linear!

Additivity
$\begin{matrix} f_{u} (x + y) = x + y + u \\ f_{u} (x) + f_{u} (y) = (x + u) + (y + u) = x + y + 2 u \end{matrix}$
Homogeneity
$\begin{matrix} f_{u} (a x) = a x + u \\ a f_{u} (x) = a (x + u) = a x + a u \end{matrix}$

Translation is AFFINE, not linear. Hence we cannot composite it with other linear transformations.

So, how do we composite all the transformations?

2.2.8 Homogeneous Coordinates

Hint: Every points along a ray represents the same point.

$\mathbf{p} = (x, y)$ $z=1$ $\hat{\mathbf{p}}=(a,b,c)$ $(a/c,b/c)=(x,y)$ homogeneous coordinates $\mathbf{p}$

$\mathbf{p}$ $\hat{\mathbf{p}}$ looks like shear transformation.

But shear is a linear transformation!

$\mathbf{p} = (p_1, p_2)$ $\mathbf{u} = (u_1, u_2)$ $\mathbf{p}' = (p_1 + u_1, p_2 + u_2)$ $\widehat{\mathbf{p}} = (c p_1, c p_2, c)$ $c \neq 0$ ), the translated coordinates become:

{\hat{p}}^{'} = (c p_{1} + c u_{1}, c p_{2} + c u_{2}, c)

$\widehat{\mathbf{p}}'$ $c\mathbf{u}$ then that’s proportional to the distance along the third axis—a shear.

Homogeneous coordinates can also used to distinguish the vectors and points
$0$ $1$ $0$ to ignore the translation.

2.2.9 Transformation Composition Order

2.2.10 Scene Graph

2.3 3D Rotations and Complex Representations

2.3.1 Degree of Freedom

We need three degrees of freedom to specify a rotation in 3D

Two determine the "axis" direction and one determine how much spinning around the "axis".

2.3.2 Commutativity of Rotation—3D

Order of rotation matters in 3D. Verify by yourself.

Rotate 90° around Y, then 90° around Z, then 90° around X
Rotate 90° around Z, then 90° around Y, then 90° around X

2.3.3 Euler Angles—Gimbal Lock

We can use the Euler Angles to represent a 3D rotation like:

But sometimes we will encounter the Gimbal Lock!

$x, y, z$ axes:

\begin{matrix} R_{x} = [\begin{matrix} 1 & 0 & 0 \\ 0 & \cos θ_{x} & - \sin θ_{x} \\ 0 & \sin θ_{x} & \cos θ_{x} \end{matrix}], R_{y} = [\begin{matrix} \cos θ_{y} & 0 & \sin θ_{y} \\ 0 & 1 & 0 \\ - \sin θ_{y} & 0 & \cos θ_{y} \end{matrix}], R_{z} = [\begin{matrix} \cos θ_{z} & - \sin θ_{z} & 0 \\ \sin θ_{z} & \cos θ_{z} & 0 \\ 0 & 0 & 1 \end{matrix}] \\ R_{x} R_{y} R_{z} = [\begin{matrix} \cos θ_{y} \cos θ_{z} & - \cos θ_{y} \sin θ_{z} & \sin θ_{y} \\ \cos θ_{z} \sin θ_{x} \sin θ_{y} + \cos θ_{x} \sin θ_{z} & \cos θ_{x} \cos θ_{z} - \sin θ_{x} \sin θ_{y} \sin θ_{z} & - \cos θ_{y} \sin θ_{x} \\ - \cos θ_{x} \cos θ_{z} \sin θ_{y} + \sin θ_{x} \sin θ_{z} & \cos θ_{z} \sin θ_{x} + \cos θ_{x} \sin θ_{y} \sin θ_{z} & \cos θ_{x} \cos θ_{y} \end{matrix}] \end{matrix}

$\theta_y = \frac{\pi}{2}$ $\cos\theta_y = 0, \sin\theta_y = 1$ ), the matrix simplifies to:

\begin{matrix} R_{x} R_{y} R_{z} |_{θ_{y} = π / 2} = [\begin{matrix} 0 & 0 & 1 \\ \cos θ_{z} \sin θ_{x} + \cos θ_{x} \sin θ_{z} & \cos θ_{x} \cos θ_{z} - \sin θ_{x} \sin θ_{y} \sin θ_{z} & 0 \\ - \cos θ_{x} \cos θ_{z} + \sin θ_{x} \sin θ_{z} & \cos θ_{z} \sin θ_{x} + \cos θ_{x} \sin θ_{y} \sin θ_{z} & 0 \end{matrix}] \end{matrix}

$\theta_x$ $\theta_z$ , it can only rotate in on plane

2.3.4 Complex Analysis

First, change the way of your thinking!

Instead, imagine it's just a quarter-turn in CCW direction:

$1$ $i$ bases:

And the multiple operation is a little different, but the rest operations are the same:

\begin{matrix} z_{1} := (r_{1}, θ_{1}) \\ z_{2} := (r_{2}, θ_{2}) \\ z_{1} z_{2} = (r_{1} r_{2}, θ_{1} + θ_{2}) \end{matrix}

So:

\begin{matrix} e^{i π} + 1 = 0 \\ Specialization of Euler’s formula: \\ e^{i θ} = c o s (θ) + i s i n (θ) \\ Can use to “implement” complex product: \\ z_{1} = a e^{i θ}, z_{2} = b e^{i ϕ} \\ z_{1} z_{2} = a b e^{i (θ + ϕ)} \end{matrix}

*All the angle use rad.

2.3.5 Rotation: Matrices vs.Complex

2.3.6 Quaternions 四元数

Hamilton’s insight: in order to do 3D rotations in a way that mimics complex numbers for 2D, actually need FOUR coordinates.

One real, and three imaginary:

\begin{matrix} H := s p a n ({1, i, j, k}) \\ q = a + b i + c j + d k \in H \\ Quaternion product determined by: \\ i^{2} = j^{2} = k^{2} = i j k = - 1 \\ i j = k, j k = i, k i = j, \\ j i = - k, k j = - i, i k = - j . \end{matrix}

*product no longer commutes!

$x, y, z$ with quaternions, map them to:

(x, y, z) \mapsto 0 + x i + y j + z k

$i, j, k$ are imaginary units of quaternions.

A quaternion can be represented as a pair:

(\underset{R}{\underset{⏟}{scalar}}, \underset{R^{3}}{\underset{⏟}{vector}}) \in H

\begin{matrix} (a, u) (b, v) = (a b - u \cdot v, a v + b u + u \times v) \\ u v = u \times v - u \cdot v \end{matrix}

2.3.7 3D Rotations via Quaternions

$\mathbf{u}$ $θ$ $q$ representing rotation is

$\mathbf{x}$ $\mathbf{\bar{q}xq}$

2.4 Perspective Projection

Distant objects appear smaller!

2.4.1 View Frustum

View frustum is region the camera can see:

Top / bottom / left / right planes correspond to four sides of the image
Near / far planes correspond to closest/furthest thing we want to draw

2.4.2 Clipping

When objects are not visible to the camera / in view frustum, we clip it out!

*Also near/far clipping

2.4.3 Mapping Frustum to Unit Cube

$A\mathbf{x_i} = \mathbf{y_i}$ $A$

While we take perspective projection into account, it is:

For derivation: OpenGL Projection Matrix

Warp Up:

2.5 Texture Mapping

2.5.1 Barycentric Coordinates

Very useful for interpolation.

You can also regard it as the area proportions.

2.5.2 Perspective Correct Interpolation

Due to perspective projection (homogeneous divide), barycentric interpolation of values on a triangle with different depths is not an affine function of screen XY coordinates.

We want to interpolate attribute values linearly in 3D object space, not image space. If we compute barycentric coordinates using 2D (projected) coordinates, leads to (derivative) discontinuity in interpolation where quad was split:

How do we do?

$Z:=1/z$ $P:=\phi/z$ $\phi$ is the attribute
$Z$ $P$ using standard (2D) barycentric coordinates
$P$ $Z$

For a derivation, see Microsoft Word - lowk_persp_interp_06.doc

2.5.3 Texture Coordinates

Texture coordinate define a mapping from surface coordinates to points in texture domain. Often defined by linearly interpolating texture coordinates at triangle vertices.

$(u,v)$ in texture space

2.5.4 Magnification vs. Minification

Magnification: camera is very close to scene object
Minification: scene objects are very far away

2.5.5 Bilinear Interpolation (Magnification)

2.5.6 MIP Map (Minification)

When a pixel on the screen covers many pixels of the texture, we can average texture values of these pixels, which is expensive to compute. So, we can precompute it and choose one demanded.

To get the mipmap level, we need to compute differences between texture coordinate values at neighboring samples.

2.5.7 Trilinear Interpolation for MIP Map

We can first bilinear interpolate with mipmap level, and then interpolate between two different levels.

w = d - ⌊ d ⌋

2.5.8 Anisotropic Filtering

$u$ $v$ .

We can sample multiple times along the longer direction and less times along the short direction.

2.6 Depth and Transparency

2.6.1 Depth-Buffer

For each sample, depth-buffer stores the depth of the closest primitive seen so far.

There are also color buffer, which can also be used for super-sampling, like (4 samples per pixel)

2.6.2 Compositing

$\alpha$

$\alpha$ channel:

2.6.3 Fringing

What cause this?

2.6.4 (Non-) Premultiplied Alpha

$B$ $\alpha_B$ $A$ $\alpha_A$

If we do not use the premulitplied alpha, there is fringe. Because the upsampled color mix the background color with original color. So we have to pre-multiplied color and then upsample it.

Premultiplied alpha is better!

2.7 Rasterization Pipeline Summary

Transform triangle vertices into camera space
Apply perspective projection transform to transform triangle vertices into normalized coordinate space
Clipping
Transform to screen coordinates. Perform homogeneous divide, transform vertex xy positions from normalized coordinates into screen coordinates (based on screen w,h)
Setup triangle (triangle preprocessing). Before rasterizing triangle, can compute a bunch of data that will be used by all fragments
Sample coverage and compute triangle color at sample point
Perform depth test (if enabled)
Update color buffer* (if depth test passed) (* Possibly using OVER operation for transparency)

OpenGL/Direct3D graphics pipeline:

3. Geometry

3.1 Encode Geometry

3.1.1 Implicit Representations

$x^2+y^2+z^2=1$ .

f (x, y, z) = 2 - 1.23

Now, find a point on the plane. And we can observe that implicit surfaces make sampling hard.

f (x, y, z) = x^{2} + y^{2} + z^{2} - 1.

Now check if a point is inside or outside the unit sphere. And we can observe implicit surfaces make inside/outside tests task easy.

Common Implicit Representations:

Algebraic Surfaces
Constructive Solid Geometry (Boolean operations)
Blobby Surfaces (Gradually blend surfaces together)
Blending Distance Functions
Level Set Methods (Surface is found where interpolated values equal zero )
Mandelbrot Set

Pros:

description can be very compact (e.g., a polynomial)
easy to determine if a point is in our shape (just plug it in!)
other queries may also be easy (e.g., distance to surface)
for simple shapes, exact description/no sampling error
easy to handle changes in topology (e.g., fluid)

Cons:

expensive to find all points in the shape (e.g., for drawing)
very difficult to model complex shapes

3.1.2 Explicit Representations

$(cos(u)sin(v),sin(u)sin(v),cos(v)),\text{ for }0\le u\lt 2\pi\text{ and }0\le v \le\pi$

Many explicit representations in graphics. Like triangle meshes, polygon meshes, subdivision surfaces, NURBS point clouds…

Unsimilar to implicit representations, explicit representations make sampling easy but inside/outside test hard.

Common Explicit Representations:

Point Cloud
Polygon Mesh ( Store vertices and polygons)
Bézier Curves/Surfaces
Bernstein Basis:
It can use to interpolate different points:
We can piece together many Bézier curves to interpolate lots of points (because High-degree Bernstein polynomials don’t interpolate well):
Bézier Patches is Bézier patch is sum of (tensor) products of Bernstein bases:
By connecting Bézier curves, can connect Bézier patches to get a surface:
Basically, it is weight-average points (2D, 3D)
Rational B-Splines
Bézier can’t exactly represent conics—not even the circle!
Solution: interpolate in homogeneous coordinates, then project back to the plane:
NURBS: (N)on-(U)niform (R)ational (B)-(S)pline
- knots at arbitrary locations (non-uniform)
- expressed in homogeneous coordinates (rational)
- piecewise polynomial curve (B-Spline)
w is homogeneous coordinate, controlling "strength" of a vertex
We can use tensor product to the NURBS curve
Subdivision:

3.1.3 Summary

Some representations work better than others—depends on the task!

3.2 Meshes and Manifolds

3.2.1 Manifold Assumption

If you zoom in far enough, can draw a regular coordinate grid. (Very rough definition)

This is not manifold:

Which of shapes are manifold?

Or, we can say: A manifold polygon mesh has fans, not fins

Every edge is contained in only two polygons (no “fins”)
The polygons containing each vertex make a single “fan”

3.2.2 Why Do We Need Manifold?

To make some assumptions about our geometry to keep data structures/algorithms simple and efficient
In many common cases, doesn’t fundamentally limit what we can do with geometry

3.2.3 Halfedge Data Structure

*There are lots data structure can be used to store meshes, here we only talk about halfedge.

Halfedge makes mesh traversal easy:

Visit all vertices of a face


1
Halfedge* h = f->halfedge;
2
do {
3
    h = h->next;
4
    h->vertex...
5
} while (h != f-> halfedge);

Visit all neighbors of a vertex:


xxxxxxxxxx
4
1
Halfedge* h = v->halfedge;
2
do {
3
    h = h->twin->next;
4
} while (h != v-> halfedge);

Halfedge connectivity is always manifold:

Keep following next, and you’ll get faces.
Keep following twin and you’ll get edges.
Keep following next->twin and you’ll get vertices.

3.2.4 Halfedge Meshes Edition

Edge Flip
Edge Split
Edge Collapse
More…

3.3 Digital Geometry Processing

3.3.1 Remeshing as Resampling

Undersampling destroys features
Oversampling bad for performance

We need "good sampling"—"good" mesh. We need good approximation of original shape! Keep only elements that contribute information about shape. Add additional information where, e.g., curvature is large.

Vertices exactly on the surface doesn't mean it is a good approximation.

(Some attributes, like normal is not good).

One rule of thumb: triangle shape—Delaunay
For any triangle in the decomposition, the interior of its circumcircle does not contain any other point.
Another rule of thumb: regular vertex degree: Degree 6 for triangle mesh, 4 for quad mesh

3.3.2 Upsampling via (Catmull-Clark/Loop) Subdivision

There are lots of ways to do the subdivision.

Catmull-Clark subdivision
For more detailed tutorial: Catmull-Clark Subdivision: The Basics – CodeItNow
Loop Subdivision
We can use edge operations to complete the subdivision:
(Don’t forget to update vertex positions!)

3.3.3 Simplification via Edge Collapse

Basically, we assign each edge a cost, collapse the edge with least cost, and repeat until we reach the target.

And we use Quadric Error Metrics to determine edge's cost.

For a derivation, see Scotty3D_Benky/assignments/A2/simplify.md at main · tbbbk/Scotty3D_Benky

3.3.4 Isotropic Remeshing Algorithm

How to make triangles uniform shape & size?

Repeat four steps:

Split any edge over 4/3rds mean edge length
Collapse any edge less than 4/5ths mean edge length
Flip edges to improve vertex degree
Center vertices tangentially

3.5 Geometric Queries

3.5.1 Ray Equation

r (t) = o + t d

3.5.2 Intersection

implicit surface intersection $f(r(t))=0$ $t$ .

For explicit surface intersection (e.g. triangle), things become much harder and we do care about performance!

We will introduce Spatial Acceleration Data Structures!

3.6 Spatial Acceleration Data Structures

What we care about most is the ray-triangle intersection!

3.6.1 Affine Map for Triangle

$\mathbf{p}_0,\mathbf{p}_1,\mathbf{p}_2$ using barycentric coordinates:

\begin{matrix} f (u, v) = (1 - u - v) p_{0} + u p_{1} + v p_{2} \\ f (u, v) = p_{0} + u (p_{1} - p_{0}) + v (p_{2} - p_{0}) \end{matrix}

Now it's like:

So now the ray-triangle intersection is like:

\begin{matrix} p_{0} + u (p_{1} - p_{0}) + v (p_{2} - p_{0}) = o + t d \\ [\begin{matrix} p_{1} - p_{0} & p_{2} - p_{0} & - d \end{matrix}] [\begin{matrix} u \\ v \\ t \end{matrix}] = o - p_{0} \\ M [\begin{matrix} u \\ v \\ t \end{matrix}] = o - p_{0} ⟹ [\begin{matrix} u \\ v \\ t \end{matrix}] = M^{- 1} (o - p_{0}) \\ u \geq 0, v \geq 0, u + v \leq 1, t \geq 0 \end{matrix}

$u,v,\text{ and }t$ .

3.6.2 Bounding Box

We can pre-compute a bounding box around all primitives. If a ray intersect with a bounding box, then we test each primitives within this bounding box to avoid meaningless tradeoff.

Then use calculate the ray-axis-aligned box intersection:

The uniform equation is:

N^{T} (o + t d) = c

$t$ $x_0$

\begin{matrix} N^{T} = [1 0 0]^{T} (we only care about x-axis) \\ c = x_{0} \\ t = \frac{x_{0} - o_{x}}{d_{x}} \end{matrix}

More examples:

3.6.3 Bounding Volume Hierarchy (BVH)

BVH implementation assignment is really the pain in the ass…

How do we build the better BVH? We need a better partition.

A good partitioning minimizes the cost of finding the closest intersection of a ray with primitives in the node.

C = C_{t r a v} + p_{A} N_{A} C_{i s e c t} + p_{B} N_{B} C_{i s e c t}

$C_{trav}$ is the cost of traversing an interior node (e.g., bounding box test)
$C_A$ $C_B$ are the costs of intersection with the resultant child subtrees
$p_A$ $p_B$ are the probability a ray intersects the bbox of the child nodes A and B

P (h i t A | h i t B) = \frac{S_{A}}{S_{B}}

The pipeline about building BVH:

Beside BVH, there are also lots data structure to accelerate:

K-D tree
Uniform grid
Heuristic: Choose number of voxels ~ total number of primitives
Quad-tree / octree

4. Ray Tracing

4.1 Color

For the color section, I literally omit a lot of contents… Because I didn't listen to the color lecture very carefully orz

Light is oscillating electric & magnetic field.

KEY IDEA: frequency determines color of light

4.1.1 Intensity or Absorption

4.1.2 Emission and Reflection

4.1.3 Color Models

RGB
CMYK
HSV
SML
XYZ
…

4.1.4 Y’CbCr

Y’ = luma: perceived luminance (same as L* in CIELAB)
Cb = blue-yellow deviation from gray
Cr = red-cyan deviation from gray

4.2 Radiometry

4.2.1 Photon

Imagine every photon is a little rubber ball hitting the scene.

$Q$

This it "the number of hits". Energy for single photon:

\begin{matrix} Q = \frac{h c}{λ} \\ h \approx 6.626 \times 10^{- 34} J \cdot s \\ c \approx 3.00 \times 10^{8} m / s \\ λ \approx 390 - 700 \times 10^{- 3} m (visible) \\ Unit: \frac{(J \times s) (m / s)}{m} = J \end{matrix}

$h$ $c$ $\lambda$ is wavelength (color!).

$\Phi$ (Power)

Energy per unit time (Watts) received by the sensor (or emitted by the light)

\begin{matrix} Φ = lim_{Δ t \to 0} \frac{Δ Q}{Δ t} = \frac{d Q}{d t} \\ Q = \int_{t_{0}}^{t_{1}} Φ (t) d t \end{matrix}

$E$

$A$ , the average flux is :

\frac{Φ}{A}

$E$ ) is given by taking limit of area at a single point on the sensor:

E (p) = lim_{Δ \to 0} \frac{Δ Φ (p)}{Δ A} = \frac{d Φ (p)}{d A} [\frac{W}{m^{2}}]

4.2.5 Lambert's Law


$E=\frac{\Phi}{A}$	$E=\frac{E}{A'}=\frac{\Phi cos\theta}{A}$

4.2.6 Irradiance Falloff with Distance

$\Phi$ :

\begin{aligned} E_{1} & = \frac{Φ}{4 π r_{1}^{2}} \to Φ = 4 π r_{1}^{2} E_{1} \\ E_{2} & = \frac{Φ}{4 π r_{2}^{2}} \to Φ = 4 π r_{2}^{2} E_{2} \\ \frac{E_{2}}{E_{1}} & = \frac{r_{1}^{2}}{r_{2}^{2}} = {(\frac{r_{1}}{r_{2}})}^{2} \end{aligned}

Since same amount of energy is distributed over larger and larger spheres, has to get darker quadratically with distance.

4.2.7 Solid Angles

Radians	Steradians

$\theta=\frac{l}{r}$	$\Omega=\frac{A}{r^2}$

Differential solid angle:

\begin{aligned} d A & = (r d θ) (r \sin θ d ϕ) \\ = r^{2} \sin θ d θ d ϕ \\ d ω & = \frac{d A}{r^{2}} = \sin θ d θ d ϕ \\ Ω & = \int_{S^{2}} d ω \\ = \int_{0}^{2 π} \int_{0}^{π} \sin θ d θ d ϕ \\ = 4 π \end{aligned}

$L$

Radiance is the solid angle density of irradiance:

L (p, ω) = lim_{Δ \to 0} \frac{Δ E_{ω} (p)}{Δ ω} = \frac{d E_{ω} (p)}{d ω} [\frac{W}{m^{2} sr}]

$E_\omega$ $\omega$ !

$\omega$ !

Energy per unit time per unit area per unit solid angle…!

Surface radiance:

L (p, ω) = \frac{d E (p)}{d ω \cos θ} = \frac{d^{2} Φ (p)}{d A d ω \cos θ}

Reminder: Often need to distinguish between incident radiance and exitant radiance functions at a point on a surface. In general:

L_{i} (p, ω) \neq L_{o} (p ω)

4.2.9 Spectral Radiance

Now, the radiance is radiant energy per unit time per unit area per unit solid angle. If we wanna get the COLOR, we need to add a "per unit wavelength"

4.2.10 Ambient Occlusion

Assume spherical (vs. hemispherical) light source, “at infinity”. Irradiance is now rotation, translation invariant. Can pre-compute, “bake” into texture to enhance shading

$I$

Power per solid angle emanating from a point source.

I (ω) = \frac{d Φ}{d ω} [\frac{W}{sr}]

4.3 The Rendering Equation

4.3.1 Recursive Raytracing

Basic strategy: recursively evaluate rendering equation!

Renderer measures radiance along a ray:

4.3.2 Reflection

When the ray bounce in scene, how does the reflection of light affect the outgoing radiance?

What we are talking about is the scatter function in the rendering equation. Choice of reflection function determines surface appearance.

Some basic reflection functions:

Reflection	Examples
Ideal specular: Perfect mirror
Ideal diffuse: Uniform reflection in all directions
Glossy specular: Majority of light distributed in reflection direction
Retro-reflective: Reflects light back toward source

4.3.3 Models of Scattering

What goes in must come out! (Total energy must be conserved)

4.3.4 BRDF (Bidirectional Reflectance Distribution Function)

$ω_i$ $ω_o$ .

\begin{matrix} f_{r} (ω_{i} \to ω_{o}) \geq 0 \\ \int_{H^{2}} f_{r} (ω_{i} \to ω_{o}) \cos θ d ω_{i} \leq 1 \\ the sum ≤1 instead of =1 because the surface may absorb the energy \\ and convert it into heat or something. \\ f_{r} (ω_{i} \to ω_{o}) = f_{r} (ω_{o} \to ω_{i}) \end{matrix}

Radiometric description of BRDF:For a given change in the incident irradiance, how much does the exitant radiance change?

f_{r} (ω_{i} \to ω_{o}) = \frac{d L_{o} (ω_{o})}{d E_{i} (ω_{i})} = \frac{d L_{o} (ω_{o})}{d L_{i} (ω_{i}) \cos θ_{i}} [\frac{1}{sr}]

Common BRDF:

Lambertian reflection $f_r = \frac{\rho}{\pi}$
Specular reflection $f_r(\theta_i, \phi_i; \theta_o, \phi_o) = \frac{\delta(\cos\theta_i - \cos\theta_o) \delta(\phi_i - \phi_o \pm \pi)}{\cos\theta_i}$
Refraction:
- $\eta_i \sin\theta_i = \eta_t \sin\theta_t$
- $\mathbf{cos} \theta_t$ in the Snell's Law
- Optical manhole: Only small “cone” visible, due to total internal reflection (TIR) (When light is moving from a more optically dense medium to a less optically dense medium, light incident on boundary from large enough angle will not exit medium.)
Fresnel reflection: Many real materials: reflectance increases w/ viewing angle
$\phi$

4.3.5 Subsurface scattering

BSSRDF:

L (x_{o}, ω_{o}) = \int_{A} \int_{H^{2}} S (x_{i}, ω_{i}, x_{o}, ω_{o}) L_{i} (x_{i}, ω_{i}) \cos θ_{i} d ω_{i} d A

5. Optimization for Ray Tracing

How can we possibly evaluate this integral?

5.1 Numerical Integration

Basic idea:

integral is “area under curve”
sample the function at many points
integral is approximated as weighted sum

5.1.1 Gauss Quadrature

For any polynomial of degree n, we can always obtain the exact integral by sampling at a special set of n points and taking a special weighted combination.

Weighted combination of sample points.

Key idea so far: To approximate an integral, we need

quadrature points
weights for each point

\int_{a}^{b} f (x) d x \approx \sum_{i = 1}^{n} w_{i} f (x_{i})

5.1.2 Trapezoid rule

$f(x)$ by pretending function is piecewise affine.

\begin{aligned} h & = \frac{b - a}{n - 1} \\ \int_{a}^{b} f (x) d x & = h (\sum_{i = 1}^{n - 1} f (x_{i}) + \frac{1}{2} (f (x_{0}) + f (x_{n}))) \end{aligned}

$O(n)$

$O(h^2)=O(\frac{1}{n^2})$

How about 2D?

$O(n^2)$ (n x n set of measurements).

How about k dimensions?

$N=n^k$ $O(h^2)=O(\frac{1}{n^2})=O(\frac{1}{N^{\frac{2}{k}}})$

5.1.3 Curse of Dimensionality for Trapezoid Rule

How much does it cost to apply the trapezoid rule as we go up in dimension?

$O(n)$
$O(n^2)$
…
$O(n^k)$

For many problems in graphics (like rendering), k is very, very big (e.g., tens or hundreds or thousands). Applying trapezoid rule does not scale!

5.1.4 Sampling from Discrete Probability Distributions

$x_i$ if:

P_{i - 1} < ξ \leq P_{i}

$\xi$ $\in [0,1)$

5.1.5 Sampling Continuous Random Variables using the Inversion Method

$P(x)=Pr(X<x)$ $P^{-1}(x)$ $\xi$ $\in [0,1)$ $x=P^{-1}(\xi)$ .

$p(x)$ $P(x)$ ), and also the inverse function

First try: uniformly sampling unit circle
*For the second line:

5.1.6 Rejection Sampling

5.2 Monte Carlo Rendering

5.2.1 Expected Value and Variance

Expected value:

\begin{matrix} \underset{expected value of random variable Y}{\underset{⏟}{E (Y)}} := \sum_{\begin{matrix} i = 1 \\ probability of i th outcome \end{matrix}}^{k} \underset{probability of i th outcome}{\underset{⏟}{p_{i}}} \underset{value of i th outcome}{\underset{⏟}{y_{i}}} \end{matrix}

\begin{aligned} E [\sum_{i} Y_{i}] & = \sum_{i} E [Y_{i}] \\ E [a Y] & = a E [Y] \end{aligned}

Variance:

\begin{matrix} V [Y] = E [Y^{2}] - E [Y]^{2} \\ V [\sum_{i = 1}^{N} Y_{i}] = \sum_{i = 1}^{N} V [Y_{i}] \\ V [a Y] = a^{2} V [Y] \end{matrix}

5.2.2 Law of Large Number

$N$ $N$ .

V [\frac{1}{N} \sum_{i = 1}^{N} Y_{i}] = \frac{1}{N^{2}} \sum_{i = 1}^{N} V [Y_{i}] = \frac{1}{N^{2}} N V [Y] = \frac{1}{N} V [Y]

$N$ .

5.2.3 Biasing

\int_{Ω} f (x) d x \approx \frac{1}{N} \sum_{i = 1}^{N} \frac{f (X_{i})}{p (X_{i})}

$p$ to fix the biasing caused by the weighted sampling (like importance sampling)

5.2.4 Importance Sampling

Sample the important area more. Remember divide the probability.

5.2.5 Monte Carlo Integration

According to law of large number, we know no matter how hard the integral is, we can always get the right image by taking more samples.

Keep in mind three key ideas:

Expected Value: what value do we get on average?
Variance: what’s the expected deviation from the average?
Importance Sampling: how do we (correctly) take more samples in more important regions?

$\times$ area size"

\int_{Ω} f (x) d x = lim_{N \to \infty} \frac{| Ω |}{N} \sum_{i = 1}^{N} f (X_{i})

$\Omega$ is the area size (volume for 3D, …)

$O(n^{-\frac{1}{2}})$ .

5.2.6 Russian Roulette

Randomly terminate the recursive integral of rendering equation.

5.3 Variance Reduction

Keep in mind: You can’t reduce variance of the integrand! Can only reduce variance of an estimator.

5.3.1 Estimator

An “estimator” is a formula used to approximate an integral, like the Monte Carlo estimator.

5.3.2 Bias & Consistency

Two important things to ask about an estimator

it consistent?
Is it biased?

Consistency: “converges to the correct answer”:

lim_{n \to \infty} P (| I - {\hat{I}}_{n} | > 0) = 0

Unbiased: “estimate is correct on average”:

E [I - {\hat{I}}_{n}] = 0

Example (Consistent but biased):

Example (Inconsistent but unbiased):

Rasterization and Path Tracing are neither consistent and unbiased.

Light has a very “spiky” distribution, how can we sample the lights more?

5.3.3 Bidirectional Path Tracing

Idea: connect paths from light, eye (“bidirectional”)

Example (path length is 2):

5.3.4 Metropolis-Hastings Algorithm (MH)

Good path can be hard to find!

Basic idea: prefer to take steps that increase sample value

5.3.5 Multiple Importance Sampling

\frac{1}{N} \sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{f (x_{i j})}{\sum_{k} c_{k} p_{k} (x_{i j})}

5.3.6 Sampling Patterns & Variance Reduction

How do we sample our function in the first place?

Stratified Sampling

Split into n bins, pick uniformly in each bin

Low-Discrepancy Sampling: Number of samples should be proportional to area

$A(S)$ here is the proportion of S to the whole area

Hammersley & Halton Points:

$\phi_{P_r}(i)$

$x_i = \left( \phi_{P_1}(i), \phi_{P_2}(i), \ldots, \phi_{P_k}(i) \right)$ .

$x_i = \left( i/n, \phi_{P_1}(i), \phi_{P_2}(i), \ldots, \phi_{P_{k-1}}(i) \right)$

Blue Noise: Can observe that monkey retina exhibits blue noise pattern

…

6. Animation

6.1 Keyframe

6.1.1 Keyframing

Specify important events only and computer fills in the rest via interpolation/approximation.

6.1.2 Spline Interpolation

Runge Phenomenon: Tempting to use higher-degree polynomials, in order to get higher-order continuity, but can lead to oscillation, ultimately worse approximation.

6.1.3 Natural Splines

For each interval, want polynomial “piece” pi to interpolate data (e.g., keyframes) at both endpoints:

p_{i} (t_{i}) = f_{i}, p_{i} (t_{i + 1}) = f_{i + 1}, i = 0, \dots, n - 1

Want tangents to agree at endpoints (“C1 continuity”):

p_{i}^{'} (t_{i + 1}) = p_{i + 1}^{'} (t_{i + 1}), i = 0, \dots, n - 2

Also want curvature to agree at endpoints (“C2 continuity”):

p_{i}^{″} (t_{i + 1}) = p_{i + 1}^{″} (t_{i + 1}), i = 0, \dots, n - 2

Degree of freedom:

2 n + (n - 1) + (n - 1) = 4 n - 2

6.1.4 Hermite/Bézier Spline

6.1.5 Catmull-Rom Spline

Use the difference of neighbors to define tangent.

u_{i} := \frac{f_{i + 1} - f_{i - 1}}{t_{i + 1} - t_{i - 1}}

Details: Scotty3D_Benky/assignments/A4/T1-splines.md at main · tbbbk/Scotty3D_Benky

6.1.6 Evaluation of Spline

INTERPOLATION: spline passes exactly through data points
CONTINUITY: at least twice differentiable everywhere
LOCALITY: moving one control point doesn’t affect whole curve

6.2 Dynamics

6.2.1 Inverse Kinematics

Set a goal and use algorithm (like grad descent) to come up with a plausible motion.

*Detail: Scotty3D_Benky/assignments/A4/T2-skeleton.md at main · tbbbk/Scotty3D_Benky

6.2.2 Animation EquationF

F = m a

6.2.3 Generalized Config

Collect all points all into a single vector of generalized coordinates:

q = (x_{0}, x_{1}, . . ., x_{n})

\dot{q} = (\dot{x_{0}}, \dot{x_{1}}, . . ., \dot{x_{n}})

6.2.3 Ordinary Differential Equation (ODE)

Many dynamical systems can be described via an ordinary differential equation (ODE) in generalized coordinates:

\begin{matrix} \frac{d}{d t} q = f (q, \dot{q}, t) \\ \ddot{q} = F / m \end{matrix}

Example:

6.2.4 Lagrangian Mechanics

$K$
$U$
Lagrangian $\mathcal{L} := K - U$
Dynamics then given by Euler-Lagrange equation:
$\frac{d}{d t} \frac{\partial L}{\partial \dot{q}} = \frac{\partial L}{\partial q}$

6.2.5 Particle Systems

We model phenomenon as large collection of particles

How can we solve all these things numerically?

6.2.6 Forward/Backward Euler

We can use the difference to replace derivatives.

Forward Euler

But this is very unstable!

Backward Euler

But it is unconditionally stable!

6.2.7 Symplectic Euler

Backward Euler was stable, but we also saw (empirically) that it exibits numerical damping (damping not found in original eqn.). Nice alternative is symplectic Euler: