Front Matter
F4 · What's Inside

What's Inside

Every chapter of this book is built from a small set of recurring elements, each with a specific job. This page is a live preview: the boxes, code, and figures below are real samples of what you will meet on nearly every page, so you can learn to read the book's visual language before Chapter 0.

The Shape of the Journey

Figure F4.1 shows the four-part arc at a glance. Each part stands on the one before it, and the recurring ideas (convolution, denoising, geometry) travel left to right across all four.

Part I · Pixels Image Processing Part II · Geometry Classical Vision Part III · Learning Deep Networks Part IV · Creation Generative Models
Figure F4.1 The four-part arc: pixels, geometry, learning, generation. Each stage builds directly on the previous one.

Epigraphs: A Light Beginning

Every section opens with a short quotation from a fictional vision persona, a sensor, a kernel, a slightly overfit transformer, whose complaint or boast captures the section's theme. The one below is typical of the species.

"I detect edges. Whether they are the edges of a cat or the edges of a shadow shaped like a cat is, respectfully, not my department."

An Edge Detector Who Sees Things in Black and White

Callouts That Carry the Argument

Colored boxes do structured work throughout the book. A Big Picture box opens each section and tells you where you are in the larger story; the sample below shows the format.

Big Picture

Boxes like this one start every section. They state what the section teaches, why it matters now, and how it connects backward and forward, so you are never more than a paragraph away from knowing why you are reading.

Several siblings follow it through each section. Key Insight boxes distill the one idea worth remembering after the details fade. Practical Example boxes tell realistic industry mini-stories: who faced the problem, what they decided, and what it cost or saved. Research Frontier boxes connect the material to work from 2024 through 2026, with named methods and papers. Warning and Tip boxes flag the potholes and shortcuts, and an occasional Fun Fact keeps the journey humane. Exercises arrive in the same dress, labeled by type.

Code You Can Run

Concepts in this book are executable. Sections build their ideas in short, runnable Python, like the pipeline below, and every code block carries a specific caption underneath explaining what it demonstrates.

import cv2

image = cv2.imread("street.jpg")                # uint8 BGR array, shape (H, W, 3)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)  # collapse to one channel
edges = cv2.Canny(gray, threshold1=100, threshold2=200)
cv2.imwrite("street_edges.png", edges)
print(edges.shape, edges.dtype)                 # e.g. (720, 1280) uint8
Five lines from photograph to edge map: the kind of small, complete pipeline that anchors most sections.

Library Shortcuts: The "Right Tool" Principle

Whenever the book builds something from scratch, it also shows the production version. The from-scratch code teaches; the library call ships. These pairings appear in boxes like the following.

Library Shortcut: scikit-image in Practice

After a section derives adaptive histogram equalization in roughly thirty lines, a box like this shows the two-line production equivalent:

from skimage import exposure
balanced = exposure.equalize_adapthist(image, clip_limit=0.03)
CLAHE in two lines: scikit-image handles the tiling, clipping, and interpolation internally.

Figures, Exercises, and References

Diagrams are drawn inline as SVG with numbered captions, like Figure F4.1 above, so they stay crisp at any zoom and render in any browser. Each section closes with two or three exercises spanning three types: conceptual (reason about the idea), coding (build or modify a pipeline), and analysis (measure something and interpret the result). Each chapter ends with a What's Next bridge to the following chapter and an annotated bibliography of 8 to 15 real, hyperlinked references: foundational papers, books, tools, tutorials, and datasets.

Tools of the Trade and the Capstone

Each part closes with a Tools of the Trade chapter (Chapters 8, 17, 29, and 38), a consolidated reference to the libraries, datasets, and tooling for that layer of the stack. They are designed for dipping into long after your first read. Finally, the capstone project assembles everything into one end-to-end vision system, the proof that the four parts were one story all along.

That is the toolkit. The next page shows how to route your own path through it.