Lecture 11: Image Processing
HTML Slides
│ PDF Slides
│ Demo code on GitHub

Topic overview#
- Image representation
- File formats and compression
- Preprocessing for image recognition tasks
Resources used:
Images as 2D signals#
- The light that enters a camera can be modelled as continuous signal:
$$f(x, y),\space -\infty < x, y < \infty$$
- Digital images are sampled:
$$f[n, m], \space n = n \Delta_x, m = n \Delta_y$$
where typically $\Delta_x = \Delta_y$
- The area $\Delta_x \times \Delta_y$ is called a picture element, or pixel

Image channels#
- A photosensor responds to light intensity with an electrical signal
- Physical filters restrict the colour that reaches each sensor
- On most digital cameras, 1 sensor $\ne$ 1 pixel; it is instead interpolated to create a typical 3-channel image
- (Inexplicably, the Python imaging library Pillow uses the term “band”)
