Timing and Structure
Michaelmas term. 16 lectures (including 3 examples classes). Assessment: 100% exam
The aims of the course are to:
- introduce the principles, models and applications of computer vision.
- cover image structure, projection, stereo vision, structure from motion and object detection and recognition.
- give case studies of industrial (robotic) applications of computer vision, including visual navigation for autonomous robots, robot hand-eye coordination and novel man-machine interfaces.
As specific objectives, by the end of the course students should be able to:
- design feature detectors to detect, localise and track image features.
- model perspective image formation and calibrate single and multiple camera systems.
- recover 3D position and shape information from arbitrary viewpoints;
- appreciate the problems in finding corresponding features in different viewpoints.
- analyse visual motion to recover scene structure and viewer motion, and understand how this information can be used in navigation;
- understand how simple object recognition systems can be designed so that they are independent of lighting and camera viewpoint.
- appreciate the commerical and industrial potential of computer vision but understand its limitations.
- Introduction (1L)
Computer vision: what is it, why study it and how ? The eye and the camera, vision as an information processing task. Geometrical and statistical frameworks for vision. 3D interpretation of 2D images. Applications.
- Image structure (3L)
Image intensities and structure: edges, corners and blobs. Edge detection, the aperture problem and corner detection. Image pyramids, blob detection with band-pass filtering. The SIFT feature descriptor for matching. Characterising textures.
- Projection (3L)
Orthographic projection. Planar perspective projection. Vanishing points and lines. Projection matrix, homogeneous coordinates. Camera calibration, recovery of world position. Weak perspective and the affine camera. Projective invariants.
- Stereo vision and Structure from Motion (2L)
Epipolar geometry and the essential matrix. Recovery of depth by triangulation. Uncalibrated cameras and the fundamental matrix. The correspondence problem. Structure from motion. 3D shape examples from multiple view stereo.
- Deep Learning for Computer Vision (4L)
Basic architectures for deep learning in computer vision. Object detection, classification and semantic segmentation. Object recognition, feature embedding and metric learning. Transformers for computer vision and self-supervised learning.
- Example classes (3L)
Discussion of examples papers and past examination papers will be integrated with lectures.
Please refer to the Booklist for Part IIB Courses for references to this module, this can be found on the associated Moodle course.
Please refer to Form & conduct of the examinations.
The UK Standard for Professional Engineering Competence (UK-SPEC) describes the requirements that have to be met in order to become a Chartered Engineer, and gives examples of ways of doing this.
UK-SPEC is published by the Engineering Council on behalf of the UK engineering profession. The standard has been developed, and is regularly updated, by panels representing professional engineering institutions, employers and engineering educators. Of particular relevance here is the 'Accreditation of Higher Education Programmes' (AHEP) document which sets out the standard for degree accreditation.
The Output Standards Matrices indicate where each of the Output Criteria as specified in the AHEP 3rd edition document is addressed within the Engineering and Manufacturing Engineering Triposes.
Last modified: 28/09/2022 11:29