Prof R Cipolla and Dr R Turner
Timing and Structure
Michaelmas term. 16 lectures (including 3 examples classes). Assessment: 100% exam
The aims of the course are to:
- introduce the principles, models and applications of computer vision.
- cover image structure, projection, stereo vision, structure from motion and object detection and recognition.
- give case studies of industrial (robotic) applications of computer vision, including visual navigation for autonomous robots, robot hand-eye coordination and novel man-machine interfaces.
As specific objectives, by the end of the course students should be able to:
- design feature detectors to detect, localise and track image features.
- model perspective image formation and calibrate single and multiple camera systems.
- recover 3D position and shape information from arbitrary viewpoints;
- appreciate the problems in finding corresponding features in different viewpoints.
- analyse visual motion to recover scene structure and viewer motion, and understand how this information can be used in navigation;
- understand how simple object recognition systems can be designed so that they are independent of lighting and camera viewpoin.
- appreciate the industrial potential of computer vision but understand the limitations of current methods.
- Introduction (1L)
Computer vision: what is it, why study it and how ? The eye and the camera, vision as an information processing task. A geometrical framework for vision. 3D interpretation of 2D images. Applications.
- Image structure (3L)
Image intensities and structure: edges, corners and blobs. Edge detection, the aperture problem. Corner and blob detection. Contour extraction using B-spline snakes. Texture. Feature descriptors and matching.
- Projection (3L)
Orthographic projection. Planar perspective projection. Vanishing points and lines. Projection matrix, homogeneous coordinates. Camera calibration, recovery of world position. Weak perspective and the affine camera. Projective invariants.
- Stereo vision and Structure from Motion (3L)
Epipolar geometry and the essential matrix. Recovery of depth. Uncalibrated cameras and the fundamental matrix. The correspondence problem. Structure from motion. 3D shape from multiple view stereo.
- Object detection and recognition (3L)
Basic target detection and tracking. Machine learning for object detection and recognition. Random decision forests, support vector machines and boosting. Deep learning with convolutional neural networks.
- Example classes (3L)
Discussion of examples papers and past examination papers.
Please see the Booklist for Group F Courses for references for this module.
Please refer to Form & conduct of the examinations.
The UK Standard for Professional Engineering Competence (UK-SPEC) describes the requirements that have to be met in order to become a Chartered Engineer, and gives examples of ways of doing this.
UK-SPEC is published by the Engineering Council on behalf of the UK engineering profession. The standard has been developed, and is regularly updated, by panels representing professional engineering institutions, employers and engineering educators. Of particular relevance here is the 'Accreditation of Higher Education Programmes' (AHEP) document which sets out the standard for degree accreditation.
The Output Standards Matrices indicate where each of the Output Criteria as specified in the AHEP 3rd edition document is addressed within the Engineering and Manufacturing Engineering Triposes.
Last modified: 04/11/2014 10:09