Undergraduate Teaching 2024-25

Engineering Tripos Part IIA Project, SF3: Machine Learning, 2024-25

Engineering Tripos Part IIA Project, SF3: Machine Learning, 2024-25

Not logged in. More information may be available... Login via Raven / direct.

PDF versionPDF version

Leader

Prof J M H Lobato

Leader

Prof C E Rasmussen

Timing and Structure

Students work to their own schedule. A staffed "surgery" runs every weekday 10-11am to give help, advice and feedback.

Prerequisites

Part I computing; Either of 3F3 or 3F8

Aims

The aims of the course are to:

  • expose students to machine learning approaches to non-linear regression
  • to gain practical experience necessary to use these techniques successfully (e.g the use of training and test sets for evaluation, optimisation etc.)
  • to understand the robustness of these approaches to challenging real world phenomena including noise and non-linearities

Content

 

In this project, students will consider the inverted pendulum system receiving a software simulator of a cart with a pendulum attached written in Python.
 
The goal will be to learn a controller that balances the pendulum in a data-driven way. The students will initially learn how to operate the simulator and explore the different types of behaviour that the system can exhibit. Next, they will collect training data from the simulator and use this to train non-linear regression models, including linear regression with non-linear basis functions. The trained models will be assessed on test data from the simulator. Once accurate models are learned these will be used to learn controllers that can balance the pendulum in the upright position and keep it there. Finally, the controllers and the models will will be stress tested in various ways to test their robustness. 
 
Students work individually for this project. 
 

Week 1

Explore the cart-pendulum system using the simulator. Understand the state space and the governing differential equations.
 

Week 2

Gather training and test data from the simulator for building models of the system and validating them. Fit various models and assess their quality.
 

Week 3

Define a function that maps from the system's state to control actions (the "policy"), optimise the policy to keep pendulum upright. 
 

Week 4

Stress-test control and learning systems in various ways. 
 

Coursework

Coursework Due date Marks

Interim report

TBC

20

Final report

TBC

60

 

Examination Guidelines

Please refer to Form & conduct of the examinations.

 
Last modified: 29/11/2024 15:20