Computer Science
Vision Guided Control and Robotics
COMPSCI 773 S1 T
Introduction
This course introduces computational methods and techniques used in vision-based robotics and real-time control. Many topics are only overviewed, but a number of interesting theoretical and practical problems are analysed in detail. You should not expect exciting things which may be found in sci-fi books or movies like "Terminator" as you will soon find out that even a seemingly simple robotic action may present a real challenge.
Design of modern robotic and industrial control systems involves different mathematical tools, especially, optimization techniques, matrix analysis, and analytic 2D / 3D geometry. Some tools will be explained in brief in the lectures. Still, you are expected to learn these methods in details and use them to complete assignments.
Assessment
Assessment is based on 60% course work (30% group work, 30% individual work) and 40% open-book final examination. Course work includes one-to-one oral test and assignments that exploit the hardware (digital cameras and PCs) available in the CITR Robotics Lab at Tamaki (room 731.234). For each assignment, each group will have to write a report which should be organised as follows:- Each member of the group works on a distinct part of the assignment and writes an individual report
- Each group provides a group report presenting the group solution and achievements for each assignment. Basically the group report should consist of an introduction of the problem and different solutions proposed.
- Both the individual and group reports should show students'
abilities to:
- analyse a problem
- propose feasible solutions based on materials taught during the lectures or learnt while reading research papers
- use statistical tools to assess their experimental results
Course work
A particular feature of the course work is the emphasis on complete system design. Therefore, instead of picking a small part of the material covered in lectures as assignment tasks, the project in this paper has the aim of developing a complete system to perform a specified task. The individual assignments present intermediate steps toward achieving this goal. At the end of the paper, there will be a competition to evaluate your project.
The equipment in the CITR Robotics Lab consists of a number of PCs running Linux and a few digital video cameras to be used to perform hand-gesture recognition and user's face authentication. Also there are two pan tilt digital cameras forming a stereo system for sensing and describing a 3D working space where hand gestures should be recognised and traced.
Nowadays, Human - Computer Interaction is a hot research topic. It consists mainly on extracting information (from audio-visual speech, visual expression, hand signs, body expression) to interact efficiently with a robot or a machine via a computer. Potential applications range from automatic speech recognition (ASR), videoconference, virtual reality, communication for disabled people, user verification and recognition (audiovisual biometrics features), to remote control of robots, vehicles, and devices.
This year course projects will encompass topics such as stereo
vision
and 3D positioning, feature extraction and classification
with a focus on real-time processes for efficient
interaction. Basically, you will have to:
- Find a 3D trajectory of a hand motion and specify particular objects using stereo vision and hand gesture recognition.
- Your task will be to track your hand movement along an arbitrary path but with mandatory check points in a 3D field. You will have to use specific gesture to name the check point and create a meaningful 3D description of the work field.
- Autentify faces in the labs
- Extract faces from images and identify them using advanced
statistical analysis techniques such as PCA, LDA.
- Deal with putting analysis
The work is subdivided into three assignments covering the following parts of the project:
- Calibration of stereo cameras for computing 3D positions of a desired item in the cameras field-of-view by intersecting optical rays and visualising 3D movements of the item (you will use in this assignment the existing Tsai calibration software but also do several programming tasks in networking and client-server camera access). You will build a GUI which will display the 3D positions and trajectories any object part of the field.
- Hand gesture recognition for hand localisation, hand mask extraction, and hand signs recognition at real-time.
- Golf Putter and ball tracking for putter and ball localisation at real-time. 3D field surface description.
- Face authentication for faces localisation, face mask extraction, and face recognition at real-time.
- Whole system testing. You will integrate your previous
work
to:
- effectively recognise hand signs and use these latter to indicate objects
- automatically track hand movement and restore its 3D trajectory in the work field
- display a sequence of check point objects visited by a moving hand
-
The preliminary schedule of these assignments is as follows:
Theme | Due date | |
---|---|---|
Assignment 1 | Camera calibration, line detection, and 3D GUI | TBA |
Assignment 2 | HG-Face recognition / 3D Golf putting tracking | TBA |
Assignment 3 | Whole System Testing | TBA |
Basic Topics of the Course
- Active Vision
- Camera calibration and projective geometry
- 2D and 3D vision geometry
- Single camera calibration
- Stereo calibration: epipolar geometry, triangulation
- Low-level Image processing
- Colour detection and discrimination
- Binary image segmentation
- Image matching / dynamic programming
- Video sequence processing
- Object tracking and image filtering
- Real-time image processing
- 3D scene description and understanding
- Gesture and Face recognition
- Feature extraction: hand and face
- Feature classification (PCA, LDA, ICA)
- Control and Robotics
- Networking and Videoservers
- Basics of random sequences
- Adaptive and Kalman filtering
Student groups in 2004
The four groups are as follows (only ID numbers):
Group 1 | |
|
|
|
---|---|---|---|---|
Group 2 | |
|
|
|
Group 3 | |
|
|
|
Marks
Students |
Assignment1 |
Assignment2 |
Assignment3 |
|
|
|
-
Related Programmes