The University of Auckland - COMPSCI 773 S1 T

Computer Science

Vision Guided Control (Applied Early Vision)

COMPSCI 773 S1 T

Introduction

This course introduces computational methods and techniques used in popular vision-based research areas such as 2/3D face recognition, 3D scene reconstruction, 2D and 3D hand posture recognition that ar enow at the forefront of HCI and 3D Vision applications in the industry . Many topics are only reviewed, but a number of interesting theoretical and practical problems are analysed in detail. You will be able to acquire knowledge currently used in the latest technological advances available. This course is a must for students eager to pursue post-graduate studies and/or a career in applied computer vision.

Design of modern control systems involves different mathematical tools, especially, optimisation techniques, matrix analysis, and analytic 2D/3D geometry. Some tools will be explained in brief in the lectures. Still, you are expected to learn these methods in details and use them to complete assignments.

Programming in Computer Vision is usually undertaken in C, C++, C# . You are expected to be pro-efficient in at least one of the above mentioned programming language or show a strong willingness to learn.

In 2009, we are offering the opportunity to program Assignment 1 and tentatively Assignment 2 in Java using classes provided by the lecturers. It is expected that your programs will be uploaded and will have to work with the following Online Stereo Vision application

Assessment

Assessment is based on 60% course work (30% group work, 30% individual work) and 40% open-book final examination. Course work includes one-to-one oral test with the lecturers and assignments that exploit the digital cameras (Quickcam Pro 9000) available in our research labs at Tamaki (room 731.234). For each assignment, each group will have to write a report which should be organised as follows:

Each member of the group works on a distinct part of the assignment and writes an individual report
Each group provides a shorter group report presenting the group solution and achievements for each assignment. Basically the group report should consist of an introduction of the problem and different solutions proposed.
Both the individual and group reports should show students' abilities to:
- analyse a problem.s
- propose feasible solutions based on materials taught during the lectures or learnt while reading research papers.
- use statistical tools to assess their experimental results.

Course work

A particular feature of the course work is the emphasis on complete system design. Therefore, instead of picking a small part of the material covered in lectures as assignment tasks, the project in this paper has the aim of developing a complete system to perform a specified task. The individual assignments present intermediate steps toward achieving this goal. At the end of the paper, each group will present his completed project.

The course relies on the Postgraduate room for lab sessions and equipment, namely, a number of PCs running Windows and a few web-cameras to complete your project. We may also use our 3D scanner and/or stereo-vision systems for 3D face acquisition.

This year course projects will encompass topics such as stereo vision, camera calibration, image pairs rectification, feature extraction and classification with a focus on depth maps and recognition tasks based on 2D and 3D (depth) image information. Basically, you will have to:

Implement matching algorithms, display depth map, visualise the 3D data (optional)
Design an interface to acquire synchronised images from 2 USB cameras.
Calibrate cameras, rectify image pairs
Authenticate faces in the labs

Extract faces from images and identify them using advanced statistical analysis techniques such as PCA, LDA.
Fuse stereo-vision face data (depth map) and readily available face texture for 2+3D face recognition

The work is subdivided into three assignments (see Assignments panel) covering the following parts of the project:

Image / stereo matching algorithms.
Calibration of stereo cameras for computing 3D positions of a desired item in the cameras field-of-view(you will use in this assignment the existing Tsai calibration software (or any suitable calibration method you might have researched).
You will use statistical analysis techniques to recognise face.

Face authentication for faces localisation, face mask extraction, and face recognition at real-time.
2+3D face authentication (fusing depth map and face texture images)

Whole system testing.

The schedule of these assignments is as follows:

	Theme	Due date
Assignment 1	Image matching and stereo matching algorithms implementation, depth map construction, 3D visulisation (OpenGl, optional)	27/03 (might be extended to 03/04)
Assignment 2	Camera calibration, USB cameras synchronisation for image acquisition, image pairs rectifications	08/05
Assignment 3	2+3D Face recognition (tentatives) Whole System Testing (live demo)	Last lecture

Basic Topics of the Course

Low-level Image processing (for handouts: go to "Lectures")
- Colour detection and discrimination
- Binary image segmentation
- Image matching

Stereo-vision

Stereo matching

Camera calibration and projective geometry (for handouts: go to "Lectures")
- 2D and 3D vision geometry
- Single camera calibration
- Stereo calibration: epipolar geometry, traingulation, rectification
Face expression and face recognition

Feature extraction: face
Feature classification (PCA, LDA)