Computer Science


Vision Guided Control (Early applied vision)

COMPSCI 773 S1 T

Introduction

This course introduces computational methods and techniques used in popular vision-based research areas such as 2/3D face recognition, 3D scene reconstruction, etc... Many topics are only overviewed, but a number of interesting theoretical and practical problems are analysed in detail. While most seemingly simple automatic real-world actions present a real challenge you will be able to acquire knowledge currently used in the latest technological advances available. This course is a must for students eager to pursue post-graduate studies and/or a career in applied computer vision.

Design of modern control systems involves different mathematical tools, especially, optimization techniques, matrix analysis, and analytic 2D/3D geometry. Some tools will be explained in brief in the lectures. Still, you are expected to learn these methods in details and use them to complete assignments.
Programming will be undertaken in C, C++, C#. You are expected to be pro-efficient in at least one of the above mentioned programming language or show a strong willingness to learn. Java-only students with a strong motivation should be able to progress enough in any of these languages within the first few weeks of the course.

Assessment

Assessment is based on 60% course work (30% group work, 30% individual work) and 40% open-book final examination. Course work includes one-to-one oral test and assignments that exploit the hardware (digital cameras and PCs) available in our research labs at Tamaki (room 731.234). For each assignment, each group will have to write a report which should be organised as follows:

Course work

A particular feature of the course work is the emphasis on complete system design. Therefore, instead of picking a small part of the material covered in lectures as assignment tasks, the project in this paper has the aim of developing a complete system to perform a specified task. The individual assignments present intermediate steps toward achieving this goal. At the end of the paper, there will be a competition to evaluate your project.

The equipment in the CITR Active Vision Lab consists of a number of PCs running Windows and a few web-cameras to be used to perform HCI applications (face recognition, dynamic 3D face animation, face expression recognition).  We may also use our 3D scanner and stereo-vision systems for 3D face acquisition.

Nowadays, Human - Computer Interaction is a hot research topic. It consists mainly on extracting information (from audio-visual speech, visual expression, hand signs, body expression) to interact efficiently with a robot or a machine via a computer. Potential applications range from automatic speech recognition (ASR), videoconference, virtual reality, communication for disabled people, user verification and recognition (audiovisual biometrics features), to remote control of robots, vehicles, and devices.

This year course projects will encompass topics such as stereo vision, 3D positioning, feature extraction and classification with a focus on real-time processes for efficient interaction. Basically, you will have to:

The work is subdivided into three assignments covering the following parts of the project:

  1. Calibration of stereo cameras for computing 3D positions of a desired item in the cameras field-of-view by intersecting optical rays and visualising 3D movements of the item (you will use in this assignment the existing Tsai calibration software (or any suitable calibration method you might have researched). You will also do several programming tasks in camera control, vdeo-stream synchronization, video display, etc...).
  2. You will use statistical analysis techniques to recognise face. You will track and display face features using set markers. 
    1. Face authentication for faces localisation, face mask extraction, and face recognition at real-time.
    2. 3D face authentication (fusing depth map and face texture images)
  1. Whole system testing. You will integrate your previous work to:
    1. Effectively track faces and face features from synchronised video-images.
    2. Effectively recognise faces in 2D and 3D.

The schedule of these assignments is as follows:

 

Theme

Due date

Assignment 1

Camera calibration, USB camera image/video acquisition, and 3D GUI

March 21 (preamble) and April 11 (full assignment), 2008

Assignment 2

3D face features tracking, 3D face animation, 2D face and face expression recognition

16 May 2008

Assignment 3

Whole System Testing (live demo)

Last day of lectures of the semester

Basic Topics of the Course

Notice that there are no lectures during the Graduation Week, on Friday May 9, 2008

  1. Camera calibration and projective geometry (for handouts: go to "Lectures")
    • 2D and 3D vision geometry
    • Single camera calibration
    • Stereo calibration: epipolar geometry, triangulation
  2. Low-level Image processing (for handouts: go to "Lectures")
    • Colour detection and discrimination
    • Binary image segmentation
    • Image matching
  3. Video sequence processing
    • Object tracking and image filtering
    • Real-time image processing
  4. Face expression and face recognition
    •  Feature extraction: face
    •  Feature classification (PCA, LDA)
  5. Stereo-vision
    •  Epipolar geometry, rectification
    •  Stereo matching
  6. Elements of real-time control
    •  Kalman filtering

773 Groups

Groups

1

2

3

4

Students Name/
UPI

TBA

TBA

TBA

TBA


Apply now!


Handbook

Postgraduate study options

Computer Science Blog



Please give us your feedback or ask us a question

This message is...


My feedback or question is...


My email address is...

(Only if you need a reply)

A to Z Directory | Site map | Accessibility | Copyright | Privacy | Disclaimer | Feedback on this page