|
Vision Guided ControlCOMPSCI 773 S1 TIntroductionThis course introduces computational methods and techniques used in vision-based real-time control. Many topics are only overviewed, but a number of interesting theoretical and practical problems are analysed in detail. You should not expect exciting things which may be found in sci-fi books or movies like "Terminator" as you will soon find out that even a seemingly simple automatic real-world action may present a real challenge. Design of modern control systems involves different mathematical tools, especially, optimization techniques, matrix analysis, and analytic 2D / 3D geometry. Some tools will be explained in brief in the lectures. Still, you are expected to learn these methods in details and use them to complete assignments. AssessmentAssessment is based on 60% course work (30% group work, 30% individual work) and 40% open-book final examination. Course work includes one-to-one oral test and assignments that exploit the hardware (digital cameras and PCs) available in the CITR Robotics Lab at Tamaki (room 731.234). For each assignment, each group will have to write a report which should be organised as follows:
Course workA particular feature of the course work is the emphasis on complete system design. Therefore, instead of picking a small part of the material covered in lectures as assignment tasks, the project in this paper has the aim of developing a complete system to perform a specified task. The individual assignments present intermediate steps toward achieving this goal. At the end of the paper, there will be a competition to evaluate your project. The equipment in the CITR Active Vision Lab consists of a number of PCs running Linux/Zindows and a few digital video cameras to be used to perform hand-gesture recognition and user's face authentication. Also there are two pan tilt digital cameras forming a stereo system for sensing and describing a 3D working space where hand gestures should be recognised and traced. Nowadays, Human - Computer Interaction is a hot research topic. It consists mainly on extracting information (from audio-visual speech, visual expression, hand signs, body expression) to interact efficiently with a robot or a machine via a computer. Potential applications range from automatic speech recognition (ASR), videoconference, virtual reality, communication for disabled people, user verification and recognition (audiovisual biometrics features), to remote control of robots, vehicles, and devices. This year course projects will encompass topics such as stereo
vision
and 3D positioning, feature extraction and classification
with a focus on real-time processes for efficient
interaction. Basically, you will have to:
The work is subdivided into three assignments covering the following parts of the project:
The schedule of these assignments is as follows:
Basic Topics of the Course
|
|