Machine Perception

Course description

Lectures:

  1. Overview of the field of Machine perception and scientific challenges
  2. Image processing
    1. Image formation
    2. Binarization, morfology, segmentation
    3. Colour spaces and colour perception
    4. Linear and nonlinear filters
  3. Image derivatives and edge perception
    1. Derivative-based edge perception
    2. Edge-based object perception
    3. Parametric shape perception
  4. Model fitting
    1. Normal equations
    2. Homogenous systems
    3. Robust approaches
  5. Local features
    1. Corner perception
    2. Local descriptors in scale space and affine adaptation
  6. Stereoscopy and depth perception
    1. Calibrated and uncalibrated systems and reconstruction
  7. Object recognition
    1. Subspace methods (PCA, LDA)
    2. Local-features-based recognition
  8. Object detection
    1. Visual features and detection approaches
  9. Motion perception
    1. Local motion perception and object tracking

Exercises:

Exercises will take a form of project-oriented exercises in properly equipped student laboratories. Students will implement various algorithms and test them on different datasets using a variety of sensor systems. Exercises will support an in-depth understanding of the theory. They will also encourage independent thinking and creativity.

Objectives and competences

In the framework of this course, the students will acquire concrete knowledge and skills in the area of machine perception. The students will develop competences in low-level image processing, 3D geometry of stereo systems, object detection, object recognition, and motion extraction in video sequences. The students will also practice mathematical basics crucial for solving demanding engineering problems, which are essential for analysis of complex signals such as images and video.

In addition, the students will obtain the following competences:

  • The ability to understand and solve professional challenges in computer and information science.

  • The ability of professional communication in the native language as well as a foreign language.

  • The ability to independently perform both less demanding and complex engineering and organisational tasks in certain narrow areas and independently solve specific well-defined tasks in computer and information science.

Learning and teaching methods

Lectures, laboratory exercises in computer classroom with active participation. Individual work on exercises. Theory from the lectures made concrete with hands-on laboratory exercises. Special emphasis will be put on continuous assessment at exercises.

Intended learning outcomes

After completing this course a students will be able to:

– understand computer technology and computational methodology for use and development of components for machine vision systems,

– understand the basics of low-level image processing,

– understand the basics of 3D geometry of stereo systems,

– understand the basics of object detection, object recognition,

– know basic motion extraction techniques in video sequences,

– analyze modern computer vision and machine vision algorithms,

– use computer technology and computational methodology for specific applications of autonomous intelligent cognitive systems.

Reference nosilca

LUKEŽIČ, Alan, ČEHOVIN ZAJC, Luka, KRISTAN, Matej. Deformable parts correlation filters for robust visual tracking. IEEE transactions on cybernetics, ISSN 2168-2267, 2017, vol. , no. , str. 1-13, [COBISS.SI-ID 1537625283],

KRISTAN, Matej, SULIĆ KENK, Vildana, KOVAČIČ, Stanislav, PERŠ, Janez. Fast image-based obstacle detection from unmanned surface vehicles. IEEE transactions on cybernetics, ISSN 2168-2267 , 2016, vol. 46, no. 3, str. 641-654, [COBISS.SI-ID 1536310979],

KRISTAN, Matej, MATAS, Jiří, LEONARDIS, Aleš, VOJÍŘ, Tomáš, PFLUGFELDER, Roman, FERNÁNDEZ, Gustavo, NEBEHAY, Georg, PORIKLI, Fatih, ČEHOVIN ZAJC, Luka. A novel performance evaluation methodology for single-target trackers. IEEE transactions on pattern analysis and machine intelligence, ISSN 0162-8828. [Print ed.], Nov. 2016, vol. 38, no. 11, str. 2137-2155, [COBISS.SI-ID 1536872643]

URŠIČ, Peter, LEONARDIS, Aleš, SKOČAJ, Danijel, KRISTAN, Matej. Learning part-based spatial models for laser-vision-based room categorization. The international journal of robotics research, ISSN 0278-3649, 2017, vol. 36, no. 4, str. 379-402, [COBISS.SI-ID 1537424323]

ČEHOVIN, Luka, KRISTAN, Matej, LEONARDIS, Aleš. Robust visual tracking using an adaptive coupled-layer visual model. IEEE trans. pattern anal. mach. intell.. [Print ed.], 2012, str. [1-14], [COBISS.SI-ID 9431124]

Celotna bibliografija je dostopna na SICRISu:

http://sicris.izum.si/search/rsr.aspx?lang=slv&id=32801.

Study materials

Obvezna:

  • D. Forsyth and J. Ponce, Computer Vision: A modern approach, Prentice Hall 2011.
  • R. Szeliski,Computer Vision: Algorithms and Applications, Springer, 2011

Dopolnilna:

  • H. R. Schiffman: Sensation and Perception, An Integrated Approach, John Wilez & Sons 2001.

Izbrani članki iz revij IEEE PAMI, CVIU, IJCV, Pattern Recognition (dostopno na spletu)

Bodi na tekočem

Univerza v Ljubljani, Fakulteta za elektrotehniko, Tržaška cesta 25, 1000 Ljubljana

E:  dekanat@fe.uni-lj.si T:  01 4768 411