16:332:579 ADVANCED TOPICS IN COMPUTER ENGINEERING:
Computer Vision
Fall 2009. Index No. 30782

Description

The goal of the course is to provide a state-of-the-art overview to some of the recent computer vision methods, starting from the cameras and ending with motion interpolation. In general, the latest algorithms will be discussed, but this course will not present anything done in our laboratory, even if this sometimes gives better results. Those results will be presented in 16:332:570 in the Spring 2010. The course is organized like a seminar. The material is presented from different sources and papers will be assigned almost every week. The code for some of the papers will be also available from the web and good knowledge of MATLAB (at least) is required. Previous exposure to computer vision, for example, the course 16:332:561 or equivalent, is needed in order to take the most from this course.

Schedule

Tuesday 3:20--6:20 pm, CoRE 538.

Instructor

Peter Meer       CoRE 519. Ph: (732) 445-5243. E-mail: meer@caip.rutgers.edu
Office hours: During the day of the lecture, or by appointment.

Textbook

R. I. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, 2000 or the second edition, 2004.   Notation:HZSec.x.x. The references from the second edition, 2Sec.x.x, are shown in parenthesis.
This book will be used a lot. If you do graduate work in a related area you probably should have it. The second book is not a textbook.
W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery. Numerical Recipes in C. Cambridge University Press, second edition, 1992.   Notation:NRSec.x.x.
Is a wonderful collection of programs in C, which we will use it in a MATLAB version. You laboratory probably have a copy of it.

Read for the Lectures

Tentative Outline. All the slides, more details, will come as we go along.
Lecture 1. Projective geometry. Cameras. Different projections.
The slides of this and the following lecture. You need to know the matrix inversion in block form too.
Ref: HZChap.1 without Sec.1.6 (2Chap.2), projective geometry in 2D;       HZSec.7.6 (2Sec.8.6), vanishing points, line;       HZSec.2.1, Sec.2.2 but not from the Plücker matrices on, Sec.2.4 to Sec.2.7 (2Sec.3.x), the 3D representations;       HZChap.5 without Sec.5.3.6 and Sec.5.4 (2Chap.6), camera models;       HZSec.A3.2.1 (2Sec.A4.2.1) and NRSec2.9, Cholesky factorization.


Lecture 2. Continuation of lecture 1.

Lecture 3. Estimation of computer vision problems.
The slides of this lecture.
Ref: NRSec.2.8 and Sec.3.1, polynomial interpolation (the Vandermonde system, also good on the web, Wikipedia "Vandermonde matrix"); other polynomial bases are in NRSec5.8, Sec.4.5, Bernstein polynomial is in Wikipedia, but we will not cover them;       NRSec.3.3 cubic spline interpolation;       NRSec.15.1, least squares as maximum likelihood estimator; HZSec.A3.3 and Sec.A3.4 (2Sec.A4.3, 2Sec.A4.4, 2Sec.A5.1, 2Sec.A5.2), but specific least squares appendices which will be covered later;       Wikipedia's "Lagrange multipliers" is a good start;       NRSec10.1 and Sec.10.2, bisection methods in 1D;       NRSec.10.4, downhill simplex;       NRSec.9.6 or HZSec.A4.1 (2Sec.A6.1), Newton's method;       HZSec.A4.2 and Sec.A4.3 (2Sec.A6.2, 2Sec.A6.3, 2Sec.A6.6, 2Sec.A6.7) and NRSec.15.5, Levenberg-Marquardt iterations;       A pattern recognition/machine learning techinque, principal components analysis, in any books which contains the PCA too.

Lecture 4. Camera calibration. A few methods.
The slides of this lecture.
Ref: HZChap.6 (2Chap.7), camera calibration;       HZSec.7.5-7.7 (2Sec.8.5, 2Sec.8.6, 2Sec.8.8-2Sec.8.10), calibration with absolute conic or vanishing points and lines;       HZSec.A.3.1.1 (2Sec.A4.1.1), Givens rotations and RQ decomposition.

Lecture 5. Interest points in 2D images.
The slides of this lecture.
Ref: The currently most popular approaches are given below.

Lecture 6. RANSAC and some other robust fittings.
The slides of this lecture.
Ref:HZSec.3.7 (2Sec.4.7, 2Sec.A6.8), RANSAC.

Fourier transform, Gabor filters. Textons.
The slides of this lecture.
Ref: The following two papers give you a good introduction to textons.

Texture synthesis.
The slides of this lecture.
Ref: The following two papers give a computer vision type approach. Other methods also exist, mainly in computer graphics.

Lecture 7. Homograpy and error analysis.
The slides of this lecture.
Ref: HZSec.3.1-3.6 and Sec.3.8 (2Sec.4.1-4.6, 2Sec.4.8), homograpy;       HZChap.4 (2Chap.5), error analysis;       HZSec.A3.2 (2Sec.A4.2), the 3x3 skew-symmetric matrix including its matrix of cofactor;       HZSec.A3.3 and Sec.A3.4 (2Sec.A5.3, 2Sec.A5.4), different variants of least-squares fitting;       HZSec.A4.4 (2Sec.A6.4), Levenberg-Marquardt applied to homograpy.

Lecture 8. What can be extracted from a single 2D image.
The slides of this lecture.
Ref:HZSec.7.1 without Plücker line representation and Sec.7.4 (2Sec.8.1 and 2Sec.8.4), single view geometry;       HZSec.A5.2 (2Sec.A7.2), planar homologies;       HZSec.A5.3 (2Sec.A7.3), elations.

Lecture 9. Epipolar geometry in details.
The slides of this lecture.
Ref: HZChap.8 without Sec.8.4 (2Chap.9), epipolar geometry;       HZSec.10.1 to Sec.10.6 without Sec.10.3 and Sec.10.4.2 (2Chap.11), computation of the fundamental matrix;       HZSec.A3.2 (2Sec.A4.2), symmetric and skew-symmetric matrices.

Lecture 10. Computations based on two images.
The slides of this lecture.
Ref: HZChap.9 (2Chap.10), 3D reconstruction;       HZChap.11, Sec.11.4 and Sec.11.5 will not be treated in detail (2Chap.12), structure computation;       HZChap.12 (2Chap.13), homography and scene planes;       HZSec.A3.1.2 (2Sec.A4.1.2) Householder matrices.
Lecture 11. Factorization. Auto-Calibration.
The slides of this lecture.
Ref: HZSec.17.1 to Sec.17.3 and Sec.17.5 without Sec.17.2.1 (2Chap.18 with 2Sec.18.3 on non-rigid factorization is only in the second edition), factorization;       HZSec.18.1, Sec.18.2 and Sec.18.5 (2Chap.19), describe our auto-calibration but a little bit differently, so these sections are only for additional reading.

Lecture 12. Stereo vision.
The slides of this lecture.
Ref: A chapter from the draft of R. Szeliski book, stereo correspondence, can also be consulted. The references are a subset from here.

Lecture 13. Motion.
The slides of this lecture.
Ref: A chapter from the draft of R. Szeliski book, dense motion estimation, can also be consulted. The references are in the previous lecture.

Lecture 14. Continuation of lecture 13. What can and what cannot be achieved today in computer vision.



Additional Information

CVonline
Computer Vision Home Page
Computer Vision Industry
OpenCV Reference
MATLAB Processing Toolboxes start at MATLAB On-line.

Grading

Active participation in the course (20%). Homeworks, presentations and projects based on papers distributed in the course (80%).