Professor:
These slides, for the most part, are not my own. The majority of this content was created by the James Hays, Derek Hoiem, Svetlana Lazebnik, and Steve Seitz.
Some info is a little old. Although I am working on updating the content and examples, the fundamentals and principles are the same.
Note: Computer Vision is a broad topic! We cannot cover everything :(
Read the assigned book readings, so I can know if they were helpful.
“Through this course, we will study the fundamental concepts of digital image acquisition, manipulation, enhancement, representation, analysis, and understanding.
”

Required: Computer Vision: Algorithms and Applications, 1st edition, Richard Szeliski, Springer, 2010. ISBN-13: 978-1-848-82934-3.
Important Note: use the electronic manuscript available by the author on the book's site http://szeliski.org/Book
We will use a variety of software packages in this course.
All of these are installed on the CSE lab machines and can be installed on your own machines.
Required:
Not necessary, but extremely helpful
What is Computer Vision?
What are examples of computer vision being used in the world?
Make computers understand images and video

Vision is really hard
Vision is an amazing feat of natural intelligence

Why computer vision matters:




The following is a short list of how vision is used today
Technology to convert scanned docs to text

Many new digital cameras now detect faces

“The Smile Shutter flow
”
Imagine a camera smart enough to catch every smile! In Smile Shutter Mode, your Cyber-shot® camera can automatically trip the shutter at just the right instant to catch the perfect expression.




“A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk, you are assured to get paid for it...
”





Sportvision visualizing first down line and other effects




Vision-guided robots position nut runners on wheels


NASA's Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.
Vision Systems (JPL) used for several tasks: Panorama stitching, 3D terrain modeling, Obstacle detection, position tracking.
For more, read "Computer Vision on Mars" by Matthies et al.


Derogatory summary of computer vision: Machine learning applied to visual data
Sub-domains of computer vision include:
We will not have time to touch all that computer vision has to offer
Instead, we will focus on understanding the fundamentals of CV through interesting applications and projects
Fundamentals:
Implement image filtering to separate high and low frequencies
Combine high frequencies and low frequencies from different images to create an image with scale-dependent interpretation

Implement interest point detector, SIFT-like local feature descriptor, and simple matching algorithm

a.k.a., camera motion reconstruction
Locate and track features through a pair of images or a short video to reconstruct the camera's motion
Train a face detector based on positive examples and "mined" hard negatives, detect faces at multiple scales and suppress duplicate detections

Quantize local features into a "vocabulary", describing images as histograms of "visual words", train classifiers to recognize scenes based on these histograms

Quantize human-annotated boundaries into "sketch tokens", train a multi-way classifier to recognize such tokens

Reading, understanding, presenting CV research papers
Group application project
