Introduction to Course

COS 351 - Computer Vision

Who am I?

Professor:

Disclaimer



These slides, for the most part, are not my own. The majority of this content was created by the James Hays, Derek Hoiem, Svetlana Lazebnik, and Steve Seitz.

Some info is a little old. Although I am working on updating the content and examples, the fundamentals and principles are the same.

Note: Computer Vision is a broad topic! We cannot cover everything :(

help!

At the end of the term, you will fill out an evaluation. I need you to be critical, letting me know what worked and what did not.

Read the assigned book readings, so I can know if they were helpful.

learning

This term will be a time of learning for you as well as for me.

Syllabus



Through this course, we will study the fundamental concepts of digital image acquisition, manipulation, enhancement, representation, analysis, and understanding.

Book

Required: Computer Vision: Algorithms and Applications, 1st edition, Richard Szeliski, Springer, 2010. ISBN-13: 978-1-848-82934-3.

Important Note: use the electronic manuscript available by the author on the book's site http://szeliski.org/Book

Software

We will use a variety of software packages in this course.

All of these are installed on the CSE lab machines and can be installed on your own machines.

prerequisites

Required:

Not necessary, but extremely helpful

computer vision

What is Computer Vision?

What are examples of computer vision being used in the world?

computer vision

Make computers understand images and video

computer vision

Vision is really hard

Vision is an amazing feat of natural intelligence

is that a queen or a bishop?

computer vision

Why computer vision matters:

safety
health
security
comfort
fun
access

ridiculously brief history of cv

Guzman 68
Ohta Kanade 78
Turk and Pentland 91

ridiculously brief history of cv

modern cv





The following is a short list of how vision is used today

modern cv: optical character recognition

Technology to convert scanned docs to text

Digit recognition, AT&T Labs
License plate readers

modern cv: face detection

Many new digital cameras now detect faces

modern cv: smile detection

The Smile Shutter flow
Imagine a camera smart enough to catch every smile! In Smile Shutter Mode, your Cyber-shot® camera can automatically trip the shutter at just the right instant to catch the perfect expression.

modern cv: 3d from thousands of images

modern cv: vision-based biometrics

modern cv: login without a password

modern cv: recognition in supermarkets

A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk, you are assured to get paid for it...

modern cv: recognition in mobile phones

Point & Find, Search for pictures with Google Goggles

modern cv: recognition in mobile phones

Word Lens

modern cv: visual effects, shape capture



The Matrix movies, ESC Entertainment, XYZRGB, NRC ]

modern cv: visual effects, motion capture

Pirates of the Carribean, Industrial Light and Magic ]

modern cv: visual effects, motion capture

modern cv: sports



Sportvision visualizing first down line and other effects

modern cv: smart cars

Mobileye

modern cv: smart cars

modern cv: smart cars, google cars

modern cv: interactive games, kinect

modern cv: industrial robots

Vision-guided robots position nut runners on wheels

modern cv: vision in space

NASA's Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.

Vision Systems (JPL) used for several tasks: Panorama stitching, 3D terrain modeling, Obstacle detection, position tracking.

For more, read "Computer Vision on Mars" by Matthies et al.

modern cv: mobile robots

NASA's Mars Spirit Rover
Robocup
STAIR

modern cv: medical imaging



3D imaging MRI, CT
Image guided surgery

state of the art today?

With enough training data, computer vision nearly matches human vision at most recognition tasks.

computer vision and nearby fields


Derogatory summary of computer vision: Machine learning applied to visual data

sub-domains of computer vision

Sub-domains of computer vision include:

Course details

We will not have time to touch all that computer vision has to offer

Instead, we will focus on understanding the fundamentals of CV through interesting applications and projects

Fundamentals:

tentative projects

Project: Image filtering and hybrid images

Implement image filtering to separate high and low frequencies

Combine high frequencies and low frequencies from different images to create an image with scale-dependent interpretation

project: local feature matching

Implement interest point detector, SIFT-like local feature descriptor, and simple matching algorithm

project: estimating fundamental matrices

a.k.a., camera motion reconstruction

Locate and track features through a pair of images or a short video to reconstruct the camera's motion

project: object detection with sliding window

Train a face detector based on positive examples and "mined" hard negatives, detect faces at multiple scales and suppress duplicate detections

project: scene recognition with bag of words

Quantize local features into a "vocabulary", describing images as histograms of "visual words", train classifiers to recognize scenes based on these histograms

project: boundary detection

Quantize human-annotated boundaries into "sketch tokens", train a multi-way classifier to recognize such tokens

other assignments

Reading, understanding, presenting CV research papers

Group application project

loading...