Stereo Matching

COS 351 - Computer Vision

Fundamental Matrix

Let \(p\) be a point in left image, \(p'\) in right image

Epipolar relation

  • \(p\) maps to epipolar line \(l'\)
  • \(p'\) maps to epipolar line \(l\)

Epipolar mapping described by a 3x3 matrix \(F\)

\[l' = Fp \quad\quad l = p'F\]

It follows that

\[p'Fp = 0\]

Fundamental Matrix

This matrix \(F\) is called

Can solve for \(F\) from point correspondences

stereo image rectification

  • Reproject image planes onto a common plane parallel to the line between camera centers
  • Pixel motion is horizontal after this transformation
  • Two homographies (3x3 transform), one for each input image reprojection
  • C.Loop and Z.Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999. link

rectification example

correspondence problem




Epipolar geometry constrains our search, but we still have a difficult correspondence problem.

correspondence problem

Multiple match hypotheses satisfy epipolar constraint, but which is correct?

[ Figure: Gee & Cipolla 1999 ]

correspondence problem

Beyond the hard constraint of epipolar geometry, there are "soft" constraints to help identify corresponding points


To find matches in the image pair, we will assume

dense correspondence search

[ adapted: Li Zhang ]

corr. search with similarity constraint

corr. search with similarity constraint

corr. search with similarity constraint

correspondence problem

Clear correspondence between intensities, but also noise and ambiguity

[ source: Andrew Zisserman ]

correspondence problem

Neighborhoods of corresponding points are similar in intensity patterns

[ source: Andrew Zisserman ]

correspondence problem

[ source: Andrew Zisserman ]

correspondence problem

[ source: Andrew Zisserman ]

correspondence problem

[ source: Andrew Zisserman ]

correspondence problem

[ source: Andrew Zisserman ]

correspondence problem

[ source: Andrew Zisserman ]

effect of window size

[ source: Andrew Zisserman ]

effect of window size

left: \(W=3\); right: \(W=20\)

Want window large enough to have sufficient intensity variation, yet small enough to contain only pixels with about the same disparity.

[ figures: Li Zhang ]

results with window search

window-based matching (best window size)
ground truth

better solutions

scanline stereo

coherent stereo on 2D grid

scanline dp optimization
ground truth

stereo as energy minimization

stereo as energy minimization

\[\begin{array}{cc} E = \alpha E_\text{data}(I_1,I_2,D)+ \beta E_\text{smooth}(D) & \begin{array}{c} E_\text{data} = \sum_i \bigl(W_1(i) - W_2(i+D(i))\bigr)^2 \\ E_\text{smooth} = \sum_{\mathrm{neighbors}\ i,j} \rho\bigl(D(i) - D(j)\bigr) \end{array}\end{array}\]

Energy functions of this form can be minimized using graph cuts

Y.Boykov, O.Veksler, and R.Zabih. Fast Approximate Energy Minimization via Graph Cuts. PAMI 2001. link

[ source: Steve Seitz ]

stereo as energy minimization

graph cuts
ground truth

Better results!

For the latest and greatest: http://vision.middlebury.edu/stereo/

challenges

active stereo with structured light

Project "structured" light patterns onto the object

  • Simplifies the correspondence problem
  • Allows us to use only one camera

L.Zhang, B.Curless, and S.M.Seitz. Rapid Shape Acquisition Using Color Structured Light and Multi-pass Dynamic Programming. 3DPVT 2002. link

Kinect: Structured infared light

https://bbzippo.wordpress.com/2010/11/28/kinect-in-infrared/

summary

loading...