Experimental study of apparent behavior. Fritz Heider & Marianne Simmel. 1944
“
With Fritz Heider, Simmel co-authored 'An Experimental Study of Apparent Behavior,' which explored the experience of animacy. The study showed that subjects presented with a certain display of inanimate two-dimensional figures are inclined to ascribe intentions to those figures. This result has been taken to establish "the human instinct for storytelling" and to serve as important data in the study of theory of mind.
B.Lucas and T.Kanade. An iterative image registration technique with an application to stereo vision. In Proceedings of the International Joint Conference on Artificial Intelligence. pp. 674–679. 1981.
How to get more equations for a pixel?
Spatial coherence constraint
Assume the pixel's neighbors have the same (u,v)
If we use a 5x5 window, that gives us 25 equations per pixel
0=It(pi)+∇I(pi)⋅[uv]
solving the ambiguity
Least squares problem! (Ax=b, where A: 25x2, x: 2x1, b: 25x1)
The summations are over all pixels in the K×K window
conditions for solvability
Optimal (u,v) satisfies Lucas-Kanade equation
[∑IxIx∑IxIy∑IxIy∑IyIy][uv]=−[∑IxIt∑IyIt]
When is this solvable? i.e., what are good points to track?
ATA should be invertible
ATA should not be too small due to noise
eigenvalues λ1 and λ2 of ATA should not be too small
ATA should be well-conditioned
λ1/λ2 should not be too large (λ1 is larger eigenvalue)
Does this remind you of anything? (criteria of Harris corner detector)
low texture region
∑∇I(∇I)T
gradients have small magnitude
small λ1, small λ2
edge
∑∇I(∇I)T
large gradients, all the same
large λ1, small λ2
high textured region
∑∇I(∇I)T
gradients are different, large magnitudes
large λ1, large λ2
the aperture problems solved
the aperture problems solved
the aperture problems solved
the aperture problems solved
errors in lucas-kanade
A point does not move like its neighbors
Motion segmentation
Brightness constancy does not hold
Do exhaustive neighborhood search with normalized correlation, tracking features, maybe SIFT, more later...
The motion is large (larger than a pixel)
Not-linear: iterative refinement
Local minima: coarse-to-fine estimation
revisiting the small motion assumption
Is this motion small enough?
Probably not—it's much larger than one pixel
How might we solve this problem?
optical flow: aliasing
Temporal aliasing causes ambiguities in optical flow because images can have many pixels with the same intensity. i.e., how do we know which "correspondence" is correct?
nearest match is correct (no aliasing)nearest match is incorrect (aliasing)