Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors
Zhengfei Kuang
1
,
Tianyuan Zhang
2
,
Kai Zhang
3
,
Hao Tan
3
,
Sai Bi
3
,
Yiwei Hu
3
,
Zexiang Xu
3
,
Milos Hasan
3
,
Gordon Wetzstein
1
,
Fujun Luan
3
1
Stanford University
2
Massachusetts Institute of Technology
3
Adobe Research
arXiv
Smooth and Consistent Video Depth and Normal Generation without Annotated Video Data.
(This webpage contains a lot of videos. We suggest using Chrome or Edge for the best experience)
Video Depth Results
(Click to see more results)
We compare our model with DepthCrafter (Trained on Video Dataset) and DepthAnything V2 (Our Backbone Model).
Video Normal Results
(Click to see more results)
We compare our model with DSINE and Marigold-E2E-FT.