Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors

Zhengfei Kuang1, Tianyuan Zhang2, Kai Zhang3, Hao Tan3, Sai Bi3, Yiwei Hu3, Zexiang Xu3, Milos Hasan3, Gordon Wetzstein1, Fujun Luan3

1 Stanford University    2 Massachusetts Institute of Technology    3 Adobe Research   

arXiv


Smooth and Consistent Video Depth and Normal Generation without Annotated Video Data.

(This webpage contains many videos. We suggest using Chrome or Edge for the best experience.)

Video Depth Results

We compare our model with DepthCrafter (trained on video datasets) and DepthAnything V2 (our backbone model).

Video Normal Results

We compare our model with DSINE and Marigold-E2E-FT.