SOAR

Self-Occluded Avatar Recovery from a Single Video In the Wild

Zhuoyang Pan^1,2,* Angjoo Kanazawa¹ Hang Gao^1,*

¹UC Berkeley ²ShanghaiTech University

^*Denotes Equal Contribution

TLDR: A framework for human reconstruction from partial observations where parts of the body are unobserved.

A Unified Framework

Self-occlusion is common when capturing people in the wild, where the performers do not follow predefined motion scripts. This challenges existing monocular human reconstruction systems that assume full body visibility. We introduce Self-Occluded Avatar Recovery (SOAR), a method for complete human reconstruction from partial observations where parts of the body are entirely unobserved. SOAR leverages structural normal prior and generative diffusion prior to address such an ill-posed reconstruction problem. For structural normal prior, we model human with an reposable surfel model with well-defined and easily readable shapes. For generative diffusion prior, we perform an initial reconstruction and refine it using score distillation. On various benchmarks, we show that SOAR performs favorably than state-of-the-art reconstruction and generation methods, and on-par comparing to concurrent works.

Our Results

Novel View Rendering

We show novel view rendering results for our reconstructed avatars, including RGB, normal, and occlusion.

Animation

We animate our reconstructed avatars in novel poses.

Comparison

We compare our method with GART [1] and GaussianAvatar [2], showing the rendered RGB and normal maps for each. Our approach produces more realistic and detailed results in terms of both texture and structure.

BibTeX

@inproceedings{pan2024soar,
  title     = {SOAR: Self-Occluded Avatar Recovery from a Single Video In the Wild},
  author    = {Pan, Zhuoyang and Kanazawa, Angjoo and Gao, Hang},
  journal   = {arXiv preprint arXiv:2410.23800},
  year      = {2024}
}