We construct a dense multi-view dome to acquire a complex human object interaction dataset, named HODome, that consists of ∼71M frames on 10 subjects interacting with 23 objects. To process the HODome dataset, we develop NeuralDome, a layer-wise neural processing pipeline tailored for multi-view video inputs to conduct accurate tracking, geometry reconstruction and free-view rendering, for both human subjects and objects.
Juze Zhang,
Haimin Luo,
Hongdi Yang,
Xinru Xu,
Qianyang Wu,
Ye Shi,
Jingyi Yu,
Lan Xu,
Jingya Wang