Research

1. 3D segmentation from 2D multi-view images

3D segmentation from 2D multi-view images, refers to segmenting the 3D structure of arbitrary objects from multi-view 2D images, enabling the rapid and convenient creation of 3D assets. Its capability facilitates applications such as robotic navigation and embodied intelligence simulations. However, this task is much more difficult than 2D image segmentation, requiring solutions to challenges such as establishing cross-view consistent constraints between pixels. We achieve this task by lifting the Segment Anything Model (SAM) from 2D to 3D with the assistance of radiance fields, such as NeRFs and 3DGS, which are known as efficient representations to connect 2D multi-view images with 3D structures.


Publications

2. Tissue reconstruction from endoscopic surgery videos

Reconstructing deformable tissues from endoscopic videos in robotic surgery is crucial for various clinical applications, such as intraoperative assistance, surgery simulation and training. We propose highly-efficient tissue reconstruction methods based on NeRFs and 3DGS, which not only significantly accelerate the reconstruction process as well as the rendering process (up to real-time), but also maintain or even improve the reconstruction quality across a variety of non-rigid deformations. Our work received Young Scientist Award of MICCAI 2023.


Publications