Sparse Representation and Construction for High-Resolution 3D Shapes Modeling
(Sparc3D)

Zhihao Li 1,2,   Yufei Wang 1,   Heliang Zheng 2,   Yihao Luo 2,3,   Bihan Wen 1

1 Nanyang Technological University

,  

2 Math Magic

,  

3 Imperial College London


Project Lead

,  

Corresponding Authors

*Code may be released upon the approval of Math Magic.

Abstract

High-fidelity 3D object synthesis remains significantly more challenging than 2D image generation due to the unstructured nature of mesh data and the cubic complexity of dense volumetric grids. Existing two-stage pipelines—compressing meshes with a VAE (using either 2D or 3D supervision), followed by latent diffusion sampling—often suffer from severe detail loss caused by inefficient representations and modality mismatches introduced in VAE. We introduce Sparc3D, a unified framework that combines a sparse deformable marching cubes representation Sparcubes with a novel encoder Sparconv-VAE. Sparcubes converts raw meshes into high-resolution (10243) surfaces with arbitrary topology by scattering signed distance and deformation fields onto a sparse cube, allowing differentiable optimization. Sparconv-VAE is the first modality-consistent variational autoencoder built entirely upon sparse convolutional networks, enabling efficient and near-lossless 3D reconstruction suitable for high-resolution generative modeling through latent diffusion. Sparc3D achieves state-of-the-art reconstruction fidelity on challenging inputs, including open surfaces, disconnected components, and intricate geometry. It preserves fine-grained shape details, reduces training and inference cost, and integrates naturally with latent diffusion models for scalable, high-resolution 3D generation.

VAE Reconstruction Results

Hold Z and hover over the video to zoom in.

GT Mesh

Load mesh

Ours (1024)

Load mesh

Ours (512)

Load mesh

Trellis

Load mesh
Load more samples

Image-to-3D Generation Results (Video)

Method Overview (Sparcubes)

Strategy

Illustration of our Sparcubes reconstruction pipeline for converting a raw mesh into a watertight mesh.

Method Overview (Sparconv-VAE)

Strategy

Our Sparconv-VAE comprises a sequence of sparse convolutional blocks, integrated with a lightweight local attention module adapted from Point Transformer V3.