Transforming polyhedra into powerful vector representations for 3D shape analysis
Dazhou Yu, Genpei Zhang, Liang Zhao
Department of Computer Science, Emory University
Published as a conference paper at ICLR 2025
PolyhedronNet is a groundbreaking framework designed specifically for learning representations of 3D polyhedron objects. As the first dedicated solution for polyhedron representation learning, PolyhedronNet addresses the unique challenges of working with these complex geometric structures.
A polyhedron (plural: polyhedra) is a 3D geometric shape with flat polygonal faces, straight edges, and sharp vertices. Common examples include cubes, pyramids, and more complex shapes like dodecahedrons. Polyhedra are fundamental building blocks in 3D modeling, computer graphics, architecture, and many scientific disciplines.
Our innovative approach transforms polyhedra into comprehensive vector representations that capture both structural and geometric properties, enabling advanced analysis, classification, and retrieval of 3D shapes with unprecedented accuracy.
Ubiquitous geometric objects can be precisely and efficiently represented as polyhedra. The transformation of a polyhedron into a vector, known as polyhedra representation learning, is crucial for manipulating these shapes with mathematical and statistical tools for tasks like classification, clustering, and generation.
This study proposes PolyhedronNet, a general framework tailored for learning representations of 3D polyhedral objects. We introduce the concept of the surface-attributed graph to model vertices, edges, faces, and their geometric interrelationships within a polyhedron, achieving rotation and translation invariance while preserving discriminative power.
Our experimental evaluations on four distinct datasets (MNIST-C, Building, ShapeNet-P, and ModelNet-P), encompassing both 3D polyhedral object classification and 3D shape retrieval tasks, substantiate PolyhedronNet's efficacy in capturing comprehensive and informative representations of 3D polyhedral objects. Code and data are available at github.com/dyu62/3D_polyhedron.
Our innovative approach to polyhedron representation learning
Figure: The PolyhedronNet framework for polyhedron representation learning consists of three main components:
A novel graph representation that captures vertices, edges, faces, and their relationships in a polyhedron through face-hyperedges, enabling comprehensive polyhedral graph transformation.
A five-tuple geometric representation that preserves structural information while achieving rotation and translation invariance in 3D polyhedra, crucial for consistent polyhedron representation learning.
A specialized graph neural network for 3D polyhedra that aggregates local representations through intra-face and inter-face message passing, enabling effective polyhedron representation learning.
PolyhedronNet introduces several key innovations in the field of polyhedron representation learning:
Benchmark performance on polyhedron classification and retrieval tasks
3D digit polyhedra with color-coded faces for rotation invariance testing in polyhedron classification
3D building polyhedra from OpenStreetMap for architectural polyhedron shape analysis
15 object categories from ShapeNetCore converted to polyhedra for diverse 3D object classification
14 object categories from ModelNet40 converted to polyhedra for 3D shape retrieval evaluation
PolyhedronNet outperforms all comparison methods in 3D polyhedron classification:
Open-source implementation for polyhedron representation learning
Our implementation of PolyhedronNet is available on GitHub, including code, documentation, and instructions for reproducing our experiments on 3D polyhedron classification and polyhedron shape retrieval.
github.com/dyu62/3D_polyhedronAccess to MNIST-C, Building, ShapeNet-P, and ModelNet-P datasets for 3D polyhedra research and benchmarking.
Download Polyhedra DatasetsAccess our paper on PolyhedronNet published at ICLR 2025 and supplementary materials on polyhedron representation learning.
Download PolyhedronNet PaperPolyhedronNet is a novel framework for learning representations of 3D polyhedral objects. It introduces the concept of surface-attributed graphs to model vertices, edges, faces, and their geometric interrelationships within a polyhedron.
PolyhedronNet represents polyhedra using a three-step process: 1) Transforming a polyhedron into a Surface-attributed Graph (SAG), 2) Computing Local Rigid Representation with geometric relationships, and 3) Using PolyhedronGNN for learning representations through message passing.
Polyhedron representation learning is crucial for manipulating 3D shapes with mathematical and statistical tools for tasks like 3D object classification, clustering, retrieval, and generation. It has applications in computer graphics, CAD, architecture, and 3D modeling.
PolyhedronNet outperforms all comparison methods in 3D polyhedron classification, achieving 85.8% accuracy on MNIST-C, 98.0% accuracy on Building dataset, 62.7% accuracy on ShapeNet-P, and 43.5% accuracy on ModelNet-P.
Polyhedron representation specifically focuses on 3D shapes with flat polygonal faces, straight edges, and sharp vertices. Unlike point clouds or voxels, polyhedra preserve exact geometric information and topological structure, making them ideal for precise 3D shape analysis and manipulation.