3DAffordSplat

3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians

Zeming Wei^1*, Junyi Lin^1*, Yang Liu^1,3†, Weixing Chen¹, Jingzhou Luo¹, Guanbin Li^1,2,3, Liang Lin^1,2,3

¹Sun Yat-sen University ²Peng Cheng Laboratory ³Guangdong Key Laboratory of Big Data Analysis and Processing
^*Equal contribution ^†Corresponding Author

Abstract

3D affordance reasoning plays a critical role in associating human instructions with the functional regions of 3D objects, facilitating precise, task-oriented manipulations in embodied AI. However, current methods, which predominantly depend on sparse 3D point clouds, exhibit limited generalizability and robustness due to their sensitivity to coordinate variations and the inherent sparsity of the data. By contrast, 3D Gaussian Splatting (3DGS) delivers high-fidelity, real-time rendering with minimal computational overhead by representing scenes as dense, continuous distributions. This positions 3DGS as a highly effective approach for capturing fine-grained affordance details and improving recognition accuracy. Nevertheless, its full potential remains largely untapped due to the absence of large-scale, 3DGS-specific affordance datasets. To overcome these limitations, we present 3DAffordSplat, the first large-scale, multi-modal dataset tailored for 3DGS-based affordance reasoning. This dataset includes 23k Gaussian instances, 8.3k point cloud instances, and 6.6k manually annotated affordance labels, encompassing 21 object categories and 18 affordance types. Building upon this dataset, we introduce AffordSplatNet, a novel model specifically designed for affordance reasoning using 3DGS representations. AffordSplatNet features an innovative cross-modal structure alignment module that exploits structural consistency priors to align 3D point cloud and 3DGS representations, resulting in enhanced affordance recognition accuracy. Extensive experiments demonstrate that the 3DAffordSplat dataset significantly advances affordance learning within the 3DGS domain, while AffordSplatNet consistently outperforms existing methods across both seen and unseen settings, highlighting its robust generalization capabilities.

BibTeX

@misc{wei20253daffordsplatefficientaffordancereasoning, title={3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians}, author={Zeming wei and Junyi Lin and Yang Liu and Weixing Chen and Jingzhou Luo and Guanbin Li and Liang Lin}, year={2025}, eprint={2504.11218}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2504.11218}, }

3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians

Abstract

Dataset: 3DAffordSplat

Model

Experiments

Results Visualization

Statement And Contact

BibTeX