“3QNet: 3D Point Cloud Geometry Quantization Compression Network” by Huang, Zhang, Chen, Ding, Tai, et al. …
Conference:
Type(s):
Title:
- 3QNet: 3D Point Cloud Geometry Quantization Compression Network
Session/Category Title:
- Distances and Matching
Presenter(s)/Author(s):
Abstract:
Since the development of 3D applications, the point cloud, as a spatial description easily acquired by sensors, has been widely used in multiple areas such as SLAM and 3D reconstruction. Point Cloud Compression (PCC) has also attracted more attention as a primary step before point cloud transferring and saving, where the geometry compression is an important component of PCC to compress the points geometrical structures. However, existing non-learning-based geometry compression methods are often limited by manually pre-defined compression rules. Though learning-based compression methods can significantly improve the algorithm performances by learning compression rules from data, they still have some defects. Voxel-based compression networks introduce precision errors due to the voxelized operations, while point-based methods may have relatively weak robustness and are mainly designed for sparse point clouds. In this work, we propose a novel learning-based point cloud compression framework named 3D Point Cloud Geometry Quantiation Compression Network (3QNet), which overcomes the robustness limitation of existing point-based methods and can handle dense points. By learning a codebook including common structural features from simple and sparse shapes, 3QNet can efficiently deal with multiple kinds of point clouds. According to experiments on object models, indoor scenes, and outdoor scans, 3QNet can achieve better compression performances than many representative methods.
References:
1. Panos Achlioptas, Olga Diamanti, Ioannis Mitliagkas, and Leonidas Guibas. 2018. Learning representations and generative models for 3d point clouds. In International conference on machine learning. PMLR, 40–49.
2. Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, and Jurgen Gall. 2019. Semantickitti: A dataset for semantic scene understanding of lidar sequences. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 9297–9307.
3. Sourav Biswas, Jerry Liu, Kelvin Wong, Shenlong Wang, and Raquel Urtasun. 2020. Muscle: Multi sweep compression of lidar using deep entropy models. Advances in Neural Information Processing Systems 33 (2020), 22170–22181.
4. Gisle Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. VCEG-M33 (2001).
5. Cesar Cadena, Luca Carlone, Henry Carrillo, Yasir Latif, Davide Scaramuzza, José Neira, Ian Reid, and John J Leonard. 2016. Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE Transactions on robotics 32, 6 (2016), 1309–1332.
6. Chao Cao, Marius Preda, and Titus Zaharia. 2019. 3D point cloud compression: A survey. In The 24th International Conference on 3D Web Technology. 1–9.
7. Angela Dai, Angel X Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, and Matthias Nießner. 2017a. Scannet: Richly-annotated 3d reconstructions of indoor scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5828–5839.
8. Angela Dai, Charles Ruizhongtai Qi, and Matthias Nießner. 2017b. Shape completion using 3d-encoder-predictor cnns and shape synthesis. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5868–5877.
9. Konstantinos G Derpanis. 2010. Overview of the RANSAC Algorithm. Image Rochester NY 4, 1 (2010), 2–3.
10. Haoqiang Fan, Hao Su, and Leonidas J Guibas. 2017. A point set generation network for 3d object reconstruction from a single image. In Proceedings of the IEEE conference on computer vision and pattern recognition. 605–613.
11. Martin A Fischler and Robert C Bolles. 1981. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 6 (1981), 381–395.
12. Frank Galligan, Michael Hemmer, Ondrej Stava, Fan Zhang, and Jamieson Brettle. 2018. Google/Draco: a library for compressing and decompressing 3D geometric meshes and point clouds.
13. D Graziosi, O Nakagami, S Kuma, A Zaghetto, T Suzuki, and A Tabatabai. 2020. An overview of ongoing point cloud compression standardization activities: Video-based (V-PCC) and geometry-based (G-PCC). APSIPA Transactions on Signal and Information Processing 9 (2020).
14. Amos Gropp, Lior Yariv, Niv Haim, Matan Atzmon, and Yaron Lipman. 2020. Implicit geometric regularization for learning shapes. arXiv preprint arXiv:2002.10099 (2020).
15. Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xiangyang Xue, and Yanwei Fu. 2022. Density-preserving Deep Point Cloud Compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
16. Lila Huang, Shenlong Wang, Kelvin Wong, Jerry Liu, and Raquel Urtasun. 2020. Oct-squeeze: Octree-structured entropy model for lidar compression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1313–1323.
17. Tianxin Huang and Yong Liu. 2019. 3d point cloud geometry compression on deep learning. In Proceedings of the 27th ACM international conference on multimedia. 890–898.
18. Xin Kong, Guangyao Zhai, Baoquan Zhong, and Yong Liu. 2019. Pass3d: Precise and accelerated semantic segmentation for 3d point cloud. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 3467–3473.
19. Andreas Kopecki, Uwe WOSSNER, Dimitris Mavrikios, Loukas Rentzos, Christian Weidig, Lionel Roucoules, Okung-Dike Ntofon, Martin Reed, Georges Dumont, Daniel BUNDGENS, et al. 2011. Visionair vision advanced infrastructure for research. (2011).
20. Shitong Luo and Wei Hu. 2021. Diffusion probabilistic models for 3d point cloud generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2837–2845.
21. Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021. Learning-based lossless compression of 3d point cloud geometry. In ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4220–4224.
22. En Yen Puang, Hao Zhang, Hongyuan Zhu, and Wei Jing. 2022. Hierarchical Point Cloud Encoding and Decoding With Lightweight Self-Attention Based Model. IEEE Robotics and Automation Letters 7, 2 (2022), 4542–4549.
23. Charles R Qi, Wei Liu, Chenxia Wu, Hao Su, and Leonidas J Guibas. 2018. Frustum pointnets for 3d object detection from rgb-d data. In Proceedings of the IEEE conference on computer vision and pattern recognition. 918–927.
24. Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017a. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 652–660.
25. Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017b. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In Advances in neural information processing systems. 5099–5108.
26. Maurice Quach, Giuseppe Valenzise, and Frederic Dufaux. 2020. Improved deep point cloud geometry compression. In 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP). IEEE, 1–6.
27. Zizheng Que, Guo Lu, and Dong Xu. 2021. Voxelcontext-net: An octree based framework for point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6042–6051.
28. Jarek Rossignac. 1999. Edgebreaker: Connectivity compression for triangle meshes. IEEE transactions on visualization and computer graphics 5, 1 (1999), 47–61.
29. Radu Bogdan Rusu and Steve Cousins. 2011. 3d is here: Point cloud library (pcl). In 2011 IEEE international conference on robotics and automation. IEEE, 1–4.
30. Shaoshuai Shi, Xiaogang Wang, and Hongsheng Li. 2019. Pointrcnn: 3d object proposal generation and detection from point cloud. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 770–779.
31. Maxim Tatarchenko, Alexey Dosovitskiy, and Thomas Brox. 2017. Octree generating networks: Efficient convolutional architectures for high-resolution 3d outputs. In Proceedings of the IEEE international conference on computer vision. 2088–2096.
32. Gabriel Taubin and Jarek Rossignac. 1998. Geometric compression through topological surgery. ACM Transactions on Graphics (TOG) 17, 2 (1998), 84–115.
33. Dorina Thanou, Philip A Chou, and Pascal Frossard. 2016. Graph-based compression of dynamic 3D point cloud sequences. IEEE Transactions on Image Processing 25, 4 (2016), 1765–1778.
34. Dong Tian, Hideaki Ochimizu, Chen Feng, Robert Cohen, and Anthony Vetro. 2017. Geometric distortion metrics for point cloud compression. In 2017 IEEE International Conference on Image Processing (ICIP). IEEE, 3460–3464.
35. Costa Touma and Craig Gotsman. 1998. Triangle mesh compression. In Proceedings-Graphics Interface. Canadian Information Processing Society, 26–34.
36. Jianqiang Wang, Dandan Ding, Zhu Li, and Zhan Ma. 2021a. Multiscale point cloud geometry compression. In 2021 Data Compression Conference (DCC). IEEE, 73–82.
37. Jianqiang Wang, Hao Zhu, Haojie Liu, and Zhan Ma. 2021b. Lossy point cloud geometry compression via end-to-end learning. IEEE Transactions on Circuits and Systems for Video Technology 31, 12 (2021), 4909–4923.
38. Xuanzheng Wen, Xu Wang, Junhui Hou, Lin Ma, Yu Zhou, and Jianmin Jiang. 2020. Lossy geometry compression of 3d point cloud data via an adaptive octree-guided network. In 2020 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1–6.
39. Louis Wiesmann, Andres Milioto, Xieyuanli Chen, Cyrill Stachniss, and Jens Behley. 2021. Deep compression for dense point cloud maps. IEEE Robotics and Automation Letters 6, 2 (2021), 2060–2067.
40. Ian H Witten, Radford M Neal, and John G Cleary. 1987. Arithmetic coding for data compression. Commun. ACM 30, 6 (1987), 520–540.
41. Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1912–1920.
42. Yaoqing Yang, Chen Feng, Yiru Shen, and Dong Tian. 2018. Foldingnet: Point cloud auto-encoder via deep grid deformation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 206–215.
43. Lequan Yu, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, and Pheng-Ann Heng. 2018. Pu-net: Point cloud upsampling network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2790–2799.


