“Understanding and Exploiting Object Interaction Landscapes” by Pirk, Krs, Hu, Rajasekaran, Benes, et al. …
Conference:
Type(s):
Title:
- Understanding and Exploiting Object Interaction Landscapes
Session/Category Title: Comparing 3D Shapes and Part
Presenter(s)/Author(s):
- Soren Pirk
- Vojtech Krs
- Kaimo Hu
- Suren Deepak Rajasekaran
- Bedrich Benes
- Yusuke Yoshiyasu
- Leonidas (Leo) J. Guibas
Moderator(s):
Abstract:
Interactions play a key role in understanding objects and scenes for both virtual and real-world agents. We introduce a new general representation for proximal interactions among physical objects that is agnostic to the type of objects or interaction involved. The representation is based on tracking particles on one of the participating objects and then observing them with sensors appropriately placed in the interaction volume or on the interaction surfaces. We show how to factorize these interaction descriptors and project them into a particular participating object so as to obtain a new functional descriptor for that object, its interaction landscape, capturing its observed use in a spatiotemporal framework. Interaction landscapes are independent of the particular interaction and capture subtle dynamic effects in how objects move and behave when in functional use. Our method relates objects based on their function, establishes correspondences between shapes based on functional key points and regions, and retrieves peer and partner objects with respect to an interaction.
References:
1. Adobe. 2015. Mixamo. Retrieved from https://www.mixamo.com/.Google Scholar
2. R. Ali Al-Asqhar, T. Komura, and M. Geol Choi. 2013. Relationship descriptors for interactive motion adaptation. In Proceedings of ACM SIGGRAPH/Eurographics Symposium on Computer Animation (SCA’13). ACM, 45–53. Google ScholarDigital Library
3. N. Amenta, M. Bern, and M. Kamvysselis. 1998. A new Voronoi-based surface reconstruction algorithm. In Proceedings of SIGGRAPH. ACM, New York, 415–421. Google ScholarDigital Library
4. D. Anguelov, P. Srinivasan, D. Koller, S. Thrun, J. Rodgers, and J. Davis. 2005. SCAPE: Shape completion and animation of people. In Proceedings of SIGGRAPH. Google ScholarDigital Library
5. F. G. Ashby. 1992. Multidimensional Models of Perception and Cognition. L. Erlbaum. https://books.google.com/books?id=eeQh4ESeOfsCGoogle Scholar
6. E. Bar-Aviv and E. Rivlin. 2006. Functional 3D object classification using simulation of embodied agent. In BMVC. British Machine Vision Association, 307–316.Google Scholar
7. I. Baran and J. Popović. 2007. Automatic rigging and animation of 3D characters. ACM Trans. Graph. 26, 3 (July 2007), 72:1–72:8. Google ScholarDigital Library
8. S. Biasotti, A. Cerri, A. Bronstein, and M. Bronstein. 2015. Recent trends, applications, and perspectives in 3D shape similarity assessment. Comp. Graph. Forum (2015). Google ScholarDigital Library
9. A. W. Black and P. Taylor. 1997. Automatically clustering similar units for unit selection in speech synthesis. In Eurospeech97. 601–604.Google Scholar
10. M. Caine. 1994. The design of shape interactions using motion constraints. In Proceedings of the 1994 IEEE International Conference on Robotics and Automation, Vol. 1. 366–371.Google ScholarCross Ref
11. A. X. Chang, T. Funkhouser, L. Guibas, P. Hanrahan, Q. Huang, Z. Li, S. Savarese, M. Savva, S. Song, H. Su, J. Xiao, L. Yi, and F. Yu. 2015. ShapeNet: An information-rich 3D model repository. ArXiv e-prints (Dec. 2015).Google Scholar
12. Y.-W. Chao, Z. Wang, Y. He, J. Wang, and J. Deng. 2015. HICO: A benchmark for recognizing human-object interactions in images. ICCV (2015). Google ScholarDigital Library
13. D.-Y. Chen, X.-P. Tian, Y.-T. Shen, and M. Ouhyoung. 2003. On visual similarity based 3D model retrieval. Comp. Graph. Forum 22, 3 (2003), 223–232.Google ScholarCross Ref
14. J. Chen, X. Ge, L.-Y. Wei, B. Wang, Y. Wang, H. Wang, Y. Fei, K.-L. Qian, J.-H. Yong, and W. Wang. 2013. Bilateral blue noise sampling. ACM Trans. Graph. 32, 6, Article 216 (Nov. 2013), 11 pages. Google ScholarDigital Library
15. S. Durrleman. 2010. Statistical Models of Currents for Measuring the Variability of Anatomical Curves, Surfaces and Their Evolution. Ph.D. dissertation. Université Nice – Sophia Antipolis, France.Google Scholar
16. M. Fisher, M. Savva, Y. Li, P. Hanrahan, and M. Niessner. 2015. Activity-centric scene synthesis for functional 3D scene modeling. ACM Trans. Graph. 34, 6, Article 179 (Oct. 2015), 13 pages. Google ScholarDigital Library
17. R. Gal and D. Cohen-Or. 2006. Salient geometric features for partial shape matching and similarity. ACM Trans. Graph. 25, 1 (2006), 130–150. Google ScholarDigital Library
18. H. Grabner, J. Gall, and L. Van Gool. 2011. What makes a chair a chair?. In CVPR. 1529–1536. Google ScholarDigital Library
19. A. Gupta, A. Kembhavi, and L. S. Davis. 2009. Observing human-object interactions: Using spatial and functional compatibility for recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31, 10 (Oct. 2009), 1775–1789. Google ScholarDigital Library
20. E. S. L. Ho, T. Komura, and C.-L. Tai. 2010. Spatial relationship preserving character motion adaptation. ACM Trans. Graph. 29, 4, Article 33 (2010), 8 pages. Google ScholarDigital Library
21. R. Hu, O. van Kaick, B. Wu, H. Huang, A. Shamir, and H. Zhang. 2016. Learning how objects function via co-analysis of interactions. ACM Trans. Graph. 35, 4, Article 47 (July 2016), 13 pages. Google ScholarDigital Library
22. R. Hu, C. Zhu, O. van Kaick, L. Liu, A. Shamir, and H. Zhang. 2015. Interaction context (ICON): Towards a geometric functionality descriptor. ACM Trans. Graph. 34, 4, Article 83 (2015), 12 pages. Google ScholarDigital Library
23. H. Jiang and D. R. Martin. 2008. Finding actions using shape flows. In Computer Vision ECCV 2008, David Forsyth, Philip Torr, and Andrew Zisserman (Eds.). Lecture Notes in Computer Science, Vol. 5303. Springer, Berlin, 278–292. Google ScholarDigital Library
24. Germany Karlsruhe Institute of Technology, Karlsruhe. 2015. KIT Whole Human Motion Database. Retrieved from https://motion-database.humanoids.kit.edu/.Google Scholar
25. V. G. Kim, S. Chaudhuri, L. Guibas, and T. Funkhouser. 2014. Shape2Pose: Human-centric shape analysis. ACM Trans. Graph. 33, 4, Article 120 (2014), 12 pages. Google ScholarDigital Library
26. H. Laga, M. Mortara, and M. Spagnuolo. 2013. Geometry and context for semantic correspondences and functionality recognition in man-made 3D shapes. ACM Trans. Graph. 32, 5, Article 150 (2013), 16 pages. Google ScholarDigital Library
27. C. H. Lee, A. Varshney, and D. W. Jacobs. 2005. Mesh saliency. ACM Trans. Graph. 24, 3 (July 2005), 659–666. Google ScholarDigital Library
28. M. Leordeanu and M. Hebert. 2005. A spectral technique for correspondence problems using pairwise constraints. In Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV’05). Vol. 2. 1482–1489. Google ScholarDigital Library
29. P. Li, B. Wang, F. Sun, X. Guo, C. Zhang, and W. Wang. 2015. Q-MAT: Computing medial axis transform by quadratic error minimization. ACM Trans. Graph. 35, 1, Article 8 (Dec. 2015), 16 pages. Google ScholarDigital Library
30. Y. Li, J. L. Fu, and N. S. Pollard. 2007. Data-driven grasp synthesis using shape matching and task-based pruning. IEEE Trans. Vis. Comput. Graph. 13, 4 (July 2007), 732–747. Google ScholarDigital Library
31. Z. Liu, C. Xie, S. Bu, X. Wang, J. Han, H. Lin, and H. Zhang. 2015. Indirect shape analysis for 3D shape retrieval. Comput. Graphics 46 (2015), 110–116. Shape Modeling International 2014. Google ScholarDigital Library
32. N. Mitra, M. Wand, H. (Richard) Zhang, D. Cohen-Or, V. Kim, and Q.-X. Huang. 2013b. Structure-aware shape processing. In SIGGRAPH Asia 2013 Courses (SA’13). ACM, New York, Article 1, 20 pages. Google ScholarDigital Library
33. N. J. Mitra, M. Pauly, M. Wand, and D. Ceylan. 2013a. Symmetry in 3D geometry: Extraction and applications. Comput. Graph. Forum 32, 6 (2013), 1–23. Google ScholarDigital Library
34. J. J. Monaghan. 1992. Smoothed particle hydrodynamics. Ann. Rev. Astron. Astrophys. 30 (1992), 543–574.Google Scholar
35. O. Oreifej and Z. Liu. 2013. HON4D: Histogram of oriented 4D normals for activity recognition from depth sequences. In 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’13). 716–723. Google ScholarDigital Library
36. R. Osada, T. Funkhouser, B. Chazelle, and D. Dobkin. 2002. Shape distributions. ACM Trans. Graph. 21, 4 (Oct. 2002), 807–832. Google ScholarDigital Library
37. M. Ovsjanikov, M. Ben-Chen, J. Solomon, A. Butscher, and L. Guibas. 2012. Functional maps: A flexible representation of maps between shapes. ACM Trans. Graph. 31, 4 (2012). Google ScholarDigital Library
38. M. Pechuk, O. Soldea, and E. Rivlin. 2008. Learning function-based object classification from 3D imagery. Comput. Vis. Image Underst. 110, 2 (May 2008), 173–191. Google ScholarDigital Library
39. E. Rivlin, S. J. Dickinson, and A, Rosenfeld. 1994. Recognition by functional parts {function-based object recognition}. In Proceedings of the 1994 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’94). 267–274.Google ScholarCross Ref
40. Columbia University USA Robotics Lab, Computer Science Laboratory. 2012. GraspIt! (2012). http://www.cs.columbia.edu/ cmatei/graspit/.Google Scholar
41. Y. Rubner, C. Tomasi, and L. J. Guibas. 1998. A metric for distributions with applications to image databases. In Proceedings of the 6th International Conference on Computer Vision, 1998. 59–66. Google ScholarDigital Library
42. M. Savva, A. X. Chang, P. Hanrahan, M. Fisher, and M. Nießner. 2014. SceneGrok: Inferring action maps in 3D environments. ACM Trans. Graph. 33, 6, Article 212 (Nov. 2014), 10 pages. Google ScholarDigital Library
43. M. Savva, A. X. Chang, P. Hanrahan, M. Fisher, and M. Nießner. 2016. PiGraphs: Learning interaction snapshots from observations. ACM Trans. Graph. 35, 4, Article 139 (July 2016), 12 pages. Google ScholarDigital Library
44. P. Shilane and T. Funkhouser. 2007. Distinctive regions of 3D surfaces. ACM Trans. Graph. 26, 2, Article 7 (June 2007). Google ScholarDigital Library
45. O. Sidi, O. van Kaick, Y. Kleiman, H. Zhang, and D. Cohen-Or. 2011. Unsupervised co-segmentation of a set of shapes via descriptor-space spectral clustering. ACM Trans. Graph. 30, 6, Article 126 (2011), 10 pages. Google ScholarDigital Library
46. H. O. Song, M. Fritz, C. Gu, and T. Darrell. 2011. Visual grasp affordances from appearance-based cues. In Proceedings of the 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops). 998–1005.Google Scholar
47. M. Sutton, L. Stark, and K. Bowyer. 1994. GRUFF-3: Generalizing the domain of a function-based recognition system. Pattern Recog. 27, 12 (1994), 1743–1766.Google ScholarCross Ref
48. A. Tevs, Q. Huang, M. Wand, H.-P. Seidel, and L. Guibas. 2014. Relating shapes via geometric symmetries and regularities. ACM Trans. Graph. 33, 4, Article 119 (July 2014), 12 pages. Google ScholarDigital Library
49. A. Tversky. 1977. Features of similarity. Psych. Rev. 84 (1977), 327–352.Google Scholar
50. D. Tzionas and J. Gall. 2015. 3D object reconstruction from hand-object interactions. In Proceedings of the International Conference on Computer Vision (ICCV). http://files.is.tue.mpg.de/dtzionas/In-Hand-Scanning Google ScholarDigital Library
51. Z. Wu, S. Song, A. Khosla, X. Tang, and J. Xiao. 2014. 3D shapenets for 2.5d object recognition and next-best-view prediction. CoRR abs/1406.5670 (2014). http://arxiv.org/abs/1406.5670Google Scholar
52. L. Xu, H. Quynh Dinh, P. Mordohai, and T. Ramsay. 2011. Detecting patterns in vector fields. 49th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition 6 (2011).Google ScholarCross Ref
53. X. Zhao, H. Wang, and T. Komura. 2014. Indexing 3D scenes using the interaction bisector surface. ACM Trans. Graph. 33, 3, Article 22 (2014), 14 pages. Google ScholarDigital Library
54. Y. Zheng, D. Cohen-Or, and N. J. Mitra. 2013. Smart variations: Functional substructures for part compatibility. Comp. Graph. Forum 32, 2 pt2 (2013), 195–204.Google ScholarCross Ref
55. Y. Zhu, A. Fathi, and L. Fei-Fei. 2014. Reasoning about object affordances in a knowledge base representation. In Computer Vision ECCV 2014, D. Fleet, T. Pajdla, B. Schiele, and T. Tuytelaars (Eds.). Lecture Notes in Computer Science, Vol. 8690. Springer International Publishing, 408–424.Google Scholar