“Stereoscopic 3D copy & paste” – ACM SIGGRAPH HISTORY ARCHIVES

“Stereoscopic 3D copy & paste”

  • 2010 SA Technical Paper: Lo_Stereoscopic 3D copy & paste

Conference:


Type(s):


Title:

    Stereoscopic 3D copy & paste

Session/Category Title:   Image & video editing


Presenter(s)/Author(s):


Moderator(s):



Abstract:


    With the increase in popularity of stereoscopic 3D imagery for film, TV, and interactive entertainment, an urgent need for editing tools to support stereo content creation has become apparent. In this paper we present an end-to-end system for object copy & paste in a stereoscopic setting to address this need. There is no straightforward extension of 2D copy & paste to support the addition of the third dimension as we show in this paper. For stereoscopic copy & paste we need to handle depth, and our core objective is to obtain a convincing 3D viewing experience. As one of the main contributions of our system, we introduce a stereo billboard method for stereoscopic rendering of the copied selection. Our approach preserves the stereo volume and is robust to the inevitable inaccuracies of the depth maps computed from a stereo pair of images. Our system also includes an interactive stereoscopic segmentation tool to achieve high quality object selection. Hence, we focus on intuitive and minimal user interaction, and our editing operations perform within interactive rates to provide immediate feedback.

References:


    1. Agarwala, A., Hertzmann, A., Salesin, D. H., and Seitz, S. M. 2004. Keyframe-based tracking for rotoscoping and animation. ACM Trans. on Graph. 23, 3 (Aug.), 584–591. Google ScholarDigital Library
    2. Bai, X., Wang, J., Simons, D., and Sapiro, G. 2009. Video snapcut: robust video object cutout using localized classifiers. ACM Trans. on Graph. 28, 3 (Aug.). Google ScholarDigital Library
    3. Besl, P. J., and McKay, N. D. 1992. A method for registration of 3-d shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14, 2, 239–256. Google ScholarDigital Library
    4. Boykov, Y., Veksler, O., and Zabih, R. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. on Pattern Anal. and Mach. Intell. 23, 11, 1222–1239. Google ScholarDigital Library
    5. Chuang, Y.-Y., Agarwala, A., Curless, B., Salesin, D. H., and Szeliski, R. 2002. Video matting of complex scenes. In Proc. of SIGGRAPH 2002, ACM Press / ACM SIGGRAPH, J. F. Hughes, Ed., ACM, 243–248. Google ScholarDigital Library
    6. Comaniciu, D., and Meer, P. 2002. Mean shift: A robust approach toward feature space analysis. IEEE Trans. on Pattern Anal. and Mach. Intell. 24, 603–619. Google ScholarDigital Library
    7. Dong Seon, C., and Figueiredo, M. A. T. 2007. Cosegmentation for image sequences. Int. Conf. on Image Anal. and Proc., 635–640. Google ScholarDigital Library
    8. Farbman, Z., Hoffer, G., Lipman, Y., Cohen-Or, D., and Lischinski, D. 2009. Coordinates for instant image cloning. ACM Trans. on Graph. 28, 3 (Aug.). Google ScholarDigital Library
    9. Fuji, 2009. Finepix REAL 3D W1. http://www.fujifilm.com/products/3d/camera/finepix_real3dw1/.Google Scholar
    10. Fukuda, K., Wilcox, L. M., Allison, R., and Howard, I. P. 2009. A reevaluation of the tolerance to vertical misalignment in stereopsis. Journal of Vision 9, 2 (February), 1–8.Google ScholarCross Ref
    11. Georgiev, T. 2006. Covariant derivatives and vision. Proc. of European Conf. on Comp. Vision 4, 56–69. Google ScholarDigital Library
    12. Hartley, R. I., and Zisserman, A. 2004. Multiple View Geometry in Computer Vision, second ed. Cambridge University Press, ISBN: 0521540518. Google ScholarDigital Library
    13. Howard, I. P., and Rogers, B. J. 2002. Seeing in Depth, Basic Mechanics & Depth Perception, vol. 1 & 2. I Porteous, Thornhill, Ontario.Google Scholar
    14. Jia, J., Sun, J., Tang, C.-K., and Shum, H.-Y. 2006. Drag-and-drop pasting. ACM Trans. on Graph. 25, 3 (July), 631–637. Google ScholarDigital Library
    15. Koppal, S., Zitnick, C., Cohen, M., Kang, S., Ressler, B., and Colburn, A. 2010. A viewer-centric editor for stereoscopic cinema. IEEE Comp. Graph. and Appl. Preprint.Google Scholar
    16. Lalonde, J.-F., Hoiem, D., Efros, A. A., Rother, C., Winn, J., and Criminisi, A. 2007. Photo clip art. ACM Trans. on Graph. 26, 3 (July). Google ScholarDigital Library
    17. Lambooij, M., IJsselsteijn, W., Fortuin, M., and Heynderickx, I. 2009. Visual discomfort and visual fatigue of stereoscopic displays: A review. Journal of Imaging Science and Tech. 53, 3, 030201.Google ScholarCross Ref
    18. Lang, M., Hornung, A., Wang, O., Poulakos, S., Smolic, A., and Gross, M. 2010. Nonlinear disparity mapping for stereoscopic 3d. ACM Trans. on Graph. 29, 4 (July). Google ScholarDigital Library
    19. Liu, F., Gleicher, M., Jin, H., and Agarwala, A. 2009. Content-preserving warps for 3d video stabilization. ACM Trans. on Graph. 28, 3. Google ScholarDigital Library
    20. Liu, J., Sun, J., and Shum, H.-Y. 2009. Paint selection. ACM Trans. on Graph. 28, 3 (Aug.). Google ScholarDigital Library
    21. Loos, B. J., and Sloan, P.-P. 2010. Volumetric obscurance. In ACM Symp. on Interactive 3D Graph. and Games, ACM, New York, NY, USA, 151–156. Google ScholarDigital Library
    22. Lu, F., Fu, Z., and Robles-Kelly, A. 2007. Efficient graph cuts for multiclass interactive image segmentation. Proc. of the Asian Conf. on Comp. vision, 134–144. Google ScholarDigital Library
    23. Mammen, A. 1989. Transparency and antialiasing algorithms implemented with the virtual pixel maps technique. IEEE Comp. Graph. and Appl. 9, 4, 43–55. Google ScholarDigital Library
    24. Ning, J., Zhang, L., Zhang, D., and Wu, C. 2010. Interactive image segmentation by maximal similarity based region merging. Pattern Recogn. 43, 2, 445–456. Google ScholarDigital Library
    25. Patterson, R. 2007. Human factors of 3d displays. Journal of the Soc. for Information Disp., 15, 861–871.Google ScholarCross Ref
    26. Pérez, P., Gangnet, M., and Blake, A. 2003. Poisson image editing. ACM Trans. on Graph. 22, 3 (July), 313–318. Google ScholarDigital Library
    27. Reinhard, E., Ashikhmin, M., Gooch, B., and Shirley, P. 2001. Color transfer between images. IEEE Comp. Graph. and Appl. 21, 5, 34–41. Google ScholarDigital Library
    28. Rhee, S.-M., Ziegler, R., Park, J., Naef, M., Gross, M., and Kim, M.-H. 2007. Low-cost telepresence for collaborative virtual environments. IEEE Trans on Vis. and Comp. Graph. 13, 1, 156–166. Google ScholarDigital Library
    29. Rother, C., Kolmogorov, V., and Blake, A. 2004. “grab-cut”: interactive foreground extraction using iterated graph cuts. ACM Trans. on Graph. 23, 3 (Aug.), 309–314. Google ScholarDigital Library
    30. Rother, C., Minka, T., Blake, A., and Kolmogorov, V. 2006. Cosegmentation of image pairs by histogram matching. IEEE Conf. on Comp. Vision and Pattern Recog., 993–1000. Google ScholarDigital Library
    31. Scharstein, D., and Szeliski., R. 2002. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. Journal of Comp. Vision 47, 1/2/3, 7–42. Google ScholarDigital Library
    32. Scharstein, D., and Szeliski, R., 2010. Middlebury Stereo Repository. http://vision.middlebury.edu/stereo/.Google Scholar
    33. Shum, H. Y., Sun, J., Yamazaki, S., Li, Y., and Tang, C. K. 2004. Pop-up light field: An interactive image-based modeling and rendering system. ACM Trans. on Graph. 23, 2 (Aug.), 143–162. Google ScholarDigital Library
    34. Smith, B., Zhang, L., and Jin, H. 2009. Stereo matching with nonparametric smoothness priors in feature space. IEEE Conf. on Comp. Vision and Pattern Recog., 485–492.Google Scholar
    35. Taguchi, Y., Wilburn, B., and Zitnick, C. L. 2008. Stereo reconstruction with mixed pixels using adaptive over-segmentation. IEEE Conf. on Comp. Vision and Pattern Recog., 1–8.Google Scholar
    36. The Foundry, 2010. Nuke – Ocula Plug-in. http://www.thefoundry.co.uk/.Google Scholar
    37. Wang, J., and Cohen, M. F. 2008. Image and video matting: A survey. Foundations and Trends in Comp. Graph. and Vision 3, 2, 97–175. Google ScholarDigital Library
    38. Wang, C., and Sawchuk, A. A. 2008. Disparity manipulation for stereo images and video. Stereoscopic Disp. and Appl. 6803, 1, 68031E.Google ScholarCross Ref
    39. Wang, L., Jin, H., Yang, R., and Gong, M. 2008. Stereoscopic inpainting: Joint color and depth completion from stereo images. In IEEE Conf. on Comp. Vision and Pattern Recog., 1–8.Google Scholar
    40. Zitnick, C. L., and Kang, S. B. 2007. Stereo for image-based rendering using image over-segmentation. Int. Journal of Comp. Vision 75, 1, 49–65. Google ScholarDigital Library
    41. Zitnick, C. L., Kang, S. B., Uyttendaele, M., Winder, S., and Szeliski, R. 2004. High-quality video view interpolation using a layered representation. ACM Trans. on Graph. 23, 3 (Aug.), 600–608. Google ScholarDigital Library
    42. Zitnick, C. L., Jojic, N., and Kang, S. B. 2005. Consistent segmentation for optical flow estimation. IEEE Int. Conf. on Comp. Vision, 1308–1315. Google ScholarDigital Library
    43. Zwicker, M., Pfister, H., van Baar, J., and Gross, M. 2002. Ewa splatting. IEEE Trans. on Vis. and Comp. Graph. 8, 3, 223–238. Google ScholarDigital Library


ACM Digital Library Publication:



Overview Page:



Submit a story:

If you would like to submit a story about this presentation, please contact us: historyarchives@siggraph.org