“Masked depth culling for graphics hardware”
Conference:
Type(s):
Title:
- Masked depth culling for graphics hardware
Session/Category Title: Illumination and Real-Time
Presenter(s)/Author(s):
Abstract:
Hierarchical depth culling is an important optimization that is present in all modern high-performance graphics processors. We present a novel culling algorithm based on a layered depth representation, with a per-sample mask indicating which layer each sample belongs to. Our algorithm is feed-forward in nature, in contrast to previous work, which relies on a delayed feedback loop. It is simple to implement and has fewer constraints than competing algorithms, which makes it easier to load-balance a hardware architecture. Compared to previous work, our algorithm performs very well and often reaches over 90% of the efficiency of an optimal culling oracle. Furthermore, we can reduce bandwidth by up to 16% by compressing the hierarchical depth buffer.
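The layered representation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `MaskedTile`, `is_culled`, and `update`, the choice of two layers and eight samples per tile, the less-than depth test, and the simplified update rule are all assumptions made for illustration. The abstract does not specify how layers are updated or merged.

```python
from dataclasses import dataclass


@dataclass
class MaskedTile:
    """Hypothetical tile: two conservative max depths plus a per-sample layer mask."""
    zmax: list             # [zmax of layer 0, zmax of layer 1]
    layer_mask: int        # bit i set -> sample i belongs to layer 1, else layer 0
    num_samples: int = 8   # assumed tile size, chosen for illustration

    def layer_samples(self, layer):
        """Bitmask of the samples assigned to the given layer."""
        full = (1 << self.num_samples) - 1
        if layer == 1:
            return self.layer_mask & full
        return ~self.layer_mask & full


def is_culled(tile, coverage, tri_zmin):
    """Assumes a less-than depth test. The triangle is culled only if, for every
    layer it touches, its nearest depth is not in front of that layer's zmax."""
    for layer in (0, 1):
        if coverage & tile.layer_samples(layer) and tri_zmin < tile.zmax[layer]:
            return False  # some covered sample might still pass the depth test
    return True


def update(tile, coverage, tri_zmax):
    """Simplified feed-forward update (an assumption, not the paper's rule):
    tighten a layer's zmax only when the triangle covers all of its samples.
    It uses triangle data only, with no feedback from the full depth buffer."""
    for layer in (0, 1):
        samples = tile.layer_samples(layer)
        if samples and (coverage & samples) == samples:
            tile.zmax[layer] = min(tile.zmax[layer], tri_zmax)
```

In this sketch, a triangle that only touches samples of a near layer can be culled even when the other layer in the same tile is far away, which a single per-tile maximum depth would not allow.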
References:
1. Aila, T., and Miettinen, V. 2004. dPVS: An Occlusion Culling System for Massive Dynamic Environments. IEEE Computer Graphics and Applications, 24, 2, 86–97.
2. Aila, T., Miettinen, V., and Nordlund, P. 2003. Delay Streams for Graphics Hardware. ACM Transactions on Graphics, 22, 3, 792–800.
3. Akeley, K. 1993. RealityEngine Graphics. In Proceedings of SIGGRAPH 93, ACM, 109–116.
4. Akenine-Möller, T., and Ström, J. 2003. Graphics for the Masses: A Hardware Rasterization Architecture for Mobile Phones. ACM Transactions on Graphics, 22, 3, 801–808.
5. Akenine-Möller, T., Munkberg, J., and Hasselgren, J. 2007. Stochastic Rasterization using Time-Continuous Triangles. In Graphics Hardware, 7–16.
6. Akenine-Möller, T., Haines, E., and Hoffman, N. 2008. Real-Time Rendering, 3rd ed. AK Peters Ltd.
7. Beaudoin, P., and Poulin, P. 2004. Compressed Multisampling for Efficient Hardware Edge Antialiasing. In Graphics Interface, 169–176.
8. Bittner, J., Wimmer, M., Piringer, H., and Purgathofer, W. 2004. Coherent Hierarchical Culling: Hardware Occlusion Queries Made Useful. Computer Graphics Forum, 23, 3, 615–624.
9. Boulos, S., Luong, E., Fatahalian, K., Moreton, H., and Hanrahan, P. 2010. Space-Time Hierarchical Occlusion Culling for Micropolygon Rendering with Motion Blur. In High Performance Graphics, 11–18.
10. Collin, D. 2011. Culling the Battlefield. Game Developers Conference (presentation).
11. Fatahalian, K., Boulos, S., Hegarty, J., Akeley, K., Mark, W. R., Moreton, H., and Hanrahan, P. 2010. Reducing Shading on GPUs using Quad-Fragment Merging. ACM Transactions on Graphics, 29, 4, 67:1–67:8.
12. Greene, N., and Kass, M. 1994. Error-bounded Antialiased Rendering of Complex Environments. In Proceedings of SIGGRAPH 94, 59–66.
13. Greene, N., Kass, M., and Miller, G. 1993. Hierarchical Z-Buffer Visibility. In Proceedings of SIGGRAPH 93, ACM, 231–238.
14. Greene, N. 1996. Hierarchical Polygon Tiling with Coverage Masks. In Proceedings of SIGGRAPH 96, ACM, 65–74.
15. Guthe, M., Balázs, Á., and Klein, R. 2006. Near Optimal Hierarchical Culling: Performance Driven Use of Hardware Occlusion Queries. In Eurographics Symposium on Rendering, 207–214.
16. Hasselgren, J., and Akenine-Möller, T. 2006. Efficient Depth Buffer Compression. In Graphics Hardware, 103–110.
17. Hasselgren, J., Andersson, M., Nilsson, J., and Akenine-Möller, T. 2012. A Compressed Depth Cache. Journal of Computer Graphics Techniques, 1, 1, 101–118.
18. Jouppi, N. P., and Chang, C.-F. 1999. Z3: An Economical Hardware Technique for High-Quality Antialiasing and Transparency. In Graphics Hardware, 85–93.
19. Mattausch, O., Bittner, J., and Wimmer, M. 2008. CHC++: Coherent Hierarchical Culling Revisited. Computer Graphics Forum, 27, 2, 221–230.
20. Morein, S. 2000. ATI Radeon HyperZ Technology. In Graphics Hardware, Hot3D Proceedings.
21. Munkberg, J., Clarberg, P., Hasselgren, J., Toth, R., Sugihara, M., and Akenine-Möller, T. 2011. Hierarchical Stochastic Motion Blur Rasterization. In High Performance Graphics, 107–118.
22. Shapiro, J. 1993. Embedded Image Coding using Zerotrees of Wavelet Coefficients. IEEE Transactions on Signal Processing, 41, 12, 3445–3462.
23. Staneker, D., Bartz, D., and Meissner, M. 2003. Improving Occlusion Query Efficiency with Occupancy Maps. In IEEE Symposium on Parallel and Large-Data Visualization and Graphics, 111–118.
24. Zhang, H., Manocha, D., Hudson, T., and Hoff III, K. E. 1997. Visibility Culling Using Hierarchical Occlusion Maps. In Proceedings of SIGGRAPH 97, 77–88.


