Holey Sampling: Topological Analysis of Sampling Patterns for Assessing Error in High-dimensional Quadrature

Lead Research Organisation: University of Edinburgh
Department Name: Sch of Informatics

Abstract

Estimating integrals of functions forms the cornerstone of many general classes of problems such as optimisation, sampling and normalisation; these problems, in turn, are central tools for a plethora of applications across various fields such as computer graphics, computer vision and machine learning. The integrand, or function to be integrated, is complicated and rarely available in closed form. Its domain spans spaces of arbitrarily high dimensionality. Exact integration is hopeless and approximation is unavoidable in practice. An estimate of the integral is typically constructed using evaluations of the integrand at a number of sampled locations in the domain. The set of points where the function is sampled is often referred to collectively as a sampling pattern. For computer graphics applications, a modern animation feature film of length 1.5h typically involves the generation of a total of a few hundreds of trillions of high-dimensional samples that are mapped into light paths.

Although a number of strategies have been proposed towards generating samples, measuring the quality of high-dimensional sampling patterns is an open problem. Sampling strategies are currently compared on a case-by-case basis by explicitly computing errors in the context of each application independently. The computation associated with measures such as discrepancy and Fourier analysis scale exponentially with dimensionality and are therefore not practicable for samples in high-dimensional domains.

The proposed work seeks to quantify equidistribution of high-dimensional point sets using an alternative measure to discrepancy that is tractable. This project will establish mathematical connections between computational topology, stochastic geometry and error analysis for Monte Carlo integration. The goal is to develop a measure for assessing the quality of sampling-based estimators purely based on the samples used. The derived theory will be evaluated and applied on Monte Carlo rendering for Computer Graphics applications.

Planned Impact

The proposed research will improve the state of the art in assessing sampling strategies used for rendering computer graphics imagery. Identifying efficient sampling strategies is key for fast rendering. An animated feature film of 1.5h typically requires about 100 million CPU hours of computation for its final render. The number of Monte Carlo samples drawn is typically in the order of hundreds of trillions. Even identifying that a particular sampling strategy is a mere 1% improvement in error over others leads to a saving of the order of millions of CPU hours.

The proposed project will impact companies that develop or rely on rendering software to generate realistic computer graphics imagery. Companies in the business of visual effects for motion pictures, such as Framestore, Double Negative, Moving Picture Company, Industrial Light Magic, etc., develop their own rendering software. In addition, major users of rendering software are spread widely across other fields such as video games, architecture and advertising. The Department for Culture Media and Support estimates (2016) that the Gross Value Added due to the Creative Industries was £84.1bn in 2014 of which IT and computer services constitute about 62%.

Companies with large budgets, such as Framestore and Double Negative, are able to adapt and tinker with sampling strategies developed by academic researchers for their own benefit. However, even this requires employees with specialist know-how and exemplary mathematical and computer programming abilities (letters attached). Recruitment of such trained people is a looming issue.

Techniques developed by large companies are often protected IP and thereby affect the balance of the wider ecosystem of the creative industries. As EPSRC acknowledges on their webpage for Graphics and Visualisation, a large portion of the creative industries is formed by a vibrant community of small and medium-sized enterprises (SMEs) which rely heavily on academic research for improvements to fundamental problems such as sampling.

The proposed research will generate impact through both fundamental improvements to sampling for rendering as well as improved skills (via improved curricula) and dissemination (course at ACM SIGGRAPH).

Publications

10 25 50
publication icon
Asenov M (2019) Active Localization of Gas Leaks Using Fluid Simulation in IEEE Robotics and Automation Letters

publication icon
Singh G (2019) Analysis of Sample Correlations for Monte Carlo Rendering in Computer Graphics Forum

publication icon
Singh G (2019) Fourier Analysis of Correlated Monte Carlo Importance Sampling in Computer Graphics Forum

 
Description Numerical integration via sampling operates by averaging a number of evaluations of the integrand. In naive Monte Carlo integration, these sample locations are chosen randomly and independently. However, many methods deliberately introduce correlations between the samples. The analysis of the impact of these correlations on the estimator is non-trivial, and typically requires different mathematical machinery for different families of correlated samplers. We show that Fourier Analysis is a powerful and general tool that can be used to represent the effect of correlated samplers. However, we also showed that this method suffers from poor computational scalability for integration over high-dimensional spaces.

Further to the above analysis, we have developed a new algorithm that performs similarly to existing algorithms but is more generally scalable and parallelisable. We are preparing a Journal Submission, but have uploaded it temporarily to https://arxiv.org/abs/2002.07002
Exploitation Route A potential next step would be to exploit our understanding of the impact of correlations, to design specially correlated samplers that yield low-error estimators. This gain in computational performance could be leveraged in two ways: First, to yield reasonable estimates of integrals for applications with a short time budget; second, to reduce the computational resources required for applications that rely on highly precise estimates.
Sectors Creative Economy,Financial Services, and Management Consultancy

URL http://homepages.inf.ed.ac.uk/ksubr/research.html#CGF19_1
 
Description Computational Topologist at Oxford 
Organisation University of Oxford
Country United Kingdom 
Sector Academic/University 
PI Contribution My PhD student Alexandros Keros and I had discussions with Prof. Vidit Nanda at Oxford, which led to our AAAI publication. In the research work, we developed a neural network architecture to solve a computationally intractable problem of localising homology generators (holes/cycles).
Collaborator Contribution Prof. Vidit Nanda is an expert in computational topology, and has developed a computational tool (library) that is used widely. His insight on potential pathways to solving the difficult computational problem (mentioned above) were key. He also contributed to the writing (the relevant background section) of the article.
Impact published article: Dist2Cycle: A Simplicial Neural Network for Homology Localization Alexandros Dimitrios Keros, Vidit Nanda, Kartic Subr AAAI 2022 Yes it is multidisciplinary. Vidit Nanda is a Professor of Mathematics at Oxford. Alexandros Keros and I are in the School of Informatics, University of Edinburgh.
Start Year 2019
 
Description Edinburgh Crucible for Sampling in the Special Effects Industry 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Industry/Business
Results and Impact This event was a crucible to foster discussions between academics, industry, students and practitioners. The format of the event was talks from prominent people from the special effects industry, professors from leading institutions in the UK in computer graphics and students. After short talks from each of the presenters, there was a Q&A session and a panel discussion surrounding how the supply and quality of computer graphics programmers might be improved to keep up with the demand from industry.
Year(s) Of Engagement Activity 2019
URL https://www.eventbrite.com/e/photorealistic-visuals-for-computer-graphics-applications-opportunities...
 
Description Public event at the Barbican, London 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact I was invited by Royal Society to lead a Cafe Scientifique session at the Barbican museum in London as a part of the year-long Life Rewired Season. I developed an activity to engage the public and stressed on the importance of approximate methods such as sampling for application to machine learning in different fields such as robotics, computer graphics, etc.

The event was held in the Atrium of the Barbican. There were about 60-80 seated audience members and many others who engaged partially as they were passing by. Since it was held outside the popular "AI: More than Human" exhibit, there was a steady crowd flowing through the atrium. The format of the event was a short presentation from me (about 10 mins) followed by about 40 mins of Q&A from audience members. The questions were at various levels from technical details to legal ramifications of AI.
Year(s) Of Engagement Activity 2019
URL https://royalsociety.org/science-events-and-lectures/2019/08/robotic-autonomy/