Gaussian process regression based on stationary Gaussian processes provides a powerful framework for data efficient learning in a relatively low dimension. One of the key features of the framework is the ability to quantify uncertainty associated to the predictions. This is often used in applications involving automatic decision making, including optimization [24], reinforcement learning [8] and more [21].
In some applications, inputs of the unknown function lie in a non-Euclidean space like a manifold or a graph. Although one can often model a function like this by embedding these inputs into a Euclidean space, the inner structure of the input space, which is an important modeling assumption, is lost. In practice this hinders data efficiency and impairs overall modeling quality. It is thus important to study Gaussian process regression with inputs on such spaces directly.
Recent developments on that account include [4] where the general case of compact Rieman- nian manifolds is studied and [3] where the object of consideration are Gaussian processes on graph-structured finite sets.
There exists, however, a number of examples of noncompact manifolds of great significance for applications for which the theory and the corresponding computational techniques are yet to be developed. Arguably, the most important ones are the manifold of postive definite matrices and the hyperbolic space — examples of the class of noncompact symmetric spaces.
In this work we study stationary Gaussian processes (most notably, based on heat and Matern kernels) on such spaces and computational approaches for Gaussian process regression on them.
These techniques include: efficient (approximate) algorithms for point-wise kernel evaluation and differentiation with respect to parameters; efficient algorithms for sampling, conditioning and sampling from the conditional Gaussian process.
In the next three parts of the introduction we give an overview of Gaussian processes regression on Euclidean spaces. In the forth and final part of the introduction we give a brief but more specific account of goals of the thesis and of the structure of the further text.
Notation We use lowercase bold to denote vectors (e.g. x) and uppercase upright bold to denote matrices (e.g. A).
For a function f(•) on X and x G Xn we put f(x) = [f(x1),...,f(xn)]. Similarly, for a function f (•, •) on X x Y and x G Xn, y G Ym we put f (x, y) = [f (xi,yj)]i
[1] P.-A. Absil, R. Mahony, and R. Sepulchre. Optimization Algorithms on Matrix Manifolds. Princeton University Press, 2008. Cited on page 16.
[2] M. Arnaudon, F. Barbaresco, and L. Yang. Riemannian Medians and Means With Applications to Radar Signal Processing. IEEE Journal of Selected Topics in Signal Processing, 7(4):595-604, 2013. Cited on page 18.
[3] V. Borovitskiy, I. Azangulov, A. Terenin, P. Mostowsky, M. Deisenroth, and N. Durrande. Matern Gaussian Processes on Graphs. 130:2593-2601. Cited on page 2.
[4] V. Borovitskiy, A. Terenin, P. Mostowsky, and M. Deisenroth. Matern Gaussian Processes on Riemannian Manifolds. In Advances in Neural Information Processing Systems, pages 12426-12437, 2020. Cited on pages 2, 8.
[5] E. Cartan. Sur une classe remarquable d’espaces de Riemann. Bulletin de la Societe Mathematique de France, 54:214-264, 1926. Cited on page 8.
[6] E. Cartan. Sur une classe remarquable d’espaces de Riemann. II. Bulletin de la Societe Mathematique de France, 55:114-134, 1927. Cited on page 8.
[7] J.-P. Chiles and P. Delfiner. Geostatistics: Modeling Spatial Uncertainty. Wiley, 2009. Cited on page 3.
[8] M. P. Deisenroth and C. E. Rasmussen. PILCO: A Model-Based and Data-Efficient Approach to Policy Search. In ICML’11, pages 465-472. Omnipress, 2011. Cited on page 2.
[9] G. Dong and G. Kuang. Target Recognition in SAR Images via Classification on Rieman- nian Manifolds. IEEE Geoscience and Remote Sensing Letters, 12(1):199-203, 2015. Cited on page 18.
[10] J. Eschenburg. Lecture Notes on Symmetric Spaces, Jan. 1997. Cited on page 7.
[11] I. S. Gradshteyn and I. M. Ryzhik. Table of Integrals, Series, and Products. Academic Press, 7th edition, 2014. Cited on page 17.
[12] S. Helgason. Differential Geometry and Symmetric Spaces. Pure and Applied Mathematics: Academic Press. Academic Press, 1962. isbn: 9780123384508. Cited on page 9.
[13] S. Helgason. Geometric Analysis on Symmetric Spaces. Mathematical surveys and monographs. American Mathematical Soc., 1993. Cited on pages 7, 10, 13.
[14] J. Hensman, N. Fusi, and N. D. Lawrence. Gaussian Processes for Big Data. In UAI’13, pages 282-290. AUAI Press, 2013. Cited on page 6.
[15] E. Hewitt, M. Naimark, A. Stern, and E. Hewitt. Theory of Group Representations. Grundlehren der mathematischen Wissenschaften. Springer New York. Cited on page 13....31