Sparse Grid Interpolation Toolbox

Dimensional adaptivity

With the standard sparse grid approach, all dimensions are treated equally, i.e., the number of grid points is the same in each coordinate direction. The question arises whether one can further reduce the computational effort for objective functions in which not all input variables carry equal weight. This is especially important for higher-dimensional problems, where variables of strongly varying importance are frequently encountered. Unfortunately, it is usually not known a priori which variables (or, in this context, dimensions) are the important ones. Therefore, an efficient approach should be able to automatically detect which dimensions are more or less important, without wasting any function evaluations.

Hegland [7] and Gerstner and Griebel [8] show that this is indeed possible. They propose an approach to generalize sparse grids such that the number of nodes in each dimension is adaptively adjusted to the problem at hand. Here, the adaptive refinement is not performed in a spatial manner, as it is commonly done in two- and three-dimensional problems (e.g. [5]), where more support nodes are placed in areas of increased nonlinearity or non-smoothness (this can become impractical in higher dimensions due to the required complex data structure and refinement criteria).

Besides being able to balance the number of nodes in each coordinate direction, dimension-adaptive sparse grids are capable of automatically detecting separability (or partial separability) encountered in problems with additive (or nearly additive) structure.
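
To make the notion of separability concrete, here is a minimal sketch (the function handles are illustrative assumptions, not part of the toolbox):

% Fully additive (separable): no mixed terms, so each coordinate
% direction can be refined independently of the others.
fsep = @(x) sum(x.^2);

% Nearly additive (partially separable): a single coupling term links
% the first two variables, similar to the trid function used below.
fpart = @(x) sum(x.^2) + x(1)*x(2);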

The Sparse Grid Interpolation Toolbox includes a powerful dimension-adaptive algorithm based on the approach by Gerstner and Griebel [8], but also includes the significant performance enhancements described in [3, ch. 3]. Applying the dimension-adaptive algorithm is very easy: it can be switched "on" or "off" with a single parameter of the sparse grid options structure, which can be set with the spset function. Furthermore, the degree of dimensional adaptivity can be chosen as a scalar from the interval [0,1], where 1 stands for greedy (= purely adaptive) and 0 stands for conservative (= standard sparse grid) refinement.
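
For instance, a minimal sketch using only the option names quoted above:

% Switch dimensional adaptivity on; a degree of 0.9 leans toward
% greedy refinement, while 0 would reproduce the regular sparse grid.
options = spset('DimensionAdaptive', 'on', 'DimadaptDegree', 0.9);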

Example

Consider the following quadratic test function:

   f(x) = [sum_{i=1}^d (x_i-1)^2] - [sum_{i=2}^d x_i*x_{i-1}]

It is implemented in Matlab by the following code:

type('trid.m')
function y = trid(x)
% TRID   Quadratic function with a tridiagonal Hessian.
%   Y = TRID(X)   returns the function value Y for a D-
%   dimensional input vector X.
%
% f(x)=[sum_{i=1}^d (x_i-1)^2] - [sum_{i=2}^d x_ix_{i-1}]
%
% The test function is due to Arnold Neumaier, listed
% on the global optimization Web page at 
%   http://www.mat.univie.ac.at/~neum/glopt/

d = length(x);
y = sum((x-1).^2) - sum(x(2:d).*x(1:d-1));

The function clearly exhibits additive structure; however, it is not fully separable, since the second term couples the variables. Consider the high-dimensional case d = 100. A traditional tensor-product approach would completely fail to interpolate a high-dimensional function of this type, since at least 2^100 nodes are required if an interpolation formula with two nodes per dimension is extended to the multivariate case via conventional tensor products. With the dimension-adaptive sparse grid algorithm, however, the structure is automatically detected, and the function is successfully recovered using just O(d^2) points. For the interpolation domain, we have used [-d^2, d^2] in each dimension.
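
The gap between the two approaches is easy to quantify with a rough back-of-the-envelope check (assuming two nodes per dimension for the full tensor grid):

d = 100;
2^d    % nodes of a full tensor grid: ans = 1.2677e+30 -- infeasible
d^2    % order of the adaptive node count: ans = 10000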

Using piecewise multilinear basis functions and the Clenshaw-Curtis grid, f can be recovered with an estimated relative error below 0.1 percent (the relative error is given with respect to the estimated range of the function) using about 27000 function evaluations, as the following code shows.

d = 100;
range = repmat([-d^2 d^2],d,1);
options = spset('DimensionAdaptive', 'on', ...
                'DimadaptDegree', 1, ...
                'FunctionArgType', 'vector', ...
                'RelTol', 1e-3, ...
                'MaxPoints', 40000);

z1 = spvals(@trid, d, range, options)
z1 = 
               vals: {[26993x1 double]}
           gridType: 'Clenshaw-Curtis'
                  d: 100
              range: [100x2 double]
        estRelError: 3.2552e-04
        estAbsError: 9.7656e+04
         fevalRange: [100 300000100]
         minGridVal: [1x100 double]
         maxGridVal: [1x100 double]
            nPoints: 26993
          fevalTime: 5.0122
    surplusCompTime: 0.4784
            indices: [1x1 struct]
           maxLevel: [1x100 double]
      activeIndices: [5149x1 uint32]
     activeIndices2: [5749x1 uint32]
                  E: [1x5749 double]
                  G: [5749x1 double]
                 G2: [5749x1 double]
       maxSetPoints: 6
           dimAdapt: 1
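
As a quick plausibility check (a sketch that goes beyond the original demo), the interpolant can already be evaluated at a single point using spinterp, which expects one argument per dimension. At the origin, the exact value is trid(zeros(1,d)) = d = 100:

% Evaluate the interpolant z1 at the origin.
x0 = num2cell(zeros(1,d));
spinterp(z1, x0{:})   % exact value: trid(zeros(1,d)) = 100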

Since the objective function is quadratic, we can even approximate it up to floating-point accuracy by using polynomial basis functions and the Chebyshev-Gauss-Lobatto grid:

options = spset(options, 'GridType', 'Chebyshev');
z2 = spvals(@trid, d, range, options)
z2 = 
               vals: {[20201x1 double]}
           gridType: 'Chebyshev'
                  d: 100
              range: [100x2 double]
        estRelError: 2.4835e-17
        estAbsError: 7.4506e-09
         fevalRange: [100 300000100]
         minGridVal: [1x100 double]
         maxGridVal: [1x100 double]
            nPoints: 20201
          fevalTime: 3.8847
    surplusCompTime: 3.5568
            indices: [1x1 struct]
           maxLevel: [1x100 double]
      activeIndices: [4951x1 uint32]
     activeIndices2: [5151x1 uint32]
                  E: [1x5151 double]
                  G: [5151x1 double]
                 G2: [5151x1 double]
       maxSetPoints: 2
           dimAdapt: 1

We can verify the quality of the interpolants by computing the maximum relative error for 100 randomly sampled points (the relative error is computed with respect to the range of the function values that occurred during the sparse grid construction). In this case, the estimated error was too optimistic in the piecewise linear case; however, the relative error for the sampled points is still below 1 percent.

% Compute 100 randomly sampled points
p = 100;
rand('state', 0);
x = -d^2 + 2*d^2*rand(p,d);

% Compute exact function values
y = zeros(p,1);
for k = 1:p
  y(k) = trid(x(k,:));
end

% Compute interpolated function values
xcell = num2cell(x,1);
ip1 = spinterp(z1, xcell{:});
ip2 = spinterp(z2, xcell{:});

% Compute relative errors
err_CC = max(abs(y-ip1))/(z1.fevalRange(2)-z1.fevalRange(1))
err_CGL = max(abs(y-ip2))/(z2.fevalRange(2)-z2.fevalRange(1))
err_CC =
    0.0061
err_CGL =
   1.2716e-14
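
Finally, the Trid test function has the known minimizer x*_i = i*(d+1-i) with minimum value -d*(d+4)*(d-1)/6, which equals -171600 for d = 100; the minimizer lies well inside the chosen interpolation range. This suggests one further consistency check (again a sketch beyond the original demo):

% Evaluate the Chebyshev interpolant at the known minimizer of trid.
xmin = num2cell((1:d).*(d+1-(1:d)));
spinterp(z2, xmin{:})   % should closely match -171600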