In computational statistics, the preconditioned Crank–Nicolson algorithm (pCN) is a Markov chain Monte Carlo (MCMC) method for obtaining random samples – sequences of random observations – from a target probability distribution for which direct sampling is difficult.
The most significant feature of the pCN algorithm is its dimension robustness, which makes it well-suited for high-dimensional sampling problems. The pCN algorithm is well-defined, with non-degenerate acceptance probability, even for target distributions on infinite-dimensional Hilbert spaces. As a consequence, when pCN is implemented on a real-world computer in large but finite dimension N, i.e. on an N-dimensional subspace of the original Hilbert space, the convergence properties (such as ergodicity) of the algorithm are independent of N. This is in strong contrast to schemes such as Gaussian random walk Metropolis–Hastings and the Metropolis-adjusted Langevin algorithm, whose acceptance probability degenerates to zero as N tends to infinity.
The algorithm as named was highlighted in 2013 by Cotter, Roberts, Stuart and White,[1] and its ergodicity properties were proved a year later by Hairer, Stuart and Vollmer.[2] In the specific context of sampling diffusion bridges, the method was introduced in 2008.[3]
Description of the algorithm
Overview
The pCN algorithm generates a Markov chain on a Hilbert space whose invariant measure is a probability measure of the form
for each measurable set , with normalising constant given by
where is a Gaussian measure on with covariance operator and is some function. Thus, the pCN method applied to target probability measures that are re-weightings of a reference Gaussian measure.
The Metropolis–Hastings algorithm is a general class of methods that try to produce such Markov chains , and do so by a two-step procedure of first proposing a new state given the current state and then accepting or rejecting this proposal, according to a particular acceptance probability, to define the next state . The idea of the pCN algorithm is that a clever choice of (non-symmetric) proposal for a new state given might have an associated acceptance probability function with very desirable properties.
The pCN proposal
The special form of this pCN proposal is to take
or, equivalently,
The parameter is a step size that can be chosen freely (and even optimised for statistical efficiency). One then generates and sets
The acceptance probability takes the simple form
It can be shown[2] that this method not only defines a Markov chain that satisfies detailed balance with respect to the target distribution , and hence has as an invariant measure, but also possesses a spectral gap that is independent of the dimension of , and so the law of converges to as . Thus, although one may still have to tune the step size parameter to achieve a desired level of statistical efficiency, the performance of the pCN method is robust to the dimension of the sampling problem being considered.
Contrast with symmetric proposals
This behaviour of pCN is in stark contrast to the Gaussian random walk proposal
with any choice of proposal covariance , or indeed any symmetric proposal mechanism. It can be shown using the Cameron–Martin theorem that for infinite-dimensional this proposal has acceptance probability zero for -almost all and . In practice, when one implements the Gaussian random walk proposal in dimension , this phenomenon can be seen in the way that
- for fixed , the acceptance probability tends to zero as , and
- for a fixed desired positive acceptance probability, as .
References
- ↑ Cotter, S. L.; Roberts, G. O.; Stuart, A. M.; White, D. (2013). "MCMC methods for functions: modifying old algorithms to make them faster". Statist. Sci. 28 (3): 424–446. arXiv:1202.0709. doi:10.1214/13-STS421. ISSN 0883-4237. S2CID 36562755.
- 1 2 Hairer, M.; Stuart, A. M.; Vollmer, S. J. (2014). "Spectral gaps for a Metropolis–Hastings algorithm in infinite dimensions". Ann. Appl. Probab. 24 (6): 2455–2490. arXiv:1112.1392. doi:10.1214/13-AAP982. ISSN 1050-5164. S2CID 73662504.
- ↑ Beskos, A.; Roberts, G. O.; Stuart, A. M.; Voss, J. (2008). "MCMC Methods for Diffusion Bridges". Stochastics and Dynamics. 8 (3): 319–350. doi:10.1142/S0219493708002378.