6. Bayesian Approach

The Bayesian approach used by MTfit is based on the approach described in Pugh et al, 2016a

6.1. Bayes Theory

Inversion approaches fit model parameters to observed data, to find the best fitting parameters. In MTfit, the probability of the data being correct is evaluated for the possible sources. The resulting estimates of the PDF can be combined for all the data to approximate the true PDF for the source. This PDF can be considered to describe the probability of the observed data for a given source, the likelihood, \(p\left(\mathrm{data}\,|\mathrm{\, model}\right)\). However the value of interest in such an inversion is the probability of the model given the observed data, the posterior PDF. This can be evaluated from the likelihood using Bayes' Theory (Bayes and Price, 1763 , Laplace, 1812 , Sivia, 2000) is given simply by:

\[\mathrm{p}\left(\mathrm{model\,|\, data}\right)=\frac{\mathrm{p}\left(\mathrm{data}\,|\mathrm{\, model}\right)\, \mathrm{p}\left(\mathrm{model}\right)}{\mathrm{p}\left(\mathrm{data}\right)}\]

This relates the data likelihood \(\mathrm{p}\left(\mathrm{data}\,|\mathrm{\, model}\right)\) to the posterior probability \(\mathrm{p}\left(\mathrm{model\,|\, data}\right)\) via the prior probabilities \(\mathrm{p}\left(\mathrm{model}\right)\) and \(\mathrm{p}\left(\mathrm{data}\right)\). The prior distributions incorporate the known information about the two parameters, and although the choice of prior is not trivial, it does not have a large affect on the posterior PDF given enough data.

6.2. Uncertainties

Source inversion is particularly sensitive to uncertainties, and since it is usually carried out after several other required steps, the effect of the uncertainties may be difficult to understand. The uncertainties can be broken down into several different types: instrument errors, model errors and background noise.

Although the interdependencies between the uncertainties can be explored, a quantitative relationship is not known. For such a treatment to be truly rigorous, the variations in these errors must be included throughout the inversion. The Bayesian formulation allows rigorous inclusion of uncertainties in the problem using marginalisation. Marginalisation removes the dependence of the joint PDF, \(P(A,B)\) , on one of the variables, by integrating over the PDF for that variable:

\[\mathrm{p}\left(A\right)=\int \mathrm{p}\left(A\,,\, B\right)dB=\int \mathrm{p}\left(A\,|\, B\right)\mathrm{p}\left(B\right)dB\]

Marginalisation can be used to incorporate the uncertainties in the inversion into the final PDF.

It is assumed that the noise has a mean and variance that are measurable, and therefore the most ambiguous (maximum entropy) distribution for these measurements is the Gaussian distribution (Pugh et al, 2015). For de-meaned data, the noise can be assumed also to have zero mean, and a standard deviation \(\sigma_{mes}\), so that the PDF \(\mathrm{p}\left(\Delta_{mes}\right)\), is:

\[\mathrm{p}\left(\Delta_{mes}\right)=\frac{1}{\sqrt{2\pi\sigma_{mes}^{2}}}\mathrm{e}^{-\frac{\Delta_{mes}^{2}}{2\sigma_{mes}^{2}}}\]

There are several different types of uncertainties however most can be simplified to independent uncertainties on each trace, and uncertainties in the underlying model.

6.3. Posterior PDF

The likelihoods for the two different data types are described here.

The posterior PDFs for the data used in MTfit are, for a known Earth model:

\[\mathrm{p}\left(\mathbf{d'}\:|\:\mathbf{M},\mathbf{t},\mathbf{\epsilon},\mathbf{k}\right)=\iint\sum_{j=1}^{M}\prod_{i=1}^{N}\left[\mathrm{p}\left(Y_{i}\:|\:\mathbf{\mathbf{A_{ij}}}=\mathbf{a}_{j}\cdot\mathbf{\tilde{M}},\mathbf{\sigma_{i}},\varpi_{i}\right)\mathrm{p}\left(\mathbf{R_{i}}\:|\:\mathbf{\mathbf{A_{i}}}=\mathbf{a}_{j}\cdot\mathbf{\tilde{M}},\mathbf{\sigma_{i}},\varpi_{i}\right)\right]\mathrm{p}\left(\mathbf{\sigma}\right)\mathrm{p}\left(\mathbf{\varpi}\right)d\mathbf{\sigma}d\mathbf{\varpi}\]

An unknown earth model, samples from the model distribution, \(\mathbf{G}_{k}\), are included using a Monte Carlo method based marginalisation:

\[\mathrm{p}\left(\mathbf{d'}\:|\:\mathbf{M},\mathbf{t},\mathbf{\epsilon},\mathbf{k}\right)=\iint\sum_{k=1}^{Q}\sum_{j=1}^{M}\prod_{i=1}^{N}\left[\mathrm{p}\left(Y_{i}\:|\:\mathbf{\mathbf{A_{ijk}}}=\mathbf{a}_{jk}\cdot\mathbf{\tilde{M}},\mathbf{\sigma_{i}},\varpi_{i}\right)\mathrm{p}\left(\mathbf{R_{i}}\:|\:\mathbf{\mathbf{A_{ijk}}}=\mathbf{a}_{jk}\cdot\mathbf{\tilde{M}},\mathbf{\sigma_{i}},\varpi_{i}\right)\right]\mathrm{p}\left(\mathbf{\sigma}\right)\mathrm{p}\left(\mathbf{\varpi}\right)d\mathbf{\sigma}d\mathbf{\varpi},\]

where \(\mathbf{a}_{jk}=\mathbf{a}\left(\mathbf{x}_{j},\mathbf{G}_{k}\right)\) refers to the station propagation coefficients associated with the location at \(\mathbf{x}_{j}\) and earth model \(\mathbf{G}_{k}\).

A common location method is NonLinLoc by Anthony Lomax which can produces samples from the location PDF to be used in the Monte Carlo method described above (see location uncertainty tutorial).

The symbols used in these PDFs are:

Symbol Meaning
\(Y\) Polarity
\(A\) Amplitude
\(\sigma_{y}\) Uncertainty
\(\varpi\) Mispick Probability
\(\mathbf{\tilde{M}}\) Moment Tensor 6-vector
\(\mathbf{M}\) Moment Tensor
\(\mathbf{a}\) Receiver Ray Path Coefficients
\(\sigma\) Measurment Uncertainty
\(R\) Amplitude Ratio
\(A\) Amplitude
\(\mathbf{\epsilon}\) Nuisance Parameters
\(\mathbf{k}\) Known Parameters
\(\mathbf{t}\) Arrival times