Particulate filter (statistical)

The particulate filters , so known like sequential Methods of Monte Carlo , are sophisticated techniques of estimate of Modèle S based on the Simulation.

The particulate filters are generally used to estimate models Bayésiens and constitute the “in line” methods analogues with the Méthodes of Monte Carlo by Chaînes of Markov which they is methods “out-line” (thus a posteriori) and often similar to the methods of sampling of importance.

If they are conceived correctly, the particulate filters can be faster than the Methods of Monte Carlo by Chaînes of Markov. They often constitute an alternative to the filters of wide $kalmans with the advantage that with sufficient samples, they approach estimated optimal Bayésien. They can thus be made more precise than the filters of $kalman. The approaches can also be combined by using a Filtre of $kalman like a proposal for a distribution for the particulate filter .

Objective

The purpose of a particulate filter is to estimate the sequence of hidden parameters, x_k for k = 0,1,2,3, \ cdots, by basing only on the data observed y_k for k = 0,1,2,3, \ cdots. All the estimated parameters bayésiens of x_k come from the distribution a posteriori, but rather than to use the probability united a posteriori p (x_0, x_1, \ dowries, x_k | y_0, y_1, \ dowries, y_k) , which would result from one MCMC usual or a sampling of importance, the methods particulate estimate the distribution of filtering p (x_k|y_0, y_1, \ dowries, y_k) .

Modeling

The particulate filters make the assumption that the states x_k and the observations y_k can be modelled in the following form:

  • the continuation of the parameters x_0, x_1, \ cdots form a Chaîne of Markov of first order, such as x_k | x_{k-1} \sim p_{x_k | x_ {k-1}} (X | x_ {k-1}) and with an initial distribution p (x_0) .
  • the observations y_0, y_1, \ cdots are independent conditionally provided the x_0, x_1, \ cdots are known. In other words, each observation y_k depends only on the parameter x_k
{y_k | x_k} \ sim {p_ {there | X} (there | x_k)}

An example of this scenario is \ {\ begin {matrix} x_k=f (x_ {k-1}) + v_k \ \ y_k = H (x_k) + w_k \ end {matrix}

where at the same time v_k and w_k are sequences mutually independent and distributed to identical with functions of density of known probability and where f () and h () are known functions. These two equations can be seen like equations of the Espace of state and resemble those of the Filtre of $kalman.

If the functions f (\ cdot) and h (\ cdot) were linear, and so at the same time v_k and w_k were Gaussian , then the Filtre of $kalman finds the distribution of filtering Bayésien exact. In the contrary case, the methods containing filter of $kalman give an estimate of first order. The particulate filters also give approximations, but with sufficient particles, the results can be even more precise.

Approximation of Monte Carlo

The methods with particles, like all the methods containing samplings (such as the MCMC ), generate a whole of samples which approximate the distribution of filtering p (x_k|y_0, \ dowries, y_k) . Thus, with P samples, the values hoped with respect to the distribution of filtering are approximated by: \ int F (x_k) p (x_k|y_0, \ dowries, y_k) dx_k \ approx \ frac1P \ sum_ {L=1} ^Pf (x_k^ {(L)}) and f (\ cdot) , in the usual way of the methods Monte Carlo, can give all the data of the distribution (moments, etc) until a certain degree of approximation.

In general, the algorithm is repeated repeatedly for a given number of values k (which we will note N).

To initialize x_k=0|_ {k=0} for all the particles provides a starting position to generate x_1, which can be used to generate x_2, who can be used to generate x_3, and so on until k=N.

Once this carried out, the Average of the x_k on all the particles (or \ frac {1} {P} \ sum_ {L=1} ^P x_k^ {(L)}) is roughly the true value of x_k.

Sampling with rééchantillonnage by importance (SIR)

Sampling with Rééchantillonnage by importance (Sampling Resampling Importance or SIR) is an algorithm of filtering used very usually. It approximates the distribution of filtering p (x_k|y_0, \ ldots, y_k) by a whole of balanced particles: \ {(w^ {(L)}_k, x^ {(L)}_k) ~: ~L=1, \ ldots, P \} .

The weight of importance w^ {(L)}_k is approximations of the probabilities (or densities) a posteriori relative of the particles such as \ sum_ {L=1} ^P w^ {(L)}_k = 1.

Algorithm SIR is a recursive version of the sampling by importance. As in sampling by importance, hoped for function f (\ cdot) can be approximated like a weighted average: \ int F (x_k) p (x_k|y_0, \ dowries, y_k) dx_k \ approx \ sum_ {L=1} ^P w^ {(L)} F (x_k^ {(L)}).

The performance of the algorithm is dependant on the choice of the distributions of importances : \ pi (x_k|x_ {0: k-1}, y_ {0: K}) .

The distribution of optimal importance is given like: \ pi (x_k|x_ {0: k-1}, y_ {0: K}) = p (x_k|x_ {k-1}, y_ {K}).

However, the probability of transition is often used like function of importance, as it is easier to calculate, and that also simplifies calculations of the subsequent weights of importance: \ pi (x_k|x_ {0: k-1}, y_ {0: K}) = p (x_k|x_ {k-1}).

The rééchantillonnage filters by importance (SIR) with probabilities of transitions as function from importance are known commonly like filters starting (Bootstrap filters) or Algorithme of condensation.

The rééchantillonnage makes it possible to avoid the problem of the degeneration of the algorithm. One avoids the situations thus where all the weights of importance except one are close to zero. The performance of the algorithm can also be affected by the choice of the suitable method of rééchantillonnage. The rééchantillonnage laminated proposed by Kitagawa (1996) is optimal in terms of variance.

Only one sequential step of rééchantillonnage of importance proceeds in the following way:

  1. For L=1, \ ldots, P, one draws the samples from the distributions of importances : x^ {(L)}_k \ sim \ pi (x_k|x^ {(L)}_ {0: k-1}, y_ {0: K})
  2. For L=1, \ ldots, P, one evaluates the weights of importance with a constant of standardization:
\ hat {W} ^ {(L)}_k = w^ {(L)}_ {k-1} \ frac {p (y_k|x^ {(L)}_k) p (x^ {(L)}_k|x^ {(L)}_ {k-1})} {\ pi (x_k^ {(L)}|x^ {(L)}_ {0: k-1}, y_ {0: K})}.
  1. For L=1, \ ldots, P one calculates the standardized weights of importance:
w^ {(L)}_k = \ frac {\ hat {W} ^ {(L)}_k} {\ sum_ {J=1} ^P \ hat {W} ^ {(J)}_k}
  1. One calculates an estimate of the effective number of particles like
\ hat {NR} _ \ mathit {EFF} = \ frac {1} {\ sum_ {L=1} ^P \ left (w^ {(L)}_k \ right) ^2}
  1. If the effective number of particles is smaller than a threshold given \ hat {NR} _ \ mathit {EFF} < N_ {thr} , then one carries out the rééchantillonnage:
    1. To draw P particles from the whole of particles running with the probabilities proportional to their weight then to replace the whole of the current particles with this new unit.
    2. For L=1, \ ldots, P the unit w^ {(L)}_k = 1/P.

The term sequential Rééchantillonnage of importance (Sequential Resampling Importance) is also used sometimes to refer to filters SIR.

Sequential sampling by importance (LOCATED)

Sampling sequential by importance (or LOCATED for Sequential Sampling Importance) is similar to Sampling with rééchantillonage by importance (SIR) but without the stage of rééchantillonnage.

Direct version of the algorithm

The direct version of the algorithm is relatively simple compared to other algorithm of particulate filtering and uses the composition and the rejection. To generate a simple sample x with k of p_ {x_k|y_ {1: K}} (X|y_ {1: K}) :

(1) To fix p=1

(2) To uniformly generate L since \ {1,…, P \}
(3) To generate a test \ hat {X} since its distribution p_ {x_k|x_ {k-1}} (X|x_ {k-1|k-1} ^ {(L)})
(4) To generate the probabilities of \ hat {there} by using \ hat {X} since p_ {there|X} (y_k|\ hat {X}) where y_k is the value measured
(5) Générer another uniformly U since m_k
(6) Comparer U and \ hat {there}
(A) If U is larger then to repeat since the stage (2)
(b) If U is smaller then to save \ hat {X} like x {K|K} ^ {(p)} and to increment p
(c) If p > P then to stop

The objective is to generate P particles with the step k by using only the particles of the step k-1. That requires that a Markovian equation can be written (and calculated) to generate a x_k while being based only on x_ {k-1} . This algorithm uses the composition of P particles since k-1 to generate with k.

That can be more easily visualized if x is seen like a two-dimensional board. A dimension is k and other dimension corresponds to the number of particles. For example, x (K, L) would be Lème particle at the stage k and can be thus written x_k^ {(L)} (as carried out higher in the algorithm).

The stage (3) generates a potential x_k based on a particle chosen by chance (x_ {k-1} ^ {(L)}) has time k-1 and rejects or accepts this particle at the stage (6). In other words, the x_k values are generated by using the x_ {k-1} generated previously.

See too

  • Filter of $kalman, an analytical estimator for the Gaussian distributions
  • recursive Estimate bayésienne

Refer

  • Sequential Monte Carlo Methods in Practice , by has Doucet, NR of Freitas and NR Gordon. Published by Springer.
  • One Sequential Monte Carlo Sampling Methods for Bayesian Filtering , by has Doucet, C Andrieu and S. Godsill, Statistics and Computing, vol. 10, No 3, pp. 197-208, 2000 CiteSeer link
  • Tutorial one Particle Filters for On-line Nonlinear/Non-Gaussian Bayesian Tracking (2001) ; S. Arulampalam, S. Maskell, NR. Gordon and T. Clapp; CiteSeer link
  • F. Dellaert, D. Fox, W. Burgard, and S. Thrun, " Mobile Monte Carlo Localization for Robots, " International IEEE Conference one Robotics and Automation (ICRA99), May, 1999.

External bonds

  • Sequential Methods of Monte Carlo (Particulate Filtering) at the University of Cambridge
  • Animations MCL of Dieter Fox

Random links:Øystein Aarseth | Shrimp with white legs | Cape Ferrat | Bronzed boletus | Hologram Industries | Crocodile_d'agresseur