Английская Википедия:(a,b,0) class of distributions

Материал из Онлайн справочника
Перейти к навигацииПерейти к поиску

Шаблон:Short description

In probability theory, a member of the (a, b, 0) class of distributions is any distribution of a discrete random variable N whose values are nonnegative integers whose probability mass function satisfies the recurrence formula

<math> \frac{p_k}{p_{k-1}} = a + \frac{b}{k}, \qquad k = 1, 2, 3, \dots </math>

for some real numbers a and b, where <math>p_k = P(N = k)</math>.

Only the Poisson, binomial and negative binomial distributions satisfy the full form of this relationship. These are also the three discrete distributions among the six members of the natural exponential family with quadratic variance functions (NEF–QVF).

More general distributions can be defined by fixing some initial values of pj and applying the recursion to define subsequent values. This can be of use in fitting distributions to empirical data. However, some further well-known distributions are available if the recursion above need only hold for a restricted range of values of k:[1] for example the logarithmic distribution and the discrete uniform distribution.

The (a, b, 0) class of distributions has important applications in actuarial science in the context of loss models.[2]

Properties

Sundt[3] proved that only the binomial distribution, the Poisson distribution and the negative binomial distribution belong to this class of distributions, with each distribution being represented by a different sign of a. Furthermore, it was shown by Fackler[4] that there is a universal formula for all three distributions, called the (united) Panjer distribution.

The more usual parameters of these distributions are determined by both a and b. The properties of these distributions in relation to the present class of distributions are summarised in the following table. Note that <math>W_N(x)\,</math> denotes the probability generating function.

Distribution <math> P[N=k]\, </math> <math> a\, </math> <math> b \,</math> <math> p_0\, </math> <math> W_N(x)\, </math> <math> \operatorname E[N]\, </math> <math> \operatorname{Var}(N)\, </math>
Binomial <math> \binom{n}{k} p^k (1-p)^{n-k} </math> <math> \frac{-p}{1-p} </math> <math> \frac{p(n+1)}{1-p} </math> <math> (1-p)^n\, </math> <math> (px+(1-p))^{n} \,</math> <math> np\, </math> <math> np(1-p) \,</math>
Poisson <math> e^{-\lambda}\frac{ \lambda^k}{k!}\, </math> <math> 0\, </math> <math> \lambda \,</math> <math> e^{- \lambda}\, </math> <math> e^{\lambda(x-1)} \,</math> <math> \lambda\, </math> <math> \lambda \,</math>
Negative binomial <math> \frac{\Gamma(r+k)}{k!\,\Gamma(r)}\,p^r\,(1-p)^k \,</math> <math> 1-p\, </math> <math> (1-p)(r-1)\, </math> <math> p^r \,</math> <math> \left( \frac{p}{1 - x(1-p)}\right) ^r \,</math> <math> \frac{r(1-p)}{p} \, </math> <math> \frac{r(1-p)}{p^2} \,</math>
Panjer distribution <math> \left( 1+\frac{\lambda}{\alpha} \right)^{-\alpha} \frac{\lambda^k}{k!} \prod_{i=0}^{k-1}\frac{\alpha+i}{\alpha+\lambda} \,</math> <math> \frac{\lambda}{\alpha+\lambda}\, </math> <math> \frac{(\alpha-1)\lambda}{\alpha+\lambda}\, </math> <math> \left( 1+\frac{\lambda}{\alpha} \right)^{-\alpha} \,</math> <math> \left( 1-\frac{\lambda}{\alpha}(x-1)\right)^{-\alpha} \,</math> <math> \lambda \, </math> <math> \lambda \left( 1 + \frac{\lambda}{\alpha} \right) \,</math>

Note that the Panjer distribution reduces to the Poisson distribution in the limit case <math>\alpha \rightarrow \pm\infty</math>; it coincides with the negative binomial distribution for positive, finite real numbers <math>\alpha\in \mathbb{R}_{>0}</math>, and it equals the binomial distribution for negative integers <math> -\alpha \in \mathbb{Z}</math>.

Plotting

An easy way to quickly determine whether a given sample was taken from a distribution from the (a,b,0) class is by graphing the ratio of two consecutive observed data (multiplied by a constant) against the x-axis.

By multiplying both sides of the recursive formula by <math>k</math>, you get

<math>k \, \frac{p_k}{p_{k-1}} = ak + b,</math>

which shows that the left side is obviously a linear function of <math>k</math>. When using a sample of <math>n</math> data, an approximation of the <math>p_k</math>'s need to be done. If <math>n_k</math> represents the number of observations having the value <math>k</math>, then <math>\hat{p}_k = \frac{n_k}{n}</math> is an unbiased estimator of the true <math>p_k</math>.

Therefore, if a linear trend is seen, then it can be assumed that the data is taken from an (a,b,0) distribution. Moreover, the slope of the function would be the parameter <math>a</math>, while the ordinate at the origin would be <math>b</math>.

See also

References

Шаблон:Reflist


Шаблон:ProbDistributions

  1. Ошибка цитирования Неверный тег <ref>; для сносок Schmidt не указан текст
  2. Ошибка цитирования Неверный тег <ref>; для сносок Klugman не указан текст
  3. Ошибка цитирования Неверный тег <ref>; для сносок Sundt не указан текст
  4. Ошибка цитирования Неверный тег <ref>; для сносок Fackler не указан текст