English Wikipedia: Generalized extreme value distribution

In probability theory and statistics, the generalized extreme value (GEV) distribution^[1] is a family of continuous probability distributions developed within extreme value theory to combine the Gumbel, Fréchet and Weibull families, also known as type I, II and III extreme value distributions. By the extreme value theorem, the GEV distribution is the only possible limit distribution of properly normalized maxima of a sequence of independent and identically distributed random variables.^[2] Such a limit distribution need not exist: its existence requires regularity conditions on the tail of the underlying distribution. Despite this, the GEV distribution is often used as an approximation to model the maxima of long (finite) sequences of random variables.

In some fields of application the generalized extreme value distribution is known as the Fisher–Tippett distribution, named after Ronald Fisher and L. H. C. Tippett, who recognised the three different forms outlined below. However, usage of this name is sometimes restricted to mean the special case of the Gumbel distribution. The origin of the common functional form for all three distributions dates back to at least Jenkinson, A. F. (1955),^[3] though allegedly^[4] it could also have been given by von Mises, R. (1936).^[5]

Specification

Using the standardized variable <math>\ s \equiv \frac{\ x - \mu\ }{\sigma}\ ,</math> where <math>\ \mu\ ,</math> the location parameter, can be any real number, and <math>\ \sigma > 0\ </math> is the scale parameter; the cumulative distribution function of the GEV distribution is then

<math display="block"> F(\ s;\ \xi\ ) = \begin{cases} \exp\! \Bigl( -e^{-s} \Bigr) & ~~ \text{ for } ~~ \xi = 0\ , \\ {} \\

\exp\! \Bigl( - \bigl( 1 + \xi s \bigr)^{-\tfrac{\ 1\ }{ \xi } }\Bigr) & ~~ \text{ for } ~~ \xi \neq 0 ~~ \text{ and } ~~ \xi\ s > -1\ , \\ {} \\ 0 & ~~ \text{ for } ~~ \xi > 0 ~~ \text{ and } ~~ s \le -\tfrac{\ 1\ }{\xi}\ , \\ {} \\ 1 & ~~ \text{ for } ~~ \xi < 0 ~~ \text{ and } ~~ s \ge \tfrac{ 1 }{\ |\ \xi\ |\ }\ ; \end{cases}</math>

where <math>\ \xi\ ,</math> the shape parameter, can be any real number. Thus, for <math>\ \xi > 0\ ,</math> the expression is valid for <math>\ s > -\tfrac{\ 1\ }{\xi}\ ,</math> while for <math>\ \xi < 0\ </math> it is valid for <math>\ s < - \tfrac{\ 1\ }{\xi} ~.</math> In the first case, <math>\ -\tfrac{\ 1\ }{\xi}\ </math> is the negative, lower end-point, where <math>\ F\ </math> is 0; in the second case, <math>\ -\tfrac{\ 1\ }{\xi}\ </math> is the positive, upper end-point, where <math>F</math> is 1. For <math>\ \xi = 0\ </math> the second expression is formally undefined and is replaced with the first expression, which is the result of taking the limit of the second as <math>\ \xi \to 0\ ,</math> in which case <math>\ s\ </math> can be any real number.

In the special case <math>\ x = \mu\ ,</math> we have <math>\ s = 0\ </math> and <math>\ F(\ 0;\ \xi\ ) = e^{-1}\ \approx 0.368\ </math> for whatever values <math>\ \xi\ </math> and <math>\ \sigma\ </math> might have.
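The piecewise definition above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the function name `gev_cdf` and its argument order are choices made here, not part of the source, and the sketch is not hardened against overflow for extremely negative standardized values.

```python
import math

def gev_cdf(x, mu=0.0, sigma=1.0, xi=0.0):
    """CDF of GEV(mu, sigma, xi), following the piecewise definition."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    s = (x - mu) / sigma                  # standardized variable
    if xi == 0.0:
        return math.exp(-math.exp(-s))    # Gumbel case, the limit xi -> 0
    t = 1.0 + xi * s
    if t <= 0.0:
        # Outside the support: at or below the lower end-point (xi > 0)
        # or at or above the upper end-point (xi < 0).
        return 0.0 if xi > 0 else 1.0
    return math.exp(-t ** (-1.0 / xi))
```

Evaluating `gev_cdf(mu, mu, sigma, xi)` returns `exp(-1)` for any admissible `sigma` and `xi`, which matches the special case noted above.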

The probability density function of the standardized distribution is

<math display="block">f(\ s;\ \xi\ ) = \begin{cases} e^{-s} \exp\! \Bigl( -e^{-s} \Bigr) & ~~ \text{ for } ~~ \xi = 0 , \\ {} \\

\Bigl(\ 1 + \xi s\ \Bigr)^{-\left( 1 + \tfrac{\ 1\ }{\xi} \right)}\ \exp\! \Bigl( -\left( 1 + \xi s \right)^{\tfrac{\ -1\ }{\xi}} \Bigr) & ~~ \text{ for } ~~ \xi \neq 0 ~~ \text{ and } ~~ \xi\ s > -1\ , \\ {} \\ 0 & ~~ \text{ otherwise; } \end{cases}</math>

again valid for <math>\ s > -\tfrac{\ 1\ }{\xi}\ </math> in the case <math>\ \xi > 0\ ,</math> and for <math>\ s < -\tfrac{\ 1\ }{\xi}\ </math> in the case <math>\ \xi < 0 ~.</math> The density is zero outside of the relevant range. In the case <math>\ \xi = 0\ </math> the density is positive on the whole real line.
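As a sanity check on the density, the following sketch integrates it numerically and confirms that the total mass is close to 1. The helper names `gev_pdf` and `integrate`, the midpoint rule, and the integration bounds are assumptions made for this illustration only.

```python
import math

def gev_pdf(s, xi=0.0):
    """Density of the standardized GEV distribution (mu = 0, sigma = 1)."""
    if xi == 0.0:
        return math.exp(-s) * math.exp(-math.exp(-s))
    t = 1.0 + xi * s
    if t <= 0.0:
        return 0.0                        # zero outside the support
    return t ** (-(1.0 + 1.0 / xi)) * math.exp(-t ** (-1.0 / xi))

def integrate(f, a, b, n=20000):
    """Composite midpoint rule; adequate for a rough check."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# xi = -0.5: the support is s < -1/xi = 2; the mass below -30 is negligible.
mass_weibull = integrate(lambda s: gev_pdf(s, xi=-0.5), -30.0, 2.0)

# xi = 0: the support is the whole real line; truncate both tails.
mass_gumbel = integrate(lambda s: gev_pdf(s, xi=0.0), -10.0, 40.0)
```

Both `mass_weibull` and `mass_gumbel` should come out very close to 1.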

Since the cumulative distribution function is invertible, the quantile function for the GEV distribution has an explicit expression, namely

<math display="block">\ Q(\ p;\ \mu,\ \sigma,\ \xi\ ) = \begin{cases}

\mu - \sigma\ \ln\! \Bigl( -\ln(p)\ \Bigr) & ~ \text{ for } ~ \xi = 0 ~ \text{ and } ~ p \in (\ 0\ ,\ 1\ )\ , \\ {} \\ \mu + \displaystyle{ \frac{\ \sigma\ }{\ \xi\ }} \left( \Bigl( -\ln(p)\ \Bigr)^{-\xi} - 1 \right) & ~ \text{ for } ~ \xi > 0 ~ \text{ and } ~ p \in [\ 0\ ,\ 1\ )\ , \\

{} & ~~ \text{ or } ~ \, \xi < 0 ~ \text{ and } ~ p \in \ (\ 0\ ,\ 1\ ]\ ; \end{cases}</math>

and therefore the quantile density function, <math>\ q \equiv \frac{\;\mathrm{d}\ Q\;}{\mathrm{d}\ p}\ ,</math> is

<math>\ q(\ p;\ \sigma,\ \xi\ ) = \frac{\sigma}{\ \Bigl( - \ln(p)\ \Bigr)^{\xi + 1}\ p \;} \quad \text{ for } ~~ p \in (\ 0\ ,\ 1\ )\ ,</math>

valid for <math>\ \sigma > 0\ </math> and for any real <math>\ \xi ~.</math>
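Because the quantile function is explicit, GEV samples can be drawn by inverse transform sampling: apply the quantile function to uniform random numbers. The sketch below assumes that approach; the names `gev_quantile` and `gev_sample` are hypothetical, and the end-point cases p = 0 and p = 1 are deliberately left out.

```python
import math
import random

def gev_quantile(p, mu=0.0, sigma=1.0, xi=0.0):
    """Quantile function Q(p; mu, sigma, xi) for p strictly inside (0, 1)."""
    if not 0.0 < p < 1.0:
        raise ValueError("this sketch only handles p in (0, 1)")
    y = -math.log(p)
    if xi == 0.0:
        return mu - sigma * math.log(y)
    return mu + sigma / xi * (y ** (-xi) - 1.0)

def gev_sample(n, mu=0.0, sigma=1.0, xi=0.0, seed=0):
    """Draw n values by applying the quantile function to uniforms."""
    rng = random.Random(seed)
    out = []
    while len(out) < n:
        u = rng.random()
        if u > 0.0:                       # random() can return exactly 0.0
            out.append(gev_quantile(u, mu, sigma, xi))
    return out
```

Since the CDF at the location parameter is always exp(-1), `gev_quantile(math.exp(-1), mu, sigma, xi)` returns `mu` up to rounding for any shape parameter.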

Example of probability density functions for distributions of the GEV family. ^[6]

Summary statistics

Some simple statistics of the distribution are:[citation needed]

<math>\operatorname{\mathbb E}(X) = \mu + (g_1-1)\frac{\sigma}{\xi}</math> for <math>\xi \neq 0 ,\ \xi < 1 ,</math>

<math>\operatorname{Var}(X) = (g_2-g_1^2)\frac{\sigma^2}{\xi^2}</math> for <math>\xi \neq 0 ,\ \xi < \tfrac{1}{2} ,</math>

<math>\operatorname{Mode}(X) = \mu+\frac{\sigma}{\xi}[(1+\xi)^{-\xi}-1] .</math>

For ξ > 0, the skewness is

<math>\operatorname{skewness}(X) = \frac{g_3-3g_2g_1+2g_1^3}{(g_2-g_1^2)^{3/2}} </math>

For ξ < 0, the sign of the numerator is reversed.

The excess kurtosis is:

<math>\operatorname{kurtosis\ excess}(X) = \frac{g_4-4g_3g_1+6g_2g_1^2-3g_1^4}{(g_2-g_1^2)^2}-3\ ,</math>

where <math>\ g_k = \Gamma(1 - k\ \xi)\ ,</math> <math>\ k=1,2,3,4\ ,</math> and <math>\ \Gamma(t)\ </math> is the gamma function.
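The mean, variance and mode formulas translate directly into code using the gamma function from the standard library. In the sketch below the function names are hypothetical; the ξ = 0 branches use the well-known Gumbel values (mean μ + σγ, variance σ²π²/6, mode μ), which are the limits of the formulas above rather than something the formulas state themselves. Returning infinity where the moment does not exist, and rejecting ξ ≤ −1 for the mode, are choices made for this illustration.

```python
import math

EULER_GAMMA = 0.5772156649015329   # Euler-Mascheroni constant

def gev_mean(mu, sigma, xi):
    if xi >= 1.0:
        return math.inf                     # mean is not finite
    if xi == 0.0:
        return mu + sigma * EULER_GAMMA     # Gumbel limit
    g1 = math.gamma(1.0 - xi)
    return mu + sigma * (g1 - 1.0) / xi

def gev_variance(mu, sigma, xi):
    if xi >= 0.5:
        return math.inf                     # variance is not finite
    if xi == 0.0:
        return sigma ** 2 * math.pi ** 2 / 6.0
    g1 = math.gamma(1.0 - xi)
    g2 = math.gamma(1.0 - 2.0 * xi)
    return sigma ** 2 * (g2 - g1 ** 2) / xi ** 2

def gev_mode(mu, sigma, xi):
    if xi <= -1.0:
        raise ValueError("this sketch only covers xi > -1")
    if xi == 0.0:
        return mu
    return mu + sigma * ((1.0 + xi) ** (-xi) - 1.0) / xi
```

For very small nonzero ξ the general formulas agree closely with the Gumbel branches, which is a quick way to check an implementation.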

Link to Fréchet, Weibull, and Gumbel families

The shape parameter <math>\ \xi\ </math> governs the tail behavior of the distribution. The sub-families are defined by three cases: <math>\ \xi = 0\ ,</math> <math>\ \xi > 0\ ,</math> and <math>\ \xi < 0\ ;</math> these correspond, respectively, to the Gumbel, Fréchet, and Weibull families, whose cumulative distribution functions are displayed below.

Type I or Gumbel extreme value distribution, case <math>~ \xi = 0\ , \quad</math> for all <math> \quad x \in \Bigl(\ -\infty\ ,\ +\infty\ \Bigr)\ :</math>

<math> F(\ x;\ \mu,\ \sigma,\ 0\ ) = \exp \left( - \exp \left( -\frac{\ x - \mu\ }{\sigma} \right) \right) ~. </math>

Type II or Fréchet extreme value distribution, case <math>~ \xi > 0\ , \quad </math> for all <math> \quad x \in \left(\ \mu - \tfrac{\sigma}{\ \xi\ }\ ,\ +\infty\ \right)\ :</math>

Let <math>\quad \alpha \equiv \tfrac{\ 1\ }{ \xi } > 0 \quad </math> and <math> \quad y \equiv 1 + \tfrac{\xi}{\sigma} (x-\mu)\ ;</math>

<math> F(\ x;\ \mu,\ \sigma,\ \xi\ ) = \begin{cases} 0 & y \leq 0 \quad \mathsf{~ or\ equiv. ~} \quad x \leq \mu - \tfrac{\sigma}{\ \xi\ } \\ \exp\left( -\frac{1}{~ y^\alpha\ } \right) & y > 0 \quad \mathsf{~ or\ equiv. ~} \quad x > \mu - \tfrac{\sigma}{\ \xi\ } ~. \end{cases}</math>

Type III or reversed Weibull extreme value distribution, case <math>~ \xi < 0\ , \quad </math> for all <math> \quad x \in \left( -\infty\ ,\ \mu + \tfrac{ \sigma }{\ |\ \xi\ |\ }\ \right)\ :</math>

Let <math> \quad \alpha \equiv - \tfrac{1}{\ \xi\ } > 0 \quad </math> and <math> \quad y \equiv 1 - \tfrac{\ |\ \xi\ |\ }{\sigma} (x - \mu)\ ;</math>

<math> F(\ x;\ \mu,\ \sigma,\ \xi\ ) = \begin{cases} \exp\left( -y^{\alpha} \right) & y > 0 \quad \mathsf{~ or\ equiv. ~} \quad x < \mu + \tfrac{ \sigma }{\ |\ \xi\ |\ } \\ 1 & y \leq 0 \quad \mathsf{~ or\ equiv. ~} \quad x \geq \mu + \tfrac{ \sigma }{\ |\ \xi\ |\ } ~. \end{cases}</math>

The subsections below remark on properties of these distributions.

Modification for minima rather than maxima

The theory here relates to data maxima and the distribution being discussed is an extreme value distribution for maxima. A generalised extreme value distribution for data minima can be obtained, for example, by substituting <math>\ -x\;</math> for <math>\;x\;</math> in the distribution function and subtracting the cumulative distribution from one; that is, replace <math>\ F(x)\ </math> with <math>\ 1 - F(-x)\ .</math> Doing so yields yet another family of distributions.
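As an illustrative special case, the substitution can be written out for <math>\ \xi = 0\ .</math> The symbol <math>\ F_{\min}\ </math> is introduced here only for this sketch; <math>\ F\ </math> is the maxima distribution function with parameters <math>\ (\mu,\ \sigma,\ 0)\ </math> as specified earlier:

<math display="block"> F_{\min}(x) = 1 - F(\ -x;\ \mu,\ \sigma,\ 0\ ) = 1 - \exp \left( - \exp \left( \frac{\ x + \mu\ }{\sigma} \right) \right) ~. </math>

In this parameterization the location of the minima distribution is <math>\ -\mu\ ,</math> which reflects the common practice of negating the data, treating the negated values as maxima, and negating the result back.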

Alternative convention for the Weibull distribution

The ordinary Weibull distribution arises in reliability applications and is obtained from the distribution here by using the variable <math>\ t = \mu - x\ ,</math> which gives a strictly positive support, in contrast to the use in the formulation of extreme value theory here. This arises because the ordinary Weibull distribution is used for cases that deal with data minima rather than data maxima. The distribution here has an additional parameter compared to the usual form of the Weibull distribution and, in addition, is reversed so that the distribution has an upper bound rather than a lower bound. Importantly, in applications of the GEV, the upper bound is unknown and so must be estimated, whereas when applying the ordinary Weibull distribution in reliability applications the lower bound is usually known to be zero.

Ranges of the distributions

Note the differences in the ranges of interest for the three extreme value distributions: Gumbel is unlimited, Fréchet has a lower limit, while the reversed Weibull has an upper limit. More precisely, Extreme Value Theory (Univariate Theory) describes which of the three is the limiting law according to the initial law of the underlying variables, and in particular depending on its tail.

Distribution of log variables

One can link the type I to types II and III in the following way: If the cumulative distribution function of some random variable <math>\ X\ </math> is of type II, and with the positive numbers as support, i.e. <math>\ F(\ x;\ 0,\ \sigma,\ \alpha\ )\ ,</math> then the cumulative distribution function of <math>\ln X</math> is of type I, namely <math>\ F(\ x;\ \ln \sigma,\ \tfrac{1}{\ \alpha\ },\ 0\ ) ~.</math> Similarly, if the cumulative distribution function of <math>\ X\ </math> is of type III, and with the negative numbers as support, i.e. <math>\ F(\ x;\ 0,\ \sigma,\ -\alpha\ )\ ,</math> then the cumulative distribution function of <math>\ -\ln (-X)\ </math> is of type I, namely <math>\ F(\ x;\ -\ln \sigma,\ \tfrac{\ 1\ }{\alpha},\ 0\ ) ~.</math>
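The type II to type I link can be checked numerically without simulation, because the probability that ln X is at most y equals the type II distribution function evaluated at exp(y). The sketch below assumes the type II distribution function on the positive numbers has the form exp(−(x/σ)^(−α)); the function names and the parameter values are hypothetical.

```python
import math

def frechet_cdf(x, sigma, alpha):
    """Type II CDF supported on the positive numbers (assumed form)."""
    return math.exp(-((x / sigma) ** (-alpha))) if x > 0 else 0.0

def gumbel_cdf(y, loc, scale):
    """Type I CDF with location loc and scale scale."""
    return math.exp(-math.exp(-(y - loc) / scale))

sigma, alpha = 2.0, 3.0                   # hypothetical parameter values
checks = []
for y in (-1.0, 0.0, 0.5, 2.0):
    lhs = frechet_cdf(math.exp(y), sigma, alpha)        # P(ln X <= y)
    rhs = gumbel_cdf(y, math.log(sigma), 1.0 / alpha)   # type I with ln(sigma), 1/alpha
    checks.append(math.isclose(lhs, rhs, rel_tol=1e-9))
```

Every entry of `checks` should be `True`, reflecting that the two expressions are algebraically identical.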

Link to logit models (logistic regression)

Multinomial logit models, and certain other types of logistic regression, can be phrased as latent variable models with error variables distributed as Gumbel distributions (type I generalized extreme value distributions). This phrasing is common in the theory of discrete choice models, which include logit models, probit models, and various extensions of them, and derives from the fact that the difference of two type-I GEV-distributed variables follows a logistic distribution, of which the logit function is the quantile function. The type-I GEV distribution thus plays the same role in these logit models as the normal distribution does in the corresponding probit models.
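The latent variable phrasing can be illustrated with a small simulation of a two-alternative choice: each alternative has a fixed utility plus independent standard Gumbel noise, and the probability that the first alternative wins should match the logistic function of the utility difference. The utilities `v1` and `v2`, the seed and the sample size below are hypothetical values chosen for this sketch.

```python
import math
import random

def gumbel_sample(rng, loc=0.0, scale=1.0):
    """Draw one Gumbel (type I GEV) value by inverse transform sampling."""
    u = rng.random()
    while u == 0.0:                       # avoid log(0)
        u = rng.random()
    return loc - scale * math.log(-math.log(u))

def logistic_cdf(z, loc=0.0, scale=1.0):
    return 1.0 / (1.0 + math.exp(-(z - loc) / scale))

rng = random.Random(42)
v1, v2 = 1.0, 0.3                         # hypothetical utilities
n = 50_000
wins = sum(v1 + gumbel_sample(rng) > v2 + gumbel_sample(rng) for _ in range(n))
empirical = wins / n                      # simulated choice probability
predicted = logistic_cdf(v1 - v2)         # logit model prediction
```

With this many draws, `empirical` and `predicted` should agree to about two decimal places.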

Properties

The cumulative distribution function of the generalized extreme value distribution solves the stability postulate equation.[citation needed] The generalized extreme value distribution is a special case of a max-stable distribution, and is a transformation of a min-stable distribution.
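Max-stability can be verified numerically for the standardized distribution. The sketch assumes the stability postulate in the form F(s)^n = F(a_n s + b_n), with a_n = n^(−ξ) and b_n = (n^(−ξ) − 1)/ξ for ξ ≠ 0, and a_n = 1, b_n = −ln n for ξ = 0; these constants follow from the distribution function by direct algebra. All function names are hypothetical.

```python
import math

def gev_cdf_std(s, xi):
    """Standardized GEV CDF (mu = 0, sigma = 1)."""
    if xi == 0.0:
        return math.exp(-math.exp(-s))
    t = 1.0 + xi * s
    if t <= 0.0:
        return 0.0 if xi > 0 else 1.0
    return math.exp(-t ** (-1.0 / xi))

def stability_constants(n, xi):
    """Constants (a_n, b_n) with F(s)^n = F(a_n * s + b_n)."""
    if xi == 0.0:
        return 1.0, -math.log(n)
    a_n = n ** (-xi)
    return a_n, (a_n - 1.0) / xi

def max_stability_gap(s, xi, n):
    """Absolute difference between the two sides of the postulate."""
    a_n, b_n = stability_constants(n, xi)
    return abs(gev_cdf_std(s, xi) ** n - gev_cdf_std(a_n * s + b_n, xi))
```

For any shape parameter and any n, `max_stability_gap` should be zero up to floating-point rounding.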

Applications

The GEV distribution is widely used in the treatment of "tail risks" in fields ranging from insurance to finance. In the latter case, it has been considered as a means of assessing various financial risks via metrics such as value at risk.^[7]^[8]

Figure: GEV probability distribution fitted to monthly maximum one-day rainfalls in October, Surinam.^[9]

In financial applications, however, the resulting shape parameters have been found to lie in the range leading to undefined means and variances, which underlines the fact that reliable data analysis is often impossible.^[10]

In hydrology the GEV distribution is applied to extreme events such as annual maximum one-day rainfalls and river discharges.^[11] The figure, made with CumFreq, illustrates an example of fitting the GEV distribution to ranked annual maximum one-day rainfalls, showing also the 90% confidence belt based on the binomial distribution. The rainfall data are represented by plotting positions as part of the cumulative frequency analysis.

Example for Normally distributed variables

Let <math>\ \left\{\ X_i\ \big|\ 1 \le i \le n\ \right\}\ </math> be i.i.d. normally distributed random variables with mean 0 and variance 1. The Fisher–Tippett–Gnedenko theorem^[12] tells us that, approximately for large <math>\ n\ ,</math> <math>\ \max \{\ X_i\ \big|\ 1 \le i \le n\ \} \sim GEV(\mu_n, \sigma_n, 0)\ ,</math> where

<math> \begin{align}

  \mu_n &= \Phi^{-1}\left( 1 - \frac{\ 1\ }{ n } \right) \\
  \sigma_n &= \Phi^{-1}\left( 1 - \frac{ 1 }{\ n\ \mathrm{e}\ } \right)- \Phi^{-1}\left(1-\frac{\ 1\ }{ n } \right) ~.

\end{align} </math>

This allows us to estimate e.g. the mean of <math>\ \max \{\ X_i\ \big|\ 1 \le i \le n\ \}\ </math> from the mean of the GEV distribution:

<math> \begin{align} \operatorname{\mathbb E}\left\{\ \max\left\{\ X_i\ \big|\ 1 \le i \le n\ \right\}\ \right\} & \approx \mu_n + \gamma_{\mathsf E}\ \sigma_n \\ &= (1 - \gamma_{\mathsf E})\ \Phi^{-1}\left( 1 - \frac{\ 1\ }{ n } \right) + \gamma_{\mathsf E}\ \Phi^{-1}\left( 1 - \frac{ 1 }{\ n\ \mathrm{e}\ } \right) \\ &= \sqrt{\log \left(\frac{ n^2 }{\ 2 \pi\ \log \left(\frac{n^2}{2\pi} \right)\ }\right) ~}\ \cdot\ \left(1 + \frac{ \gamma_{\mathsf E} }{\ \log n\ } + o \left(\frac{ 1 }{\ \log n\ } \right) \right)\ , \end{align} </math>

where <math>\ \gamma_{\mathsf E}\ </math> is the Euler–Mascheroni constant.
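The approximation can be compared against a direct simulation. The sketch below computes the two normalizing constants with the standard library's inverse normal distribution function and estimates the expected maximum by Monte Carlo; the function names, the seed and the number of repetitions are assumptions made for this illustration.

```python
import math
import random
from statistics import NormalDist

EULER_GAMMA = 0.5772156649015329          # Euler-Mascheroni constant
phi_inv = NormalDist().inv_cdf            # inverse standard normal CDF

def normal_max_gev_params(n):
    """Location and scale of the approximating Gumbel law for the maximum."""
    mu_n = phi_inv(1.0 - 1.0 / n)
    sigma_n = phi_inv(1.0 - 1.0 / (n * math.e)) - mu_n
    return mu_n, sigma_n

def approx_mean_max(n):
    """GEV-based approximation mu_n + gamma_E * sigma_n."""
    mu_n, sigma_n = normal_max_gev_params(n)
    return mu_n + EULER_GAMMA * sigma_n

def simulated_mean_max(n, reps=2000, seed=1):
    """Monte Carlo estimate of E[max of n standard normal variables]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        total += max(rng.gauss(0.0, 1.0) for _ in range(n))
    return total / reps
```

For n = 100 the location constant is about 2.33, and the approximation and the simulated mean should both land near 2.5.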

Related distributions

1. If <math>\ X \sim \textrm{GEV}(\mu,\,\sigma,\,\xi)\ </math> then <math>\ m X + b \sim \textrm{GEV}(m \mu+b,\ m\sigma,\ \xi)\ </math> for <math>\ m > 0\ </math>
2. If <math>\ X \sim \textrm{Gumbel}(\mu,\ \sigma)\ </math> (Gumbel distribution) then <math>\ X \sim \textrm{GEV}(\mu,\,\sigma,\,0)\ </math>
3. If <math>\ X \sim \textrm{Weibull}(\sigma,\,\mu)\ </math> (Weibull distribution) then <math>\ \mu\left(1-\sigma\log \tfrac{X}{\sigma} \right) \sim \textrm{GEV}(\mu,\,\sigma,\,0)\ </math>
4. If <math>\ X \sim \textrm{GEV}(\mu,\,\sigma,\,0)\ </math> then <math>\ \sigma \exp (-\tfrac{X-\mu}{\mu \sigma} ) \sim \textrm{Weibull}(\sigma,\,\mu)\ </math> (Weibull distribution)
5. If <math>\ X \sim \textrm{Exponential}(1)\ </math> (Exponential distribution) then <math>\ \mu - \sigma \log X \sim \textrm{GEV}(\mu,\,\sigma,\,0)\ </math>
6. If <math>\ X \sim \mathrm{Gumbel}(\alpha_X, \beta)\ </math> and <math>\ Y \sim \mathrm{Gumbel}(\alpha_Y, \beta)\ </math> are independent then <math>\ X-Y \sim \mathrm{Logistic}(\alpha_X-\alpha_Y,\beta)\ </math> (see Logistic distribution).
7. If <math>\ X\ </math> and <math>\ Y \sim \mathrm{Gumbel}(\alpha, \beta)\ </math> then <math>\ X+Y \nsim \mathrm{Logistic}(2 \alpha,\beta)\ </math> (The sum is not a logistic distribution).

Note that <math>\ \operatorname{\mathbb E}\{\ X + Y\ \} = 2\alpha+2\beta\gamma \neq 2\alpha = \operatorname{\mathbb E}\left\{\ \operatorname{Logistic}(2 \alpha,\beta)\ \right\} ~.</math>

Proofs

3. Let <math>\ X \sim \textrm{ Weibull }(\sigma,\,\mu)\ ,</math> with scale <math>\ \sigma > 0\ </math> and shape <math>\ \mu > 0\ ;</math> then the cumulative distribution of <math>\ g(X) = \mu\left(1-\sigma\log\frac{X}{\sigma} \right)\ </math> is:

<math>

\begin{align} \operatorname{\mathbb P}\left\{\ \mu \left(1-\sigma\log\frac{\ X\ }{ \sigma } \right) < x\ \right\} &= \operatorname{\mathbb P}\left\{\ \log\frac{X}{\sigma} > \frac{1 - x/\mu}{\sigma}\ \right\} \\ {} \\ & \mathsf{\ (the\ inequality\ reverses\ on\ dividing\ by\ } -\sigma < 0 \mathsf{)} \\ {} \\ & \mathsf{\ Since\ the\ logarithm\ is\ always\ increasing:\ } \\ {} \\ &= \operatorname{\mathbb P}\left\{\ X > \sigma \exp\left[ \frac{1 - x/\mu}{\sigma} \right]\ \right\} \\ &= \exp\left( - \left( \exp\left[ \frac{1 - x/\mu}{\sigma} \right] \right)^\mu \right) \\ &= \exp\left( - \exp\left[ \frac{\mu - x}{\sigma} \right] \right) \\ &= \exp\left( - \exp\left[ - s \right] \right), \quad s = \frac{x - \mu}{\sigma}\ , \end{align} </math>

which is the cumulative distribution of <math>\ \textrm{GEV}(\mu,\,\sigma,\,0) ~.</math>

5. Let <math>\ X \sim \textrm{Exponential}(1)\ ,</math> then the cumulative distribution of <math>\ g(X) = \mu - \sigma \log X\ </math> is:

<math>

\begin{align} \operatorname{\mathbb P}\left\{\ \mu - \sigma \log X < x\ \right\} &= \operatorname{\mathbb P}\left\{\ \log X > \frac{\mu - x}{\sigma}\ \right\} \\ {} \\ & \mathsf{\ (the\ inequality\ reverses\ on\ dividing\ by\ } -\sigma < 0 \mathsf{)} \\ {} \\ & \mathsf{\ Since\ the\ logarithm\ is\ always\ increasing:\ } \\ {} \\ &= \operatorname{\mathbb P}\left\{\ X > \exp\left( \frac{\ \mu - x\ }{ \sigma } \right)\ \right\} \\ &= \exp\left[- \exp\left( \frac{\ \mu - x\ }{ \sigma } \right) \right] \\ &= \exp\left[- \exp(-s) \right]\ , \quad ~\mathsf{ where }~ \quad s \equiv \frac{x - \mu}{\sigma}\ ; \end{align} </math>

which is the cumulative distribution of <math>\ \operatorname{GEV}(\mu, \sigma, 0) ~.</math>
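Both transformations can also be checked by simulation: transform draws from the exponential and Weibull distributions and compare the empirical distribution function with the Gumbel one. The sketch assumes Weibull(σ, μ) means scale σ and shape μ, which matches the argument order of `random.weibullvariate(scale, shape)`; the parameter values, seed and sample size are hypothetical.

```python
import math
import random

def gumbel_cdf(x, mu, sigma):
    """CDF of GEV(mu, sigma, 0)."""
    return math.exp(-math.exp(-(x - mu) / sigma))

def empirical_cdf(samples, x):
    """Fraction of samples that are at most x."""
    return sum(v <= x for v in samples) / len(samples)

mu, sigma = 2.0, 1.5                      # hypothetical parameter values
n = 20_000
rng = random.Random(7)

# Exponential(1) draws mapped through mu - sigma * log(X).
from_exponential = [mu - sigma * math.log(rng.expovariate(1.0))
                    for _ in range(n)]

# Weibull draws (scale sigma, shape mu) mapped through
# mu * (1 - sigma * log(X / sigma)).
from_weibull = [mu * (1.0 - sigma * math.log(rng.weibullvariate(sigma, mu) / sigma))
                for _ in range(n)]
```

At any fixed point, `empirical_cdf(from_exponential, x)` and `empirical_cdf(from_weibull, x)` should both be close to `gumbel_cdf(x, mu, sigma)`.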

