English Wikipedia: Huber loss

In statistics, the Huber loss is a loss function used in robust regression that is less sensitive to outliers in the data than the squared error loss. A variant for classification is also sometimes used.

Definition

Figure (Huber loss.svg): Huber loss (green, <math>\delta=1</math>) and squared error loss (blue) as a function of <math>y - f(x)</math>.

The Huber loss function describes the penalty incurred by an estimation procedure <math>f</math>. Huber (1964) defines the loss function piecewise by^[1]

<math>
L_\delta (a) = \begin{cases}
\frac{1}{2}a^2 & \text{for } |a| \le \delta, \\
\delta \cdot \left(|a| - \frac{1}{2}\delta\right), & \text{otherwise.}
\end{cases}
</math>

This function is quadratic for small values of <math>a</math> and linear for large values, with the two pieces agreeing in both value and slope at the two points where <math>|a| = \delta</math>. The variable <math>a</math> often refers to the residual, that is, the difference between the observed and predicted values, <math>a = y - f(x)</math>, so the definition can be expanded to^[2]

<math>
L_\delta(y, f(x)) = \begin{cases}
\frac{1}{2}(y - f(x))^2 & \text{for } |y - f(x)| \le \delta, \\
\delta \cdot \left(|y - f(x)| - \frac{1}{2}\delta\right), & \text{otherwise.}
\end{cases}
</math>
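The piecewise definition translates almost line for line into code. The following is a minimal sketch in plain Python; the function name `huber_loss`, its argument names, and the default `delta = 1.0` are illustrative choices, not part of the definition.

```python
def huber_loss(y: float, prediction: float, delta: float = 1.0) -> float:
    """Huber loss of the residual a = y - prediction (piecewise definition)."""
    if delta <= 0:
        raise ValueError("delta must be positive")
    a = abs(y - prediction)
    if a <= delta:
        return 0.5 * a * a            # quadratic branch, |a| <= delta
    return delta * (a - 0.5 * delta)  # linear branch, |a| > delta


# A small residual falls in the quadratic branch, a large one in the linear branch.
small = huber_loss(1.0, 0.5)   # residual 0.5
large = huber_loss(4.0, 1.0)   # residual 3.0
```

At the boundary <math>|a| = \delta</math> both branches give <math>\delta^2/2</math>, so it does not matter which branch handles the equality case.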

The Huber loss is the convolution of the absolute value function with the rectangular function, scaled and translated. It thus smooths out the corner of the absolute value function at the origin.
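One way to make the convolution statement concrete, under the assumption (not stated above) that the rectangular function is the box of half-width <math>\delta</math> normalized to unit area, <math>r(t) = \tfrac{1}{2\delta}</math> for <math>|t| \le \delta</math> and <math>0</math> otherwise:

<math>
(|\cdot| * r)(a) = \frac{1}{2\delta}\int_{a-\delta}^{a+\delta} |t| \, dt = \begin{cases}
\frac{a^2 + \delta^2}{2\delta} & \text{for } |a| \le \delta, \\
|a| & \text{otherwise,}
\end{cases}
\qquad
L_\delta(a) = \delta \left( (|\cdot| * r)(a) - \frac{\delta}{2} \right).
</math>

Under this choice the scaling is the factor <math>\delta</math> and the translation is the subtracted constant; substituting either case of the convolution recovers the corresponding branch of the piecewise definition.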

Figure (Comparison of loss functions.png): Comparison of Huber loss with other loss functions used for robust regression.

Motivation

Two very commonly used loss functions are the squared loss, <math>L(a) = a^2</math>, and the absolute loss, <math>L(a)=|a|</math>. The squared loss results in an arithmetic mean-unbiased estimator, and the absolute loss results in a median-unbiased estimator (in the one-dimensional case; a geometric median-unbiased estimator in the multi-dimensional case). The squared loss has the disadvantage that it tends to be dominated by outliers: when summing over a set of <math>a</math>'s (as in <math display="inline">\sum_{i=1}^n L(a_i) </math>), the sample mean is influenced too much by a few particularly large <math>a</math>-values when the distribution is heavy-tailed. In terms of estimation theory, the asymptotic relative efficiency of the mean is poor for heavy-tailed distributions.

As defined above, the Huber loss function is strongly convex in a uniform neighborhood of its minimum <math>a=0</math>; at the boundary of this uniform neighborhood, the Huber loss function has a differentiable extension to an affine function at points <math> a=-\delta </math> and <math> a = \delta </math>. These properties allow it to combine much of the sensitivity of the mean-unbiased, minimum-variance estimator of the mean (using the quadratic loss function) and the robustness of the median-unbiased estimator (using the absolute value function).
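The trade-off can be illustrated on a toy location problem. The sketch below is an illustration under stated assumptions, not a method prescribed by the text: it estimates a single location parameter by minimizing the summed Huber loss with iteratively reweighted averaging (one standard approach among several), and compares the result with the sample mean and median. The function name `huber_location`, the data, and the iteration count are all hypothetical choices.

```python
from statistics import mean, median


def huber_location(xs: list[float], delta: float = 1.0, iterations: int = 100) -> float:
    """Location estimate minimizing the summed Huber loss of the residuals x - mu."""
    mu = median(xs)  # robust starting point
    for _ in range(iterations):
        # Weight 1 in the quadratic zone, delta/|r| in the linear zone,
        # so that weight * residual equals the derivative of the Huber loss.
        weights = [1.0 if abs(x - mu) <= delta else delta / abs(x - mu) for x in xs]
        mu = sum(w * x for w, x in zip(weights, xs)) / sum(weights)
    return mu


data = [2.6, 2.8, 3.0, 3.2, 3.4, 100.0]  # five inliers and one gross outlier
estimates = (mean(data), median(data), huber_location(data))
```

On this data the mean is pulled above 19 by the single outlier, while the Huber estimate stays close to the inliers: the outlier contributes only a bounded pull of size <math>\delta</math>, whereas the inliers are averaged as under the squared loss.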

Pseudo-Huber loss function

The Pseudo-Huber loss function can be used as a smooth approximation of the Huber loss function. It combines the best properties of the L2 squared loss and the L1 absolute loss by being strongly convex close to the target/minimum and less steep for extreme values. The <math>\delta</math> value controls both the scale at which the Pseudo-Huber loss transitions from L2-like behavior near the minimum to L1-like behavior for extreme values, and the steepness at extreme values. The Pseudo-Huber loss function has continuous derivatives of all orders. It is defined as^[3]^[4]

<math>L_\delta (a) = \delta^2\left(\sqrt{1+(a/\delta)^2}-1\right).</math>

As such, this function approximates <math>a^2/2</math> for small values of <math>a</math>, and approximates a straight line with slope <math>\delta</math> for large values of <math>a</math>.
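Both limiting behaviors can be checked numerically. The following is a small sketch in plain Python; the function name `pseudo_huber`, the choice `delta = 2.0`, and the probe points are illustrative assumptions.

```python
import math


def pseudo_huber(a: float, delta: float = 1.0) -> float:
    """Pseudo-Huber loss: delta^2 * (sqrt(1 + (a/delta)^2) - 1)."""
    return delta ** 2 * (math.sqrt(1.0 + (a / delta) ** 2) - 1.0)


delta = 2.0
small_a, large_a = 0.01, 1000.0

# Near zero the loss should be close to a^2 / 2, so this ratio is close to 1.
small_ratio = pseudo_huber(small_a, delta) / (0.5 * small_a ** 2)

# Far from zero the loss grows by about delta per unit step in a.
slope = pseudo_huber(large_a + 1.0, delta) - pseudo_huber(large_a, delta)
```

Note that for large <math>|a|</math> the Pseudo-Huber loss approaches <math>\delta |a| - \delta^2</math>, which has the same slope as the linear branch of the Huber loss but a different offset.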

While the above is the most common form, other smooth approximations of the Huber loss function also exist.^[5]

Variant for classification

For classification purposes, a variant of the Huber loss called modified Huber is sometimes used. Given a prediction <math>f(x)</math> (a real-valued classifier score) and a true binary class label <math>y \in \{+1, -1\}</math>, the modified Huber loss is defined as^[6]

<math>
L(y, f(x)) = \begin{cases}
\max(0, 1 - y \, f(x))^2 & \text{for } y \, f(x) > -1, \\
-4y \, f(x) & \text{otherwise.}
\end{cases}
</math>

The term <math>\max(0, 1 - y \, f(x))</math> is the hinge loss used by support vector machines; the quadratically smoothed hinge loss is a generalization of <math>L</math>.^[6]
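As a rough sketch of the definition above in plain Python (the function name `modified_huber` and the argument name `score` for the classifier output <math>f(x)</math> are illustrative):

```python
def modified_huber(y: int, score: float) -> float:
    """Modified Huber loss for a label y in {+1, -1} and a real-valued score f(x)."""
    if y not in (1, -1):
        raise ValueError("y must be +1 or -1")
    margin = y * score
    if margin > -1.0:
        return max(0.0, 1.0 - margin) ** 2  # squared hinge branch
    return -4.0 * margin                    # linear branch for badly wrong scores


confident_correct = modified_huber(+1, 2.0)  # margin 2: no loss
inside_margin = modified_huber(+1, 0.5)      # margin 0.5: squared hinge
badly_wrong = modified_huber(-1, 3.0)        # margin -3: linear branch
```

The two branches meet at <math>y \, f(x) = -1</math>, where both evaluate to 4, so the loss is continuous there.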

Applications

The Huber loss function is used in robust statistics, M-estimation and additive modelling.^[7]

References

[1] Huber (1964). Journal article.

[2] Hastie et al. Book. Compared to Hastie et al., the loss is scaled by a factor of ½, to be consistent with Huber's original definition given earlier.

[3] Journal article.

[4] Book.

[5] Journal article.

[6] Zhang. Conference paper.

[7] Journal article.
