Английская Википедия:Bayes error rate

In statistical classification, Bayes error rate is the lowest possible error rate for any classifier of a random outcome (into, for example, one of two categories) and is analogous to the irreducible error.^[1]^[2]

A number of approaches to the estimation of the Bayes error rate exist. One method seeks to obtain analytical bounds which are inherently dependent on distribution parameters, and hence difficult to estimate. Another approach focuses on class densities, while yet another method combines and compares various classifiers.^[2]

The Bayes error rate finds important use in the study of patterns and machine learning techniques.^[3]

Error determination

In terms of machine learning and pattern classification, the labels of a set of random observations can be divided into 2 or more classes. Each observation is called an instance and the class it belongs to is the label. The Bayes error rate of the data distribution is the probability an instance is misclassified by a classifier that knows the true class probabilities given the predictors.

For a multiclass classifier, the expected prediction error may be calculated as follows:^[3]

where x is the instance, <math>E[]</math> the expectation value, C_k is a class into which an instance is classified, P(C_k|x) is the conditional probability of label k for instance x, and L() is the 0–1 loss function:

<math>L(x,y)= 1-\delta_{x,y}=\begin{cases}0 & \text{if } x=y \\ 1 & \text{if } x\neq y \end{cases},</math>

where <math>\delta_{x,y}</math> is the Kronecker delta.

When the learner knows the conditional probability, then one solution is:

This solution is known as the Bayes classifier.

The corresponding expected Prediction Error is called the Bayes error rate:

<math> BE = E_x[\sum_{k=1}^K L(C_k, \hat{C}_B(x))P(C_k|x)] = E_x[\sum_{k=1, \ C_k \neq \hat{C}_B(x) }^K P(C_k|x)] = E_x[1-P(\hat{C}_B(x)|x)] </math>,

where the sum can be omitted in the last step due to considering the counter event. By the definition of the Bayes classifier, it maximizes <math>P(\hat{C}_B(x)|x)</math> and, therefore, minimizes the Bayes error BE.

The Bayes error is non-zero if the classification labels are not deterministic, i.e., there is a non-zero probability of a given instance belonging to more than one class.^[4] In a regression context with squared error, the Bayes error is equal to the noise variance.^[3]

Proof of Minimality

Proof that the Bayes error rate is indeed the minimum possible and that the Bayes classifier is therefore optimal, may be found together on the Wikipedia page Bayes classifier.

Plug-in Rules for Binary Classifiers

A plug-in rule uses an estimate of the posterior probability <math>\eta</math> to form a classification rule. Given an estimate <math>\tilde \eta</math>, the excess Bayes error rate of the associated classifier is bounded above by:

<math display='block'>2 \mathbb E [|\eta(X) - \tilde \eta (X)|].</math>

To see this, note that the excess Bayes error is equal to 0 where the classifiers agree, and equal to <math>2|\eta(X) - 1/2|</math> where they disagree. To form the bound, notice that <math>\tilde \eta</math> is at least as far as <math>1/2</math> when the classifiers disagree.

References

Шаблон:Reflist

Шаблон:Statistics-stub

↑ Шаблон:Cite book
↑ ^{Перейти обратно: 2,0} ^2,1 K. Tumer, K. (1996) "Estimating the Bayes error rate through classifier combining" in Proceedings of the 13th International Conference on Pattern Recognition, Volume 2, 695–699
↑ ^{Перейти обратно: 3,0} ^3,1 ^3,2 Шаблон:Cite book
↑ Шаблон:Cite book

[stat-1] Шаблон:Cite book

[Tumer-2] {Перейти обратно: 2,0} ^2,1 K. Tumer, K. (1996) "Estimating the Bayes error rate through classifier combining" in Proceedings of the 13th International Conference on Pattern Recognition, Volume 2, 695–699

[ESL-3] {Перейти обратно: 3,0} ^3,1 ^3,2 Шаблон:Cite book

[4] Шаблон:Cite book

[1]

[2]

[3]

[4]

развернуть Партнерские ресурсы
Криптовалюты	Обмен криптовалют - www.bestchange.ru Криптовалютная биржа CoinEx Криптовалютная биржа Binance HIVE OS - операционная система для майнинга e4pool - Мультивалютный пул для майнинга.
Магазины	AliExpress — глобальная виртуальная (в Интернете) торговая площадка, предоставляющая возможность покупать товары производителей из КНР; computeruniverse.net - Интернет-магазин компьютеров(Промо код 5 Евро на первую покупку:FWWC3ZKQ);
Хостинг	DigitalOcean - американский провайдер облачных инфраструктур, с главным офисом в Нью-Йорке и с центрами обработки данных по всему миру;
Разное	Викиум - Онлайн-тренажер для мозга Like Центр - Центр поддержки и развития предпринимательства. Gamersbay - лучший магазин по бустингу для World of Warcraft. Ноотропы OmniMind N°1 - Усиливает мозговую активность. Повышает мотивацию. Улучшает память. Санкт-Петербургская школа телевидения - это федеральная сеть образовательных центров, которая имеет филиалы в 37 городах России. Lingualeo.com — интерактивный онлайн-сервис для изучения и практики английского языка в увлекательной игровой форме. Junyschool (Джунискул) – международная школа программирования и дизайна для детей и подростков от 5 до 17 лет, где ученики осваивают компьютерную грамотность, развивают алгоритмическое и креативное мышление, изучают основы программирования и компьютерной графики, создают собственные проекты: игры, сайты, программы, приложения, анимации, 3D-модели, монтируют видео. Умназия - Интерактивные онлайн-курсы и тренажеры для развития мышления детей 6-13 лет SkillBox - это один из лидеров российского рынка онлайн-образования. Среди партнеров Skillbox ведущий разработчик сервисного дизайна AIC, медиа-компания Yoola, первое и самое крупное русскоязычное аналитическое агентство Tagline, онлайн-школа дизайна и иллюстрации Bang! Bang! Education, оператор PR-рынка PACO, студия рисования Draw&Go, агентство performance-маркетинга Ingate, scrum-студия Sibirix, имидж-лаборатория Персона. «Нетология» — это университет по подготовке и дополнительному обучению специалистов в области интернет-маркетинга, управления проектами и продуктами, дизайна, Data Science и разработки. В рамках Нетологии студенты получают ценные теоретические знания от лучших экспертов Рунета, выполняют практические задания на отработку полученных навыков, общаются с экспертами и единомышленниками. Познакомиться со всеми продуктами подробнее можно на сайте https://netology.ru, линейка курсов и профессий постоянно обновляется. StudyBay Brazil – это онлайн биржа для португалоговорящих студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт. Автор24 — самая большая в России площадка по написанию учебных работ: контрольные и курсовые работы, дипломы, рефераты, решение задач, отчеты по практике, а так же любой другой вид работы. Сервис сотрудничает с более 70 000 авторов. Более 1 000 000 работ уже выполнено. StudyBay – это онлайн биржа для англоязычных студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт.

Английская Википедия:Bayes error rate

Содержание

Error determination

Proof of Minimality

Plug-in Rules for Binary Classifiers

See also

References

Навигация

Действия на странице

Действия на странице

Персональные инструменты

Навигация

Поиск

Инструменты