Causal graph

In statistics, econometrics, epidemiology, genetics and related disciplines, causal graphs (also known as path diagrams, causal Bayesian networks or DAGs) are probabilistic graphical models used to encode assumptions about the data-generating process.

Causal graphs can be used for communication and for inference. They are complementary to other forms of causal reasoning, for instance using causal equality notation. As communication devices, the graphs provide a formal and transparent representation of the causal assumptions that researchers may wish to convey and defend. As inference tools, the graphs enable researchers to estimate effect sizes from non-experimental data,^[1]^[2]^[3]^[4]^[5] derive testable implications of the assumptions encoded,^[1]^[6]^[7]^[8] test for external validity,^[9] and manage missing data^[10] and selection bias.^[11]

Causal graphs were first used by the geneticist Sewall Wright^[12] under the rubric "path diagrams". They were later adopted by social scientists^[13]^[14]^[15]^[16]^[17]^[18] and, to a lesser extent, by economists.^[19] These models were initially confined to linear equations with fixed parameters. Modern developments have extended graphical models to non-parametric analysis, and thus achieved a generality and flexibility that has transformed causal analysis in computer science, epidemiology,^[20] and social science.^[21]

Construction and terminology

The causal graph can be drawn in the following way. Each variable in the model has a corresponding vertex or node and an arrow is drawn from a variable X to a variable Y whenever Y is judged to respond to changes in X when all other variables are being held constant. Variables connected to Y through direct arrows are called parents of Y, or "direct causes of Y," and are denoted by Pa(Y).
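The construction rule above can be sketched in code. The following is a minimal illustration, not a reference implementation: it assumes a hypothetical three-variable model (Z causes X and Y, and X causes Y), stores the graph as a mapping from each variable to Pa(Y), and uses the helper names `edges` and `is_acyclic`, which are introduced here only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical model: Z -> X, Z -> Y, X -> Y.
# Each variable is mapped to its list of parents Pa(variable).
parents: Dict[str, List[str]] = {
    "Z": [],
    "X": ["Z"],
    "Y": ["Z", "X"],
}

def edges(pa: Dict[str, List[str]]) -> List[Tuple[str, str]]:
    """Return the arrows (cause, effect) implied by the parent map."""
    return sorted((p, child) for child, ps in pa.items() for p in ps)

def is_acyclic(pa: Dict[str, List[str]]) -> bool:
    """Check that the arrows form a DAG by repeatedly removing root nodes."""
    nodes = set(pa) | {p for ps in pa.values() for p in ps}
    remaining = {v: set(pa.get(v, ())) for v in nodes}
    while remaining:
        roots = [v for v, ps in remaining.items() if not ps]
        if not roots:
            return False  # every remaining node has a parent, so a cycle exists
        for r in roots:
            del remaining[r]
        for ps in remaining.values():
            ps.difference_update(roots)
    return True
```

Storing parents rather than children mirrors the Pa(Y) notation directly; the list of arrows can be recovered with `edges(parents)` whenever a drawing is needed.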

Causal models often include "error terms" or "omitted factors" which represent all unmeasured factors that influence a variable Y when Pa(Y) are held constant. In most cases, error terms are excluded from the graph. However, if the graph author suspects that the error terms of any two variables are dependent (e.g. the two variables have an unobserved or latent common cause) then a bidirected arc is drawn between them. Thus, the presence of latent variables is taken into account through the correlations they induce between the error terms, as represented by bidirected arcs.

Fundamental tools

A fundamental tool in graphical analysis is d-separation, which allows researchers to determine, by inspection, whether the causal structure implies that two sets of variables are independent given a third set. In recursive models without correlated error terms (sometimes called Markovian), these conditional independences represent all of the model's testable implications.^[22]
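A d-separation query can also be answered mechanically. The sketch below assumes a DAG without bidirected arcs, given as a mapping from each variable to its parents, and uses an equivalent check based on the moralized ancestral graph: keep only the ancestors of the variables involved, connect parents that share a child, drop arrow directions, delete the conditioning set, and test whether the two sets are still connected. The function names and the four-variable example graph are hypothetical.

```python
from itertools import combinations
from typing import Dict, Iterable, List, Set

def ancestors_of(pa: Dict[str, List[str]], nodes: Iterable[str]) -> Set[str]:
    """Return the given nodes together with all of their ancestors."""
    seen: Set[str] = set()
    stack = list(nodes)
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            stack.extend(pa.get(v, []))
    return seen

def d_separated(pa: Dict[str, List[str]], xs: Iterable[str],
                ys: Iterable[str], zs: Iterable[str]) -> bool:
    """True if xs and ys are d-separated given zs in the DAG described by pa."""
    xs, ys, zs = set(xs), set(ys), set(zs)
    keep = ancestors_of(pa, xs | ys | zs)
    # Build the undirected moral graph on the ancestral set.
    adj: Dict[str, Set[str]] = {v: set() for v in keep}
    for child in keep:
        ps = pa.get(child, [])
        for p in ps:
            adj[p].add(child)
            adj[child].add(p)
        for p, q in combinations(ps, 2):  # "marry" parents of a common child
            adj[p].add(q)
            adj[q].add(p)
    # Search from xs while never entering the conditioning set zs.
    seen = set(xs - zs)
    stack = list(seen)
    while stack:
        v = stack.pop()
        if v in ys:
            return False
        for w in adj[v]:
            if w not in seen and w not in zs:
                seen.add(w)
                stack.append(w)
    return True

# Hypothetical graph: Z -> X, Z -> Y (a fork) and X -> W <- Y (a collider).
example = {"Z": [], "X": ["Z"], "Y": ["Z"], "W": ["X", "Y"]}
```

In this example, X and Y are dependent marginally (through Z), independent given Z, and dependent again once the collider W is added to the conditioning set.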

Example

Suppose we wish to estimate the effect of attending an elite college on future earnings. Simply regressing earnings on college rating will not give an unbiased estimate of the target effect because elite colleges are highly selective, and students attending them are likely to have qualifications for high-earning jobs prior to attending the school. Assuming that the causal relationships are linear, this background knowledge can be expressed in the following structural equation model (SEM) specification.

Model 1

<math>
\begin{align}
Q_1 &= U_1\\
C &= a \cdot Q_1 + U_2\\
Q_2 &= c \cdot C + d \cdot Q_1 + U_3\\
S &= b \cdot C + e \cdot Q_2 + U_4,
\end{align}
</math>

where <math>Q_1</math> represents the individual's qualifications prior to college, <math>Q_2</math> represents qualifications after college, <math>C</math> contains attributes representing the quality of the college attended, and <math>S</math> the individual's salary.
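The bias of the naive regression can be illustrated by simulating Model 1. The sketch below makes assumptions that are not part of the model as stated: all error terms are independent standard normal draws, and the coefficient values are arbitrary. Under these assumptions the total effect of C on S is b + c·e, while the population slope of S on C is larger by a·d·e/(a² + 1) because Q_1 influences both C and S.

```python
import random
from typing import List, Tuple

def simulate_model1(n: int, a: float, b: float, c: float, d: float,
                    e: float, seed: int = 0) -> Tuple[List[float], List[float]]:
    """Draw n samples of (C, S) from Model 1 with standard normal errors (assumed)."""
    rng = random.Random(seed)
    college, salary = [], []
    for _ in range(n):
        q1 = rng.gauss(0, 1)                      # Q_1 = U_1
        col = a * q1 + rng.gauss(0, 1)            # C = a*Q_1 + U_2
        q2 = c * col + d * q1 + rng.gauss(0, 1)   # Q_2 = c*C + d*Q_1 + U_3
        sal = b * col + e * q2 + rng.gauss(0, 1)  # S = b*C + e*Q_2 + U_4
        college.append(col)
        salary.append(sal)
    return college, salary

def ols_slope(x: List[float], y: List[float]) -> float:
    """Slope of the simple least-squares regression of y on x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx
```

Calling `ols_slope(*simulate_model1(20000, 1.0, 0.5, 0.8, 0.7, 0.6))` and comparing the result with b + c·e = 0.98 shows the upward bias produced by the unobserved qualifications.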

File:College notID.png

Figure 1: Unidentified model with latent variables (<math>Q_1</math> and <math> Q_2 </math>) shown explicitly

File:College notID proj.png

Figure 2: Unidentified model with latent variables summarized

Figure 1 is a causal graph that represents this model specification. Each variable in the model has a corresponding node or vertex in the graph. Additionally, for each equation, arrows are drawn from the independent variables to the dependent variables. These arrows reflect the direction of causation. In some cases, we may label the arrow with its corresponding structural coefficient as in Figure 1.

If <math>Q_1</math> and <math>Q_2</math> are unobserved or latent variables their influence on <math>C</math> and <math>S</math> can be attributed to their error terms. By removing them, we obtain the following model specification:

Model 2

<math>
\begin{align}
C &= U_C \\
S &= \beta C + U_S
\end{align}
</math>

The background information specified by Model 1 implies that the error term of <math>S</math>, <math>U_S</math>, is correlated with the error term of <math>C</math>, <math>U_C</math>. As a result, we add a bidirected arc between <math>S</math> and <math>C</math>, as in Figure 2.
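The source of this correlation can be made explicit by substituting the equations of Model 1 into one another. The following derivation is a sketch under two assumptions not stated in the text: the error terms <math>U_1, \dots, U_4</math> of Model 1 are mutually independent, and <math>\beta</math> is read as the total effect of <math>C</math> on <math>S</math>.

<math>
\begin{align}
S &= b \cdot C + e \cdot (c \cdot C + d \cdot Q_1 + U_3) + U_4 = (b + c e) \cdot C + (d e \cdot Q_1 + e \cdot U_3 + U_4)\\
\beta &= b + c e, \qquad U_S = d e \cdot Q_1 + e \cdot U_3 + U_4, \qquad U_C = a \cdot Q_1 + U_2\\
\operatorname{Cov}(U_C, U_S) &= a d e \cdot \operatorname{Var}(Q_1)
\end{align}
</math>

The covariance is non-zero whenever <math>a</math>, <math>d</math> and <math>e</math> are all non-zero, that is, whenever prior qualifications affect both college quality and, through later qualifications, salary.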

File:College.png

Figure 3: Identified model with latent variables (<math>Q_1</math> and <math> Q_2 </math>) shown explicitly

File:College proj.png

Figure 4: Identified model with latent variables summarized

Since <math>U_S</math> is correlated with <math>U_C</math> and, therefore, <math>C</math>, <math>C</math> is endogenous and <math>\beta</math> is not identified in Model 2. However, if we include the strength of an individual's college application, <math>A</math>, as shown in Figure 3, we obtain the following model:

Model 3

<math>
\begin{align}
Q_1 &= U_1\\
A &= a \cdot Q_1 + U_2 \\
C &= b \cdot A + U_3\\
Q_2 &= e \cdot Q_1 + d \cdot C + U_4\\
S &= c \cdot C + f \cdot Q_2 + U_5.
\end{align}
</math>

By removing the latent variables from the model specification we obtain:

Model 4

<math>
\begin{align}
A &= U_A \\
C &= b \cdot A + U_C\\
S &= \beta \cdot C + U_S,
\end{align}
</math>

with <math>U_A</math> correlated with <math>U_S</math>.

Now, <math>\beta</math> is identified and can be estimated using the regression of <math>S</math> on <math>C</math> and <math>A</math>. This can be verified using the single-door criterion,^[1]^[23] a necessary and sufficient graphical condition for the identification of structural coefficients, like <math>\beta</math>, using regression.
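The effect of adjusting for <math>A</math> can be checked numerically by simulating Model 3. As a sketch, the code below assumes independent standard normal error terms and arbitrary coefficient values, neither of which is specified by the model. Under these assumptions the total effect of C on S is β = c + f·d; the coefficient of C in the regression of S on C and A should be close to β, whereas the regression of S on C alone should not.

```python
import random
from typing import List, Tuple

def simulate_model3(n: int, a: float, b: float, c: float, d: float, e: float,
                    f: float, seed: int = 0
                    ) -> Tuple[List[float], List[float], List[float]]:
    """Draw n samples of (A, C, S) from Model 3 with standard normal errors (assumed)."""
    rng = random.Random(seed)
    apps, colleges, salaries = [], [], []
    for _ in range(n):
        q1 = rng.gauss(0, 1)                      # Q_1 = U_1
        app = a * q1 + rng.gauss(0, 1)            # A = a*Q_1 + U_2
        col = b * app + rng.gauss(0, 1)           # C = b*A + U_3
        q2 = e * q1 + d * col + rng.gauss(0, 1)   # Q_2 = e*Q_1 + d*C + U_4
        sal = c * col + f * q2 + rng.gauss(0, 1)  # S = c*C + f*Q_2 + U_5
        apps.append(app)
        colleges.append(col)
        salaries.append(sal)
    return apps, colleges, salaries

def centered(v: List[float]) -> List[float]:
    m = sum(v) / len(v)
    return [x - m for x in v]

def dot(u: List[float], v: List[float]) -> float:
    return sum(x * y for x, y in zip(u, v))

def coef_on_first(x1: List[float], x2: List[float], y: List[float]) -> float:
    """OLS coefficient of x1 when y is regressed on x1, x2 and an intercept."""
    x1, x2, y = centered(x1), centered(x2), centered(y)
    s11, s22, s12 = dot(x1, x1), dot(x2, x2), dot(x1, x2)
    s1y, s2y = dot(x1, y), dot(x2, y)
    # Solve the 2x2 normal equations for the first coefficient.
    return (s22 * s1y - s12 * s2y) / (s11 * s22 - s12 ** 2)

def simple_slope(x: List[float], y: List[float]) -> float:
    """Slope of the regression of y on x alone."""
    x, y = centered(x), centered(y)
    return dot(x, y) / dot(x, x)
```

With `A, C, S = simulate_model3(20000, 1.0, 1.0, 0.5, 0.8, 0.7, 0.6)`, the value of `coef_on_first(C, A, S)` can be compared with β = 0.98 and with `simple_slope(C, S)`.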

References


[1] Template:Cite book

[2] Template:Cite book

[3] Template:Cite journal

[4] Template:Cite journal

[5] Template:Cite book

[6] Template:Cite book

[7] Template:Cite journal

[8] Template:Cite journal

[9] Template:Cite journal

[10] Template:Cite journal

[11] Template:Cite journal

[12] Template:Cite journal

[13] Template:Cite journal

[14] Template:Cite journal

[15] Template:Cite journal

[16] Template:Cite journal

[17] Template:Cite book

[18] Template:Cite journal

[19] Template:Cite journal

[20] Template:Cite book

[21] Template:Cite book

[22] Template:Cite journal

[23] Template:Cite journal
