Английская Википедия:1000 Genomes Project

Шаблон:Short description Шаблон:Update The 1000 Genomes Project (abbreviated as 1KGP), taken place from January 2008 to 2015, was an international research effort to establish the most detailed catalogue of human genetic variation at the time. Scientists planned to sequence the genomes of at least one thousand anonymous healthy participants from a number of different ethnic groups within the following three years, using advancements in newly developed technologies. In 2010, the project finished its pilot phase, which was described in detail in a publication in the journal Nature.^[1] In 2012, the sequencing of 1092 genomes was announced in a Nature publication.^[2] In 2015, two papers in Nature reported results and the completion of the project and opportunities for future research.^[3]^[4]

Many rare variations, restricted to closely related groups, were identified, and eight structural-variation classes were analyzed.^[5]

The project united multidisciplinary research teams from institutes around the world, including China, Italy, Japan, Kenya, Nigeria, Peru, the United Kingdom, and the United States contributing to the sequence dataset and to a refined human genome map freely accessible through public databases to the scientific community and the general public alike.^[2]

The International Genome Sample Resource was created to host and expand on the data set after the project's end.^[6]

Файл:Genetic Variation.jpg

Changes in the number and order of genes (A-D) create genetic diversity within and between populations.

Background

Since the completion of the Human Genome Project advances in human population genetics and comparative genomics enabled further insight into genetic diversity.^[7] The understanding about structural variations (insertions/deletions (indels), copy number variations (CNV), retroelements), single-nucleotide polymorphisms (SNPs), and natural selection were being solidified.^[8]^[9]^[10]^[11]

The diversity of Human genetic variation such as that Indels were being uncovered and investigating human genomic variationsШаблон:Cn

Natural selection

It also aimed to provide evidence that can be used to explore the impact of Natural selection on population differences. Patterns of DNA polymorphisms can be used to reliably detect signatures of selection and may help to identify genes that might underlie variation in disease resistance or drug metabolism.^[12]^[13] Such insights could improve understanding of phenotypic variations, genetic disorders and Mendelian inheritance and their effects on survival and/or reproduction of different human populations.

Project description

Шаблон:Update

Goals

The 1000 Genomes Project was designed to bridge the gap of knowledge between rare genetic variants that have a severe effect predominantly on simple traits (e.g. cystic fibrosis, Huntington disease) and common genetic variants have a mild effect and are implicated in complex traits (e.g. cognition, diabetes, heart disease).^[14]

The primary goal of this project was to create a complete and detailed catalogue of human genetic variations, which can be used for association studies relating genetic variation to disease. The consortium aimed to discover >95 % of the variants (e.g. SNPs, CNVs, indels) with minor allele frequencies as low as 1% across the genome and 0.1-0.5% in gene regions, as well as to estimate the population frequencies, haplotype backgrounds and linkage disequilibrium patterns of variant alleles.^[15]

Secondary goals included the support of better SNP and probe selection for genotyping platforms in future studies and the improvement of the human reference sequence. The completed database was expected be a useful tool for studying regions under selection, variation in multiple populations and understanding the underlying processes of mutation and recombination.^[15]

Outline

The human genome consists of approximately 3 billion DNA base pairs and is estimated to carry around 20,000 protein coding genes. In designing the study the consortium needed to address several critical issues regarding the project metrics such as technology challenges, data quality standards and sequence coverage.^[15]

Over the course of the next three years,Шаблон:Clarify scientists at the Sanger Institute, BGI Shenzhen and the National Human Genome Research Institute’s Large-Scale Sequencing Network planned to sequence a minimum of 1,000 human genomes. Due to the large amount of sequence data that was required, recruiting additional participants was maintained.^[14]

Almost 10 billion bases were to be sequenced per day over a period of the two year production phase, equating to more than two human genomes every 24 hours. The intended sequence dataset was to comprise 6 trillion DNA bases, 60-fold more sequence data than what has been published in DNA databases at the time.^[14]

To determine the final design of the full project three pilot studies were to be carried out within the first year of the project. The first pilot intends to genotype 180 people of 3 major geographic groups at low coverage (2×). For the second pilot study, the genomes of two nuclear families (both parents and an adult child) are going to be sequenced with deep coverage (20× per genome). The third pilot study involves sequencing the coding regions (exons) of 1,000 genes in 1,000 people with deep coverage (20×).^[14]^[15]

It was estimated that the project would likely cost more than $500 million if standard DNA sequencing technologies were used. Several newer technologies (e.g. Solexa, 454, SOLiD) were to be applied, lowering the expected costs to between $30 million and $50 million. The major support will be provided by the Wellcome Trust Sanger Institute in Hinxton, England; the Beijing Genomics Institute, Shenzhen (BGI Shenzhen), China; and the NHGRI, part of the National Institutes of Health (NIH).^[14]

In keeping with Fort Lauderdale principles Шаблон:Webarchive, all genome sequence data (including variant calls) is freely available as the project progresses and can be downloaded via ftp from the 1000 genomes project webpage.

Human genome samples

Файл:1000 Genomes Project.svg

Locations of population samples of 1000 Genomes Project.^[16] Each circle represents the number of sequences in the final release.

Based on the overall goals for the project, the samples will be chosen to provide power in populations where association studies for common diseases are being carried out. Furthermore, the samples do not need to have medical or phenotype information since the proposed catalogue will be a basic resource on human variation.^[15]

For the pilot studies human genome samples from the HapMap collection will be sequenced. It will be useful to focus on samples that have additional data available (such as ENCODE sequence, genome-wide genotypes, fosmid-end sequence, structural variation assays, and gene expression) to be able to compare the results with those from other projects.^[15]

Complying with extensive ethical procedures, the 1000 Genomes Project will then use samples from volunteer donors. The following populations will be included in the study: Yoruba in Ibadan (YRI), Nigeria; Japanese in Tokyo (JPT); Chinese in Beijing (CHB); Utah residents with ancestry from northern and western Europe (CEU); Luhya in Webuye, Kenya (LWK); Maasai in Kinyawa, Kenya (MKK); Toscani in Italy (TSI); Peruvians in Lima, Peru (PEL); Gujarati Indians in Houston (GIH); Chinese in metropolitan Denver (CHD); people of Mexican ancestry in Los Angeles (MXL); and people of African ancestry in the southwestern United States (ASW).^[14]

ID	Place	Population	Detail
ASW	Шаблон:Flagicon*	African Ancestry in Southwestern USA	Detail
ACB	Шаблон:Flagicon*	African Caribbean in Barbados	Detail
BEB	Шаблон:Flagicon	Bengali in Bangladesh	Detail
GBR	Шаблон:Flagicon	British from England and Scotland	Detail
CDX	Шаблон:Flagicon	Chinese Dai in Xishuangbanna, China	Detail
CLM	Шаблон:Flagicon	Colombian in Medellín, Colombia	Detail
ESN	Шаблон:Flagicon	Esan in Nigeria	Detail
FIN	Шаблон:Flagicon	Finnish in Finland	Detail
GWD	Шаблон:Flagicon	Gambian in Western Division – Mandinka	Detail
GIH	Шаблон:Flagicon*	Gujarati Indians in Houston, Texas, United States	Detail
CHB	Шаблон:Flagicon	Han Chinese in Beijing, China	Detail
CHS	Шаблон:Flagicon	Han Chinese South, China	Detail
IBS	Шаблон:Flagicon	Iberian populations in Spain	Detail
ITU	Шаблон:Flagicon*	Indian Telugu in the U.K.	Detail
JPT	Шаблон:Flagicon	Japanese in Tokyo, Japan	Detail
KHV	Шаблон:Flagicon	Kinh in Ho Chi Minh City, Vietnam	Detail
LWK	Шаблон:Flagicon	Luhya in Webuye, Kenya	Detail
MSL	Шаблон:Flagicon	Mende in Sierra Leone	Detail
MXL	Шаблон:Flagicon*	Mexican Ancestry in Los Angeles, California, United States	Detail
PEL	Шаблон:Flagicon	Peruvian in Lima, Peru	Detail
PUR	Шаблон:Flagicon	Puerto Rican in Puerto Rico	Detail
PJL	Шаблон:Flagicon	Punjabi in Lahore, Pakistan	Detail
STU	Шаблон:Flagicon*	Sri Lankan Tamil in the U.K.	Detail
TSI	Шаблон:Flagicon	Toscani in Italia	Detail
YRI	Шаблон:Flagicon	Yoruba in Ibadan, Nigeria	Detail
CEU	Шаблон:Flagicon*	Utah residents with Northern and Western European ancestry from the CEPH collection	Detail

* Population that was collected in diaspora

Community meeting

Data generated by the 1000 Genomes Project is widely used by the genetics community, making the first 1000 Genomes Project one of the most cited papers in biology.^[17] To support this user community, the project held a community analysis meeting in July 2012 that included talks highlighting key project discoveries, their impact on population genetics and human disease studies, and summaries of other large-scale sequencing studies.^[18]

Project findings

Pilot phase

The pilot phase consisted of three projects:

low-coverage whole-genome sequencing of 179 individuals from 4 populations
high-coverage sequencing of 2 trios (mother-father-child)
exon-targeted sequencing of 697 individuals from 7 populations

It was found that on average, each person carries around 250–300 loss-of-function variants in annotated genes and 50-100 variants previously implicated in inherited disorders. Based on the two trios, it is estimated that the rate of de novo germline mutation is approximately 10⁻⁸ per base per generation.^[1]

References

Шаблон:Reflist

External links

1000 Genomes - A Deep Catalog of Human Genetic Variation - official web page
International HapMap Project Шаблон:Webarchive - official web page
Human Genome Project Information

Шаблон:Wellcome Trust Шаблон:Personal genomics

↑ ^1,0 ^1,1 Шаблон:Cite journal
↑ ^2,0 ^2,1 Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite news
↑ Шаблон:Cite web
↑ Шаблон:Cite journal
↑ JC Long, Human Genetic Variation: The mechanisms and results of microevolution, American Anthropological Association (2004)
↑ Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ EE Harris et al., The molecular signature of selection underlying human adaptations, Yearbook of Physical Anthropology 49: 89-130 (2006)
↑ Шаблон:Cite journal
↑ ^14,0 ^14,1 ^14,2 ^14,3 ^14,4 ^14,5 G Spencer, International Consortium Announces the 1000 Genomes Project, EMBARGOED (2008) http://www.nih.gov/news/health/jan2008/nhgri-22.htm
↑ ^15,0 ^15,1 ^15,2 ^15,3 ^15,4 ^15,5 Meeting Report: A Workshop to Plan a Deep Catalog of Human Genetic Variation, (2007) http://www.1000genomes.org/sites/1000genomes.org/files/docs/1000Genomes-MeetingReport.pdf
↑ Шаблон:Cite journal
↑ C. King (2012) The Hottest Research of 2011. Science Watch http://archive.sciencewatch.com/newsletter/2012/201203/hottest_research_2012/
↑ 1000 Genomes Project Community Analysis Meeting http://1000gconference.sph.umich.edu/

[Pilot_phase-1] 1,0 ^1,1 Шаблон:Cite journal

[nature.com-2] 2,0 ^2,1 Шаблон:Cite journal

[3] Шаблон:Cite journal

[4] Шаблон:Cite journal

[5] Шаблон:Cite news

[6] Шаблон:Cite web

[neilsen2012-7] Шаблон:Cite journal

[ref2-8] JC Long, Human Genetic Variation: The mechanisms and results of microevolution, American Anthropological Association (2004)

[ref3-9] Шаблон:Cite journal

[ref4-10] Шаблон:Cite journal

[ref5-11] Шаблон:Cite journal

[ref7-12] EE Harris et al., The molecular signature of selection underlying human adaptations, Yearbook of Physical Anthropology 49: 89-130 (2006)

[ref8-13] Шаблон:Cite journal

[ref1-14] 14,0 ^14,1 ^14,2 ^14,3 ^14,4 ^14,5 G Spencer, International Consortium Announces the 1000 Genomes Project, EMBARGOED (2008) http://www.nih.gov/news/health/jan2008/nhgri-22.htm

[ref9-15] 15,0 ^15,1 ^15,2 ^15,3 ^15,4 ^15,5 Meeting Report: A Workshop to Plan a Deep Catalog of Human Genetic Variation, (2007) http://www.1000genomes.org/sites/1000genomes.org/files/docs/1000Genomes-MeetingReport.pdf

[pmid26568821-16] Шаблон:Cite journal

[hotpapers-17] C. King (2012) The Hottest Research of 2011. Science Watch http://archive.sciencewatch.com/newsletter/2012/201203/hottest_research_2012/

[community-18] 1000 Genomes Project Community Analysis Meeting http://1000gconference.sph.umich.edu/

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

Партнерские ресурсы
Криптовалюты	Обмен криптовалют - www.bestchange.ru Криптовалютная биржа CoinEx Криптовалютная биржа Binance HIVE OS - операционная система для майнинга e4pool - Мультивалютный пул для майнинга.
Магазины	AliExpress — глобальная виртуальная (в Интернете) торговая площадка, предоставляющая возможность покупать товары производителей из КНР; computeruniverse.net - Интернет-магазин компьютеров(Промо код 5 Евро на первую покупку:FWWC3ZKQ);
Хостинг	DigitalOcean - американский провайдер облачных инфраструктур, с главным офисом в Нью-Йорке и с центрами обработки данных по всему миру;
Разное	Викиум - Онлайн-тренажер для мозга Like Центр - Центр поддержки и развития предпринимательства. Gamersbay - лучший магазин по бустингу для World of Warcraft. Ноотропы OmniMind N°1 - Усиливает мозговую активность. Повышает мотивацию. Улучшает память. Санкт-Петербургская школа телевидения - это федеральная сеть образовательных центров, которая имеет филиалы в 37 городах России. Lingualeo.com — интерактивный онлайн-сервис для изучения и практики английского языка в увлекательной игровой форме. Junyschool (Джунискул) – международная школа программирования и дизайна для детей и подростков от 5 до 17 лет, где ученики осваивают компьютерную грамотность, развивают алгоритмическое и креативное мышление, изучают основы программирования и компьютерной графики, создают собственные проекты: игры, сайты, программы, приложения, анимации, 3D-модели, монтируют видео. Умназия - Интерактивные онлайн-курсы и тренажеры для развития мышления детей 6-13 лет SkillBox - это один из лидеров российского рынка онлайн-образования. Среди партнеров Skillbox ведущий разработчик сервисного дизайна AIC, медиа-компания Yoola, первое и самое крупное русскоязычное аналитическое агентство Tagline, онлайн-школа дизайна и иллюстрации Bang! Bang! Education, оператор PR-рынка PACO, студия рисования Draw&Go, агентство performance-маркетинга Ingate, scrum-студия Sibirix, имидж-лаборатория Персона. «Нетология» — это университет по подготовке и дополнительному обучению специалистов в области интернет-маркетинга, управления проектами и продуктами, дизайна, Data Science и разработки. В рамках Нетологии студенты получают ценные теоретические знания от лучших экспертов Рунета, выполняют практические задания на отработку полученных навыков, общаются с экспертами и единомышленниками. Познакомиться со всеми продуктами подробнее можно на сайте https://netology.ru, линейка курсов и профессий постоянно обновляется. StudyBay Brazil – это онлайн биржа для португалоговорящих студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт. Автор24 — самая большая в России площадка по написанию учебных работ: контрольные и курсовые работы, дипломы, рефераты, решение задач, отчеты по практике, а так же любой другой вид работы. Сервис сотрудничает с более 70 000 авторов. Более 1 000 000 работ уже выполнено. StudyBay – это онлайн биржа для англоязычных студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт.

Английская Википедия:1000 Genomes Project

Содержание

Background

Natural selection

Project description

Goals

Outline

Human genome samples

Community meeting

Project findings

Pilot phase

See also

References

External links

Навигация

Действия на странице

Действия на странице

Персональные инструменты

Навигация

Поиск

Инструменты