Английская Википедия:End of Term Web Archive

Материал из Онлайн справочника
Перейти к навигацииПерейти к поиску

Шаблон:Infobox project Шаблон:United States presidential transitions series

The End of Term Web Archive preserves U.S. federal government websites during administration changes.[1]

Background

The End of Term Web Archive was set up following a 2008 announcement from National Archives and Records Administration (NARA) that they would not be archiving government websites during transition, after carrying out such crawls in 2000 and 2004.[2] The 2004 federal web harvest can be accessed alongside congressional web harvests, beginning with the 109th United States Congress, at National Archives.[3]

The first project partners were the Library of Congress, George Washington University, Stanford University, University of North Texas, the US Government Publishing Office, California Digital Library and the Internet Archive, all members of the International Internet Preservation Consortium. The project was initially sketched out after a General Assembly of the IIPC in 2008.[4] NARA and the Environmental Data & Governance Initiative (EDGI) joined the 2020/21 project.[5]

The project

Screenshot of a "Page Not Found" message relayed by whitehouse.gov.
Custom error page used to direct whitehouse.gov visitors as the website changed in 2009.

The project archives websites and documents for public access and research use.[6] A UNT study into the risk to document files found that 83% of PDFs on the .gov domain in 2008 were missing four years later.[7] This is consistent with the requirement to manage websites, but their status means that changes may be of interest to the public and watchdog groups.[8] Evidence of the demand for continued access to historical web material can be found in an announcement made by the EPA in response to concerns about changes in 2017, stating that pages from the previous administration would be carefully archived.[9] These snapshot pages were clearly marked to distinguish them from contemporary content.[10]

The archive prioritizes sites administering areas regarded as likely to be updated or removed over the period of transition.[11] The public are encouraged to nominate important sites and these are combined with broad crawls of government domains to create the collection.[12][13] Although it is extensive - the 2016 crawl preserved 11,382 sites - it stops short of being comprehensive.[14][15] Researchers have used these collections to examine the history of climate change policy and reuse of suspended U.S. government Twitter accounts.[16][17]

See also

References

Шаблон:Reflist


Шаблон:US-gov-stub

Шаблон:United States presidential transitions