Web archiving
From The Art and Popular Culture Encyclopedia
Related e |
Featured: ![]() Kunstformen der Natur (1904) by Ernst Haeckel |
Web archiving is the process of collecting portions of the World Wide Web and ensuring the collection is preserved in an archive, such as an archive site, for future researchers, historians, and the public. Due to the massive size of the Web, web archivists typically employ web crawlers for automated collection. The largest web archiving organization based on a crawling approach is the Internet Archive which strives to maintain an archive of the entire Web. National libraries, national archives and various consortia of organizations are also involved in archiving culturally important Web content. Commercial web archiving software and services are also available to organizations who need to archive their own web content for corporate heritage, regulatory, or legal purposes.
See also
- Archive
- Archive site
- Archive Team
- Digital preservation
- Heritrix
- International Internet Preservation Consortium
- Internet Archive Wayback Machine
- Internet memory
- Library of Congress Digital Library project
- List of Web archiving initiatives
- Memento Project
- National Digital Information Infrastructure and Preservation Program
- PADICAT
- Pandora Archive
- Portuguese Web Archive
- Project MINERVA
- UK Web Archiving Consortium
- Virtual artifact
- WebCite
- Web crawling