McClatchy, Advance Local, Tribune Publishing and other major newspaper chains are restricting the nonprofit's archiving bots.
Uh-oh, Internet! A new report from Nieman Lab (via Gizmodo) reveals that there was a steep decline in snapshots collected by the Internet Archive’s Wayback Machine beginning in May of this year. Of ...
As major news outlets cut off the Wayback Machine, journalists and advocacy groups are rallying to protect the Internet Archive’s vast collection of web pages. USA Today Co., the publishing ...
The Internet Archive has been responsible for saving and providing access to trillions of websites over the past 30 years. AI is putting a damper on the organization’s work, as large language models ...
ST. PETERSBURG, Fla. (Dec. 2, 2025) — The Poynter Institute partners with Internet Archive and Investigative Reporters and Editors (IRE) to bring preservation and web archive training to 300 news ...
The feed offers a close-up look of how microfiche — the sheets of films that store multiple documents — are digitized and uploaded to the Archive. The feed offers a close-up look of how microfiche — ...
Local news outlets are blocking Internet Archive from preserving their journalism, erasing digital history as AI fears drive ...
The New York Times, CNN, USA Today, The Guardian, and at least 241 other news organisations across nine countries have moved to restrict the Archive’s crawlers, a decision the Archive’s own director ...
Last month, the Internet Archive’s Wayback Machine archived its trillionth webpage, and the nonprofit invited its more than 1,200 library partners and 800,000 daily users to join a celebration of the ...
Reddit is now blocking the Internet Archive (IA) from indexing popular Reddit threads after allegedly catching sneaky AI firms—restricted from scraping Reddit—instead simply scraping data from IA’s ...