How much of the Web is archived? Why the answer matters.

By Alexis Madrigal

January 8, 2013

Here's the challenge: new Internet is being made all the time. Oftentimes, these new pages are added to existing networks on Tumblr or Facebook or Twitter or Livejournal. But other times, someone fires up a web server that's off the standard map, and it the web's crawlers, try as they might, may not find that page for a while, if ever.

That means some percentage of the web is not being archived by anyone (or anything, really), not even the Internet Archive's invaluable Wayback machine.

And certainly, few sites are being archived with any kind of regularity, even those (like TheAtlantic.com) that are changing constantly. So, how much of the web is humanity missing?

Read more at The Atlantic.


By Alexis Madrigal

January 8, 2013

http://www.govexec.comhttp://www.nextgov.com/cloud-computing/2013/01/how-much-web-archived-why-answer-matters/60527/