Digital archives keep track of lost websites, lapsed domain names from the early years | Latest News India - Hindustan Times
close_game
close_game

Digital archives keep track of lost websites, lapsed domain names from the early years

Hindustan Times, New Delhi | By
Aug 14, 2020 12:09 AM IST

Users got their first taste of power in 1994, with public-journal formats that allowed anyone to post their thoughts for the world. We didn’t even call them blogs until 1999, but by 2004, Blog was the Oxford English Dictionary’s Word of the Year.

What do you remember of the first time you used the internet? The screee-beep of the modem connecting? The joy of seeing little computer icons linking to each other in your taskbar, indicating that your dial-up was working? Or paying the cyber café guy 10 to book your train ticket?

The internet has more than 60 trillion web pages(Photo by Waseem Andrabi / Hindustan Times)(Representative Image)
The internet has more than 60 trillion web pages(Photo by Waseem Andrabi / Hindustan Times)(Representative Image)

When India first got online 25 years ago, it was slow going. Pages took ages to load (with the hourglass icon turning endlessly). But once we got comfortable, we were hooked. You filled your Muzik folder with MP3 files, saw the MSN-Yahoo rivalry unfold like a repeat of the ’80s cola wars, did a web search through Ask Jeeves. And it was all free, and ad-free.

HT launches Crick-it, a one stop destination to catch Cricket, anytime, anywhere. Explore now!

Users got their first taste of power in 1994, with public-journal formats that allowed anyone to post their thoughts for the world. We didn’t even call them blogs until 1999, but by 2004, Blog was the Oxford English Dictionary’s Word of the Year.

The internet’s first-ever website, Info.cern.ch, has been saved for posterity by the guys who built the internet. But where do you go to see the rest of it? When domain names lapse, or companies collapse, the websites can vanish without a trace. You can’t pull an old one from the shelf. That’s probably why The Internet Archive is such a gem.

HIT REWIND

Born 24 years ago — just a year after India connected to the Internet — The Internet Archive is an American non-profit organisation that digitises films, books, letters, images, audio, video and software programmes. Its key project, The Wayback Machine trawls the web, copying pages, to build a library of the internet itself.

The Machine lets you see what a page looked like when it was archived, even if the site has changed or been taken down. More than 458 billion pages have been saved so far.

It’s not nearly enough. The internet has more than 60 trillion web pages. And with social media, there’s more to archive than ever.

Meanwhile, not everyone’s thrilled about record-keeping. Because the Wayback Machine collects site caches without asking, it’s raised questions about copyright infringement and privacy. In 2017, Internet Archive was among 2,600 sites banned by the Indian government as part of the fight against digital piracy. And earlier in the coronavirus disease lockdown, publishers sued the non-profit for making its digitised library available globally.

PIECES OF THE PAST

Symbolics.com, the first domain name ever registered on the internet, was already 10 years old when India logged on in 1995. It used to be a computer programming business. But it is now the Big Internet Museum.

Meanwhile, ArchiveTeam.org has been keeping track of sites and companies about to go bust and archiving their online material, since 2009. They describe themselves as “a loose collective of rogue archivists, programmers, writers and loudmouths dedicated to saving our digital heritage”, and also offer advice on how to archive your own data.

Some governments are taking big steps too. In Sweden, every web page that ends in .se has been saved by the National Library’s Web Archive division. The British Library has archived 6 billion pages.

India doesn’t have such an archive, except for election-related data. But the Wayback Machine has been paying attention to India almost from the start. There are more than 34,000 captures for just the Hindustan Times site, for instance, some dating back to 2001.

Discover the complete story of India's general elections on our exclusive Elections Product! Access all the content absolutely free on the HT App. Download now!

Get Current Updates on India News, Lok Sabha Election 2024 live, Elections 2024, Election 2024 Date along with Latest News and Top Headlines from India and around the world.
SHARE THIS ARTICLE ON
Share this article
  • ABOUT THE AUTHOR
    author-default-90x90

    Rachel Lopez is a a writer and editor with the Hindustan Times. She has worked with the Times Group, Time Out and Vogue and has a special interest in city history, culture, etymology and internet and society.

SHARE
Story Saved
Live Score
OPEN APP
Saved Articles
Following
My Reads
Sign out
New Delhi 0C
Thursday, April 18, 2024
Start 14 Days Free Trial Subscribe Now
Follow Us On