This website is an attempt at #archiving every website seizure banner image or page created by government agencies.
This website is an attempt at #archiving every website seizure banner image or page created by government agencies.
Have you listened to the Future Knowledge #podcast from @internetarchive + @AuthorsAlliance?
Catch up on past episodes & get ready for a new one this week!
Listen & subscribe wherever you get your podcasts: https://futureknowledge.transistor.fm
Anyone with experience digitizing old VHS tapes?
I have an old VHS tape I’d like to convert and the old VHS player in the image. What I’d like to do is connect the player to my PC (running Linux), press play, and simply record the output on my PC. Nothing fancy.
Obviously I need some kind of SCART to HDMI/DP/USB cable, but what exactly? And what about software?
thou shalt not dupe!
or shall thou?
it’s 2025…
thou shall dupe ALL!
all? NO, ALL!
#TIL that;
"Ngā Taonga Sound & Vision was formed by the amalgamation of the New Zealand Film Archive Ngā Kaitiaki O Ngā Taonga Whitiāhua, Sound Archives Ngā Taonga Kōrero, and the Television New Zealand Archive between 2012 and 2014."
https://www.ngataonga.org.nz/about-us/who-we-are/about-nga-taonga-sound-vision/#our-story
#HatTip to the audio-visual archivists, helping to defend us all from the Ministry of Truth and the memory hole, one recording at a time : )
I trust #InternetArchive with government documents more than I do the government! (Or #AI for that matter...)
Internet Archive is now an official #USGovernment document #library
by Steve Dent
Fri, July 25, 2025
"The US Senate has granted the Internet Archive federal depository status, making it officially part of an 1,100-library network that gives the public access to government documents, #KQED reported. The designation was made official in a letter from California Senator Alex Padilla to the Government Publishing Office that oversees the network. 'The Archive's digital-first approach makes it the perfect fit for a modern #FederalDepositoryLibrary, expanding access to federal government publications amid an increasingly digital landscape,' he wrote.
"Established by Congress in 1813, the Federal Depository Library Program is designed to help the public access government records. Each congressional member can designate up to two libraries, which include government information like budgets, a code of federal regulations, presidential documents, economic reports and census data.
"With its new status, the Internet Archive will be gain improved access to government materials, founder Brewster Kahle said in a statement. 'By being part of the program itself, it just gets us closer to the source of where the materials are coming from, so that it’s more reliably delivered to the Internet Archive, to then be made available to the patrons of the Internet Archive or partner libraries.' The Archive could also help other #libraries move toward #DigitalPreservation, given its experience in that area.
"It's some good news for the site which has faced legal battles of late. It was sued by major #publishers over loans of #DigitalBooks during the #Coronavirus epidemic and was forced by a federal court in 2023 to remove more than half a million titles. And more recently, major music labels filed lawsuits over its #Great78Project that strove to preserve #78RPM records. If it loses that case it could owe more than $700 million damages and possibly be forced to shut down.
"The new designation likely won't aid its legal problems, but it does affirm the site's importance to the public. 'In October, the Internet Archive will hit a milestone of 1 trillion pages,' Kahle wrote. 'And that 1 trillion is not just a testament to what libraries are able to do, but actually the sharing that people and governments have to try and create an #EducatedPopulace.' "
https://tech.yahoo.com/general/articles/internet-archive-now-official-us-123036550.html
University of Minnesota is archiving photos of U.S. National Parks signs before they're removed:
https://sites.google.com/umn.edu/save-our-signs/home
Submission form: http://saveoursigns.org/
Archive.today seems to have stopped being able to capture Sharkey posts. Archive.org already couldn't capture them. This is a problem for my wiki. Does anyone have any idea what changed?
Here's my attempt today to capture a page:
https://archive.today/RA19G
Notice none of the content is visible, although it is still present in the source. By contrast, here's a successful capture of a post last year.
#Sharkey #Archiving
https://archive.ph/hRTQf
@jonasgeiler @Habbie Thanks, Jonas :)
Also see Look Over There! – which you can easily deploy yourself on a tiny VPS – it’s how I’m redirecting our old sites to archive.org (https://look-over-there.small-web.org/)
Today is #InternationalArchivesDay and I once again would like to share my thanks to https://archive.org and all archives and libraries for preserving so much things that would otherwise be lost to history.
Just noticed an issue with Look Over There!¹ where some of the sites I was forwarding to archive.org began to fail.
I’ve now documented the proper way to redirect to archive.org in Look Over There! and I also wrote a brief post on the relevant Reddit to alert the folks at The Internet Archive about this and suggest an improvement that could benefit findability on sites with redirects in general:
¹ e.g., See our instance for Small Technology Foundation at https://look-over-there.small-web.org
We're in the process of scanning 48 bankers boxes of the MIT MULTICS archive. We'd like to get a decent high speed document scanner to make this process go easier. Can you help us with a donation? https://sdf.org
Once the scanning is complete all of these will be made available online for everyone.
So here’s a really useful tip if you’re using archive.org to look up a site and it redirects to some other site because that’s what they saw at a later crawl…
Prefix your search (in the URL bar) with:
https://web.archive.org/web/*/
So, for example, if you search for https://www.blendwebmix.com/ on archive.org, you’ll get a redirect to a different site.
But if you enter the following URL in your browser:
https://web.archive.org/web/*/https://www.blendwebmix.com/
You’ll get the calendar view with all the crawls of the URL.
Thanks to iarchivist for the tip (https://archive.org/post/1012493/wayback-how-to-deal-with-redirects).
(I’m going through and fixing all the links in a decade of talks for Laura and myself on the Kitten version of the Small Technology Foundation site and most of the conference sites are either gone or don’t have archives so the Internet Archive is coming through again bigtime.)
"This destruction reveals not only the brutality of war, but also a long-standing neglect of the importance of digital archiving and the use of technical advances to preserve documents and archives. This neglect threatens to lose a history that could have been documented for the present and built upon for the future."
New Kitten Release
• Implements cascading archives support
https://kitten.small-web.org/reference/#cascading-archives
Cascading archives¹ is useful if you have a static archive of the old version of your site and you don’t want to host it somewhere else and use the 404→307 (evergreen web) technique (https://kitten.small-web.org/reference/#evergreen-web-404-307) (the latter is useful if the old version of your site is a dynamic site and you cannot take a static archive of it).
If a URL cannot be found on your app, Kitten will trying it in the archive folders:
__archive__1
__archive__2
__archive__3
(In that order.)
So you can three older static versions of your site served without breaking older URLs unless they are shadowed by newer URLs in your site/app.
Enjoy!
¹ This is a feature that I originally implemented in Site.js (that’s going to be shut down tomorrow when Let’s Encrypt stops issuing certificates with OCSP-stapling – I don’t have the bandwidth to maintain two servers/frameworks; Kitten is Site.js’s successor). I’m planning on implementing this differently in Kitten going forward (so you can use the Settings interface to upload a zipped archive and it will serve it) but I need this for my own site for tomorrow’s shutdown so we have this simpler implementation in the meanwhile. Leaving things to the last minute? Who? Me? Never! ;)
Ooh, what’s this?… Look Over There!
(With apologies to Jaida Essence Hall)
So the little app I teased earlier is ready and deployed and I have our own instance running at:
https://look-over-there.small-web.org
Look Over There! lets you forward multiple domains to different URLs with full HTTPS support.
Why?
We have a number of older sites that are becoming a chore/expensive to maintain and yet I don’t want to break the web. So I thought, hey, I’ll just use the “url forwarding” feature of my domain registrar to forward them to their archived versions on archive.org.
Ah, not so fast, young cricket… seems some domain registrars’ implementations of this feature do not work if the domain being forwarded is accessed via HTTPS (yes, in 2025).
So, given Kitten¹ uses Auto Encrypt² to automatically provision Let’s Encrypt certificates, I added a domain forwarding feature to it and created Look Over There! as a friendly/simple app that provides a visual interface to it.
To see it in action, hit https://cleanuptheweb.org and you should get forwarded to the archived version of it on archive.org. I’m going to be adding more of our sites to the list in the coming days as part of an effort to reduce my maintenance load and cut down our expenses at Small Technology Foundation.
Since it’s Small Web, this particular instance is just for us. However, you can run your own copy on a VPS (or even a little single-board computer at home, etc.) A link to the source code repository is on the site. Once Domain³ is ready for use (later this year ), setting up your own instance of a Small Web app at your own server will take less than a minute.
I hope this little tool, along with the 404→307 (evergreen web) technique⁴, helps us to nurture an evergreen web and avoid link rot. (And the source code, as little as there is because Kitten does so much for you, is a good resource if you want to learn about Kitten’s new class-based component and page model which I haven’t yet had a chance to properly document.)
Enjoy!
¹ https://kitten.small-web.org
² https://codeberg.org/small-tech/auto-encrypt
³ https://codeberg.org/domain/app
⁴ https://4042307.org
I’m going through our events page on the Small Technology Foundation web site¹ and porting the entries there to the new version of our web site that I’m building in Kitten² and it’s depressing how many event sites have just disappeared.
Thank goodness for archive.org.
does someone have a file server hosted at home (or privately enough that it is YOURS) and wants to archive all Linus Tech Tips Floatplane Exclusives??? (in a way I could still download/access them if I want to)
I really need some storage space :'D and I want to archive this forever if possible
it's 154GB of 1080p30fps videos, all the Floatplane exclusives from when they started being a thing, until 20th February (I really could only afford one month so some recent vids are already missing)
(please boost for maximum reach )
@gutenberg_org @internetarchive is anyone doing anything about the idea of including the codec with the scan for future future generations? I love digital archiving but at the same time there are already videos in my own archive that I can no longer easily decode.
#software #codec #transcoding #archiving #archival #digitalArchaeology