Web pages are predictably untrustworthy to remain online as they were when you dropped by to see them. They can change while their address remains the same, can redirect an address to someplace else entirely, and entire websites can disappear meaning there’s no response when attempting to visit an address.

This means that when I link to something here, there’s no guarantee at all that when you click such a link that you actually get to see what I thought I’ve linked to. And no, screenshots don’t help: fake is easy and we need to back up our words with hyperlinks.

The same is true for stuff I don’t link to here, but save to my archive. That’s why I don’t just save URLs but the entire article for future reference to my markdown notes. That still means the actual source might disappear, without me having a way of proving what I saved is what I saw. This not only is relevant to the content itself, but also for instance for licensing information. There are photos in this blog that were openly licensed when I used them, but no longer, leaving it impossible for me to prove I still can use the image because of the license at the time.

This makes an archiving service useful, like Archive.org. I can use that to store URLs I find interesting and I do that with some regularity. It is why I am a monthly donor to the Web Archive, I’d like it to remain a more robust reference point on the web.

Currently I have one way of adding web pages to an archive, using the Wayback Machine add-on in my browser. The same add-on helps me find previous versions of a page already archived, tweets about that page, and annotations made by others. Very useful, during browsing.


The Web Archive browser add-on bookmarklet

Writing blogposts and saving webpages as markdown in my local notes, or starting to annotate a page in Hypothes.is are another matter however. There I’d like to automate getting or creating an archive link.

In all cases it would need to be an archive link next to the original. If I link to something in a blogpost here I want to still send WebMentions to the linked site, and that requires the link to the original to be in my posting. Similarly for my notes, I want to have the original url as well, although it would be reconstructable from the archive link. For online social annotations in Hypothes.is, the original link is needed because that is how you find other people’s annotations alongside your own. The last one is probably easiest, by using the browser add-on manually and adding the result as a first annotation for instance.


An archive link as first Hypothes.is annotation

On the page in the IndieWeb wiki about using the Internet Archive there are some code snippets to be found to use with the Archive’s API, or using the basic string to save something https://web.archive.org/save/urlhere. It also mentions bloggers who either send the URLs they mention, or their own postings, or both (e.g. when sending a WebMention) to the Internet Archive.
When posting to my blog from my local markdown notes I could potentially add a function to the markdown-to-html parser I use where it detects external links, runs them through the Archive and writes the html for both the direct and the archived link.

For saving web articles as local notes in markdown there are several options to explore:

  • When saving from the browser using the Markdownload add-on, first saving and copying the saved url using the Achive add-on, then pasting that archive link in the dialog box.
  • Adding [Opslaan in Internet Archive](https://web.archive.org/save/{baseURI}) to the Markdownload template so I can directly save a URL from within the local note later, if wanted. I added this experimentally, to see if I would actually use it like this.
  • When saving to notes from my microsub feedreader I could add a function to the html-to-markdown parser I use there to run external links through the Archive and write the Archive link in markdown after the original link.

Do y’all understand how easy it is to make a fake tweet from a screenshot? Like by inspecting the browser and changing the text? …. I don’t trust posts I can’t search up on archives. And if you do have a link, archive it (not in an image but using an reputable archiving service).

Jacky Alciné’s words are true, so I thought I’d illustrate.

The general principle here is: if you make a statement about someone or something other than yourself or your personal opinions, you need to back it up with a link to supporting material. “X said on Twitter” needs to be linked to that tweet. Leaving googling for your source as an exercise to your readers isn’t just merely convenient to you, it is actively destructive of the web. The web is links, and they’re a key piece of information for your readers to judge if what you tweeted/said/blogged might be signal or noise. No links means it’s likely noise and it will degrade your standing as a source of signals. No links is aiding and abetting the bots, trolls and fakesters, as it allows them to hide in more noise.

Adding a screen-shot as Jacky Alciné says is not enough ‘proof’, as they can easily be altered directly in your browser. An example:

Yesterday I posted my first Tweet from my recent brain implant. It was awesome! So awesome in fact, I made a screenshot of it to preserve the moment for posterity.

In reality I posted from Indigenous (see there’s a link there!), a mobile app that provides my phone with IndieWeb reading and publishing capabilities, which I syndicated to my Twitter account (see there’s another link!). Also awesome, but much less awesome than blogging from a brain implant.

The difference between those two screenshots, getting from true to fake, is that I altered the text of the Twitter website in my browser. Every browser allows you to see a website you visit in ‘developer’ mode. It is helpful to e.g. play around with colors, to see what might work better for your site. But you can also use it to alter content. It’s all the same to your browser. See this screenshot, where I am in the process of changing ‘Indigenous’ into ‘brain implant’

But, you say, tweets might have been deleted and grabbing a screenshot is a good way of making sure I still have some proof if a tweet does get deleted. That’s true, tweets and other content do get deleted. Like self-congratulatory tweets/VK/FB messages about the downing of MH17 by separatist supporting accounts, before it became clear a regular line flight was shot out of the air, and those accounts were quickly scrubbed (See Bellingcat‘s overview). Having a screenshot is useful, but isn’t enough. If only for the reason that the originator may simply say you faked it, as it can so easily be done in a browser (see above). You still need to provide a link.

Using the Web Archive, or another archiving site, is your solution. The Web Archive has preserving as much of the web and other online content as possible as its mission. It is a trustable source. They save web pages on their own initiative, but you can submit any URL for preservation yourself and it will immediately be saved to the archive. Each archived page has its own URL as well, so you can always reference it. (Many links in Wikipedia point to the archived version of a page from the point in time it was referenced in Wikipedia for this reason).

I submitted my tweet from yesterday to the Web Archive, where it now has a web address that neither I, nor Twitter can change. This makes it acceptable proof of what I did in fact send out as a tweet yesterday.