It started quite innocently, with overhauling this blog and moving it to a new domain – goodbye pagerank, meh – and I thought:
What if Medium.COM goes the same way as some Storify or some link shorteners, and I lose all my essays?
It turns out that Medium have a “Download Your Information” feature – which is handy, but unhelpfully produces a zipfile full of HTML as a download format.
This led to a work-in-progress python script that uses BeautifulSoup to parse and rewrite all the HTML, and Requests to fetch and store the images. It’s not large but I don’t want to share it until I’ve had a chance to test the output / see what it looks like in WordPress.
Slight problem with that, though: the WordPress REST API requires credentials which Apache is stripping, and nothing seems inclined to fix the problem. Sigh.
Ergo: converting my blogserver to Ubuntu64 on Raspberry Pi. Watch this space.
Leave a Reply