Skip to content

Leonid Mamchenkov

Life, universe, and everything else

Home
Archives
About
Contact me

Email
Skype
LinkedIn
GitHub
Facebook
Twitter
Instagram
Flickr
YouTube
SlideShare
RSS Feed

Search for:

On this day...

2019: PHP CodeSniffer: Ignoring rules
2018: Killed by Google
2018: Stack Overflow Buddy
2017: Querying CSV with SQL
2014: Daily dose of Instagram
2013: On the cover of Forbes … (not really)
2013: Google Apps : End of support for Microsoft Internet Explorer 9
2013: GoPro HERO3: Almost as Epic as the HERO3+
2013: Snowflakes, close-up
2013: Space technologies on Earth
2013: iwStack – Cloud services by Prometeus
2013: WordPress 3.8 plans responsive redesign of admin area
2013: HostGator.com website hosting for $0.01
2011: Day in brief – 2011-11-23
2010: Day in brief
2010: 20 must see TED videos for Computer Science people
2010: Adding Google Apps GTalk account to Pidgin
2010: Red
2010: More improvements for Movie Reviews
2010: Due Date

Archiving web sites

LWN runs an interesting article, covering different ways of archiving a website. It sounds trivial, but it’s not. Even the simplest of ways – wget – will probably take you a few dozen attempts to figure out the following:

$ wget --mirror --execute robots=off --no-verbose --convert-links \
       --backup-converted --page-requisites --adjust-extension \
       --base=./ --directory-prefix=./ --span-hosts \
       --domains=www.example.com,example.com http://www.example.com/

There a few other interesting tools (like pywb) mentioned.

Share:

Click to share on Twitter (Opens in new window)
Click to share on Facebook (Opens in new window)
Click to share on LinkedIn (Opens in new window)
Click to share on Pinterest (Opens in new window)
Click to share on Pocket (Opens in new window)
Click to share on Reddit (Opens in new window)
Click to email a link to a friend (Opens in new window)
More

Click to share on WhatsApp (Opens in new window)
Click to share on Telegram (Opens in new window)
Click to share on Tumblr (Opens in new window)
Click to print (Opens in new window)

Related

Posted on November 23, 2018Author Leonid MamchenkovCategories All, Linux, Sysadmin, Technology, Web work, WordPressTags backup, command line, web development, web hosting

Leave a CommentCancel reply

Post navigation

Previous Previous post: Killed by Google

Next Next post: Learn Git Branching

Proudly powered by WordPress

This website uses cookies. Purely for technical reasons. Accept Reject Read more

Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.

Necessary

Always Enabled

Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.

Non-necessary

Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.

SAVE & ACCEPT

Go to mobile version