Re: [PLUG] archiving websites

Jeff Abrahamson on 22 Mar 2004 03:03:02 -0000

[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

Re: [PLUG] archiving websites

From: Jeff Abrahamson <jeff@purple.com>

To: plug@lists.phillylinux.org

Subject: Re: [PLUG] archiving websites

Date: Sun, 21 Mar 2004 21:54:17 -0500

Mail-followup-to: plug@lists.phillylinux.org

Reply-to: plug@lists.phillylinux.org

Sender: plug-admin@lists.phillylinux.org

User-agent: Mutt/1.5.5.1+cvs20040105i

On Sun, Mar 21, 2004 at 09:35:08PM -0500, M. Jackson Wilkinson wrote: > [23 lines, 142 words, 1035 characters] Top characters: e_itnsol > > Hey everyone, > > The college for which I work is redesigning their website, and in the > process wants to archive their current site for posterity's sake. Since > all of the pages are dynamically-generated, it doesn't make much sense > from an archival standpoint to simply copy the web tree to disk, and we > want to find a way to archive the site as it's generated. > > Have any of you been in a similar situation and found a solution? We > want something flexible so we can say "start at this URL and go 3 levels > deep, but don't archive jpgs and gifs" and modify those parameters as is > appropriate. > > Heretrix looks like it could be in the right direction, but it clearly > isn't ready yet... man wget I do this often enough that I made an alias: jeff@asterix:jeff $ type mirror mirror is a function mirror () { echo fast mirror; wget --no-host-directories --convert-links --mirror --no-parent $1 } jeff@asterix:jeff $ -- Jeff Jeff Abrahamson <http://www.purple.com/jeff/> GPG fingerprint: 1A1A BA95 D082 A558 A276 63C6 16BF 8C4C 0D1D AE4B
Attachment: signature.asc
Description: Digital signature

Follow-Ups:

Re: [PLUG] archiving websites
From: "William H. Magill" <magill@mcgillsociety.org>

Re: [PLUG] archiving websites
From: McLinux <mclinux@myrealbox.com>

References:

[PLUG] archiving websites
From: "M. Jackson Wilkinson" <jackw@jounce.net>

Prev by Date: Re: [PLUG] archiving websites

Next by Date: Re: [PLUG] Re: SPF

Previous by thread: Re: [PLUG] archiving websites

Next by thread: Re: [PLUG] archiving websites

Index(es):

Date

Thread