M. Jackson Wilkinson on 22 Mar 2004 02:36:02 -0000



[PLUG] archiving websites


Hey everyone,

The college I work for is redesigning its website and, in the process, wants to archive the current site for posterity. Since all of the pages are dynamically generated, simply copying the web tree to disk would capture the scripts rather than the pages they produce, so we want a way to archive the site as it's generated.

Have any of you been in a similar situation and found a solution? We want something flexible, so we can say "start at this URL and go 3 levels deep, but don't archive jpgs and gifs" and adjust those parameters as appropriate.
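
To make the requirement concrete, here is a rough Python sketch of the kind of crawl described above. It is only an illustration, not an existing tool: the names crawl, http_fetch, and LinkParser are made up for this example, and it assumes the pages are plain HTML on a single host. It starts at one URL, follows links up to a given depth, skips URLs ending in the listed extensions, and keeps the generated output of each page in a dictionary.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects href and src attribute values found in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def http_fetch(url):
    """Fetch a URL and return the generated page as text."""
    with urlopen(url) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(start_url, max_depth=3, skip_exts=(".jpg", ".gif"), fetch=http_fetch):
    """Breadth-first crawl from start_url; returns {url: page_text}.

    max_depth counts link hops from the start page (the start page is depth 0).
    Only URLs on the same host as start_url are followed.
    """
    host = urlparse(start_url).netloc
    archive = {}
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        try:
            body = fetch(url)
        except Exception:
            continue  # assumption: silently skip pages that cannot be fetched
        archive[url] = body
        if depth >= max_depth:
            continue  # archived, but do not follow its links any further
        parser = LinkParser()
        parser.feed(body)
        for link in parser.links:
            # Resolve relative links and drop any #fragment
            target, _ = urldefrag(urljoin(url, link))
            parts = urlparse(target)
            if parts.netloc != host:
                continue  # stay on the site being archived
            if parts.path.lower().endswith(tuple(skip_exts)):
                continue  # excluded file types, e.g. jpgs and gifs
            if target not in seen:
                seen.add(target)
                queue.append((target, depth + 1))
    return archive
```

Something like crawl("http://www.example.edu/", max_depth=3) would then return the pages to write to disk (example.edu is a placeholder). The fetch argument is there so the depth and filter rules can be tried against a canned set of pages before pointing it at a live server.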

Heritrix looks like it could be a step in the right direction, but it clearly isn't ready yet...

Thanks for any ideas,
Jack Wilkinson

___________________________________________________________________________
Philadelphia Linux Users Group         --        http://www.phillylinux.org
Announcements - http://lists.phillylinux.org/mailman/listinfo/plug-announce
General Discussion  --   http://lists.phillylinux.org/mailman/listinfo/plug