Why not just use archive.org's crawler to do this, instead of rewriting the wheel.
http://crawler.archive.org/
Looks easy enough, but I don't have a hosting spot with a big chunk of storage space. As political sites may have video or other large files I'd imagine 50 sites, every 2-4 hours, would add up rather fast.
If anyone isn't up to taking this on, but has hosting space, give me a PM and I can do it.
http://crawler.archive.org/
Looks easy enough, but I don't have a hosting spot with a big chunk of storage space. As political sites may have video or other large files I'd imagine 50 sites, every 2-4 hours, would add up rather fast.
If anyone isn't up to taking this on, but has hosting space, give me a PM and I can do it.