How To Read .ARC Files From The Heritrix Crawler Using Python?
I looked at the Heritrix documentation website, and they listed a Python .ARC file reader. However, it is 404 not found when I clicked on it. http://crawler.archive.org/articles/de
Solution 1:
Nothing a little Googling can't find: http://archive-access.cvs.sourceforge.net/viewvc/archive-access/archive-access/projects/hedaern/
Post a Comment for "How To Read .ARC Files From The Heritrix Crawler Using Python?"