Archives for November 2009
Scrapers and Irregularities
I am always keen to learn about Scrapers who allow excerpting structured XML data from “almost” structured HTML. Unfortunately, I haven’t yet found one who is robust enough against small irregularities.
