With Web-Harvest you can extract data from web sites and collect it in a common format. All of this is highly configurable via XML files, which makes it usable for all kinds of meta-crawling web applications. Best of all, it’s licensed under the BSD License.