All posts from June 2008

BHL News, Blog Reel, Tech Updates

Updated Harvesting Process from the Internet Archive

Note: This is a revision of our previous blog post that described our process for harvesting digitized books from the Internet Archive. Their query interface changed, and we’ve updated our process & documentation accordingly.

June 13, 2008byChris Freeland

BHL News, Blog Reel, Tech Updates

WonderFetch(tm) & IA _meta.xml fields

WonderFetch is the term used for prepopulating the Internet Archives metadata forms (so named because it is more wonderful than regular z39.50 fetching). Using WonderFetch, partner libraries can populate fields with data that would not normally be populated as part of the standard IA process, and then store those values in the foobar_meta.xml file alongside each scanned item in the IA repository. Part of the impetus for implementing WonderFetch was not just to automate the inclusion of volume and issue information for serials – which was important – but to also capture due diligence, rights, and licensing information related to each item. (And yes, the TM is a little joke! No rights reserved).

June 4, 2008byKeri Thompson

About BHL

The Biodiversity Heritage Library (BHL) is the world’s largest open access digital library for biodiversity literature and archives. Headquartered at the Smithsonian Libraries and Archives in Washington, D.C., BHL operates as a worldwide consortium of natural history, botanical, research, and national libraries working together to digitize the natural history literature held in their collections and make it freely available for open access as part of a global “biodiversity community.”

All posts from June 2008

Updated Harvesting Process from the Internet Archive

WonderFetch(tm) & IA _meta.xml fields

Help Support BHL

search

About BHL

Follow BHL

Join Our Mailing List

Subscribe to Blog via Email

Subscribe to Blog Via RSS

BHL on Twitter