The NEH-funded Art of Life project held its second face-to-face meeting in November 2013 in St. Louis, Missouri. The institutions represented were the Missouri Botanical Garden (MOBOT), the Indianapolis Museum of Art (IMA), the University of Colorado Boulder (CU-Boulder), Washington University in St. Louis (WUSTL), and Smithsonian Institution Libraries (SIL). The team focused primarily on how to bring the algorithm work to a close. The IMA developed four algorithms for identifying which pages in the BHL corpus contain images. Those algorithms were run across a gold standard set of 40k pages to determine their accuracy and performance. Two of the four algorithms were deemed useful, with accuracy ratings above 80%.