Inspiring discovery through free access to biodiversity knowledge.

The Biodiversity Heritage Library improves research methodology by collaboratively making biodiversity literature openly available to the world as part of a global biodiversity community.
BHL also serves as the literature component of the Encyclopedia of Life .

  

Art of Life: Face to Face meeting

Oct 4-5th, 2012, STL

Location

Center for Biodiversity Informatics, Missouri Botanical Garden, St. Louis
4651 Shaw Blvd, St. Louis, MO 63110
Map: http://bit.ly/J6rqxY


Accommodations

The Chase Park Plaza Hotel
212 N. Kingshighway, Saint Louis, MO 63108
Map: http://bit.ly/J6IRhM
*2 rooms have already been reserved for Charlie & Ed. Payment due upon checkout.

Trelease House
4466 Castleman Avenue (corner of Maury), St Louis MO 63110
Map
https://maps.google.com/maps?hl=en&client=firefox-a&q=4466+Castleman+Avenue,+St.+Louis,+MO&ie=UTF-8&hq=&hnear=0x87d8b4fdf1a6bdb3:0x8e0042bcbd8a85cc,4466+Castleman+Ave,+St+Louis,+MO+63110&gl=us&ei=l8s7UL6xL8HvygGP6oD4Aw&ved=0CB4Q8gEwAA
*2 rooms have been reserved for Rob & Gaurav




Agenda

Thurs Oct 4th

10-12:30pm Introductions, Art of Life workflow diagram and discussion
12:30-2:00pm Lunch and garden tour
2:00-5pm Schema update and discussion
5:00 - 7:00 Charlie and Ed to hotel, Rob & Gaurav to Trelease House
7:15pm - Dinner in Central West End


Fri Oct 5th

8:30am : Breakfast at CBI Office
9:00 - 11:00 Algorithm update and discussion
11:00 - 12:00pm : Defining system requirements for deploying algorithm on BHL Cluster
12:00 - 1:30 : Lunch (skype call with Richard from Wikimedia)
1:30 - 3:00 : Wrapup/next steps

Meeting Notes

Action Items from Oct 2012 face to face meeting
EXTRACT
ACTION ITEM: William will have a call with Martin & Nathan at MBL Woods Hole to discuss use of cluster.
ACTION ITEM: Look into long term hosting of algorithm and results. (William)
ACTION ITEM: Determine how JP2 images will be converted to JPEGS whether its kakadusoftware, ImageMagick, Jasper or some other tool (Mike and Ed)
ACTION ITEM: Give Ed list of titles that have already been paginated (Mike)
ACTION ITEM: extract and display pixel info from scandata file in extraction algorithm analyzer UI (Ed)
ACTION ITEM: Test exporting algorithm results as JSON file and then ingesting into portal. (Mike and Ed)
ACTION ITEM: Investigate Python algorithm Orange for improving recall (Ed)
ACTION ITEM: Check the compression ration numbers. (Ed) [I’m not totally sure what this action item means but it was in relation to the algorithm finding book covers]
ACTION ITEM: Add sum of block coverage for text to Algorithm analyzer UI (Ed)

CLASSIFY
ACTION ITEM: Determine what metrics from extraction belong in the schema (Trish)
ACTION ITEM: Investigate further how to apply guids and whether we need separate guids for metadata record separate from the bhl page url (Trish)
ACTION ITEM: Investigate which metadata should be stored in image header. (Trish)
ACTION ITEM: Investigate how we could utilize citizen scientists to complete the classify step (Trish)
ACTION ITEM: Add to classifier functionality the ability to flag a page to be sent to the description tool (Trish)
ACTION ITEM: Investigate further whether the page url should be its own element as in the Core and update schema if needed (Trish)
ACTION ITEM: Determine which set of images to start with having the least amount of copyright issues in testing uploads to Wikimedia and Flickr (Trish)
ACTION ITEM: Generate list of unique values in MARC 260|a then review list for most recurring cities of publication – note this should only be for publication post 1840? Completion of this action item will depend on copyright strategy for choosing images (Trish and Mike)
ACTION ITEM: pull geographic subject value from 650 |z info into descriptive metadata record within the Classifier UI (Trish)
[ACTION ITEM] Get clarification from Chris as to what is meant by “Incorporate existing API to find scientific names on images”. Person tags subject: scientific name, once brought back into BHL, name is run through ubio. Need to rethink when this step happens. (Trish and William)

DESCRIBE
ACTION ITEM: Look into Flickr API to see if we could push full schema from Wikimedia commons to Flickr machine tags. (Trish)
ACTION ITEM: Investigate further the tools available on the web to verify when an image was last updated on Description platforms and how that info can be used to update image record in BHL portal. (Trish and Mike)
ACTION ITEM: Determine how pages already in Flickr will be updated when those same pages are run through the algorithm and reidentified to have illustrations. (Trish and Mike)
ACTION ITEM: Look into Zooniverse as potential tool for crowdsourcing the description of the images http://www.citizensciencealliance.org/index.html (Trish)
ACTION ITEM: Work with local wikipedians and others interested in BHL in batch uploading (Trish)
ACTION ITEM: Talk to other large uploaders to Wikimedia as to their experiences (Trish)

SHARE
ACTION ITEM: Determine how to pull the url for an image from Flickr and Wikimedia when pushing tagged metadata record back into BHL portal. Find a place to display that info in the portal UI add text such as “this BHL image has been described in Flickr. If you would like to add or update information about this image please do so at http://www.flickr.com/photos/biodivlibrary/8112362021/ and that information will update the record in BHL” (Trish and Mike)
ACTION ITEM: Look into how we can utilize taxonomic name web services to do query term expansion in BHL portal so that an image tagged with a species name can also be found if user is searching on the higher level taxonomic levels such as order or family. (Trish and Mike)
ACTION ITEM: Determine easiest way for ARTstor to pull our data (Trish)

OTHER
ACTION ITEM: Review budget for money available for second face to face meeting. (Trish and William)



View Terms Of Use | Privacy
Revised: trosesandler1 Oct 25, 2012 9:35 am (19 revisions)
links to this page | print this page | Visit http://biodiversitylibrary.org
[Invalid Include: Page not found: HTML_div_close]
Contributions to https://biodivlib.wikispaces.com/ are licensed under a Creative Commons Attribution Share-Alike 3.0 License. Creative Commons Attribution Share-Alike 3.0 License
Portions not contributed by visitors are Copyright 2018 Tangient LLC
TES: The largest network of teachers in the world