The University of Illinois Open Archives Initiative Metadata Harvesting Project

 

Illinois OAI Protocol Metadata Harvesting Project

Status Report Covering Quarters 1 and 2 of the Project

(17 January 2002)

Summary of Accomplishments & Events 1 July 2001 through 31 December 2001:

July

August

September

October

November

December

Harvest Activities to Date:

Preliminary investigation into appropriate OAI Data Providers to harvest for this project began in September. From the list of registered OAI Data Providers offered through the Open Archives Initiative website (http://www.openarchives.org), we first selected sites that appear to provide materials significant to cultural heritage. We continue to monitor http://www.openarchives.org for new sites providing content relevant to this project. Only sites that contain at least some cultural heritage records are being harvested.

In addition to harvesting OAI-registered Data Providers, we are also harvesting surrogate OAI Data Provider sites containing snapshots of metadata provided to us by means other than OAI. These surrogate sites are maintained on Illinois servers. The institutions providing this metadata are not yet OAI-registered Data Providers, but they have been very cooperative with this research project and have expressed their intention to make their data available directly via OAI at a later date. Illinois will maintain the surrogate sites until the owning institutions are ready to make their records directly available for harvesting as OAI-registered Data Providers.

To date, Illinois has harvested over one million unique records from the institutions listed below. For scalability testing, all sites are fully harvested. Complete site harvests are done monthly; incremental harvests (harvesting only records that have been added to a site or that have changed since the last harvest) are done at shorter intervals, as appropriate for each site. We estimate that about 500,000 of these records are relevant to the cultural heritage domain.
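The difference between a complete and an incremental harvest can be illustrated with a minimal sketch. It is not the harvester used in this project; it only shows how a ListRecords request might be formed, assuming the optional `from` date argument defined by the OAI protocol is used to limit a harvest to records added or changed since the last run. The function name and the example base URL are hypothetical.

```python
from datetime import date
from typing import Optional
from urllib.parse import urlencode


def build_list_records_url(base_url: str,
                           metadata_prefix: str = "oai_dc",
                           since: Optional[date] = None) -> str:
    """Return a ListRecords request URL (hypothetical helper).

    With since=None the request asks for every record (complete harvest).
    With a date, the 'from' argument asks only for records added or
    changed on or after that date (incremental harvest).
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if since is not None:
        # The protocol expresses day-granularity dates as YYYY-MM-DD.
        params["from"] = since.isoformat()
    return base_url + "?" + urlencode(params)


# Assumed example endpoint, not one of the sites named in this report.
full_url = build_list_records_url("http://example.org/oai")
incremental_url = build_list_records_url("http://example.org/oai",
                                         since=date(2001, 12, 1))
```

A monthly complete harvest would issue the first form for each site; the shorter-interval incremental harvests would issue the second form with the date of the previous harvest.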

(* denotes institutions that are not yet registered OAI Data Providers and for which we are hosting snapshots of their metadata on Illinois servers.)

Harvested metadata spans a wide range. We have metadata representing collections of cultural and natural history materials, early motion pictures, sheet music, photographs, poetry, letters and manuscripts, finding aids, biographical and bibliographical information, books, and scholarly papers related to cultural heritage.  Specific collection emphases:

Progress & Additional Details Regarding Specific Tasks:

Task One (85% Complete):

Construction of Baseline Harvesting Service (July 1 – December 31, 2001)

Solicitation / acquisition of Metadata:

Harvesting Service Technical Infrastructure

Task Two (50% Complete):

Portal Creation and Development (September 1 – December 31, 2001)