Home

Project Info

The University of Illinois Open Archives Initiative Metadata Harvesting Project

Should the format element be normalized?

From the Dublin Core web site:

Name: Format

Identifier: Format

Definition: The physical or digital manifestation of the resource.

Comment: Typically, Format may include the media-type or dimensions of the resource. Format may be used to determine the software, hardware or other equipment needed to display or operate the resource. Examples of dimensions include size and duration. Recommended best practice is to select a value from a controlled vocabulary (for example, the list of Internet Media Types [MIME] defining computer media formats).

 

In the analysis of the metadata in the repository, we discovered that the format element was used for:

Given the specificity of the value of each format element and that there are very few repeating format values, we question the benefit of trying to normalize these values. The benefits to end-users are few particularly given the values being used.

Two other possibilities were discussed but have yet to be decided or acted on. The first is adding an additional value that reflects a condition or information about the value of the format element that might benefit the end-user by clarifying what is meant. For instance, would adding the term 'extent' to all archival records aid users in deciphering what '3 cubic feet' means? The second possibility is adding another value that reflected information found elsewhere in the record. For instance, it may be that the format element could be used to indicate whether there is a digital image of an item by building off of the identifier element. This would be an added value and would use the format element as DCMI intended. For instance, if an identifier ended in 'jpg', this could be added to the format element. Before implementing either of these possibilities, it would be important to understand whether the time and effort it would take to make these changes is worth the benefits they would give the end user.

Sarah Shreeves

Posted 7-3-02