ALA2003 OAI

Information about ALA2003 OAI

Published on October 4, 2007

Author: Alien

Source: authorstream.com

Content

Using OAI-PMH to Aggregate Metadata Describing Cultural Heritage Resources:  Using OAI-PMH to Aggregate Metadata Describing Cultural Heritage Resources Timothy W. Cole ([email protected]) University of Illinois at Urbana-Champaign http://dli.grainger.uiuc.edu/Publications/TWCole/ALA2003OAI/ ALA/CLA Annual Meeting 22 June 2003 Toronto, CA Order of Presentation:  Order of Presentation Perspectives on OAI-PMH Illinois OAI metadata harvesting project Goals & objectives Findings regarding metadata Findings regarding search & discovery New OAI projects at Illinois IMLS digital collections & content CIC OAI metadata harvesting project OAI Protocol for Metadata Harvesting:  OAI Protocol for Metadata Harvesting Harvesting approach to interoperability at metadata level Divides world into Metadata Providers & Service Providers Builds on HTTP, XML, & Dublin Core http://www.openarchives.org/ OAI Antecedents:  OAI Antecedents Call to other E-Print archives (July 1999) Paul Ginsparg, Rick Luce, & Herbert Von de Sompel: “…mobilize core group to work towards achieving a universal service for author self-archived scholarly literature.” Santa Fe Mtgs. (Oct. 1999 & June 2000) OAI – PMH version history: First Alpha Release, Sept. 2000 1.0 (Beta) Release January 2001 1.1 (Beta 2) Release July 2001 2.0 (Production) Release June 2002 Original OAI Organization:  Original OAI Organization OAI Executive: Carl Lagoze & Herbert Van de Sompel OAI Steering Committee: Co-Chairs: Dan Greenstein, Cliff Lynch OAI Technical Committee Funded by NSF, DLF & CNI Seeks to be user community driven OAI-PMH as a tool:  OAI-PMH as a tool All about moving metadata around Designed to be a building block, useable by many different communities Can facilitate (in some cases enable) services & functions Assumes widely distributed content, but centralized indexing(!) & services Build once, use for many applications Focus of OAI is interoperability Harvesting vs. Broadcast :  Harvesting vs. Broadcast Competing approaches to interoperability Distributed/Broadcast searching: search and discovery over remote services and data Harvesting is when data/metadata is transferred from the remote source to the destination where search & discovery services are located (e.g. Union catalogs) OAI-PMH is a harvesting protocol As Compared to Z39.50:  As Compared to Z39.50 Metadata vs. Resources:  Metadata vs. Resources Resource refers to information objects or digital representations of information objects Metadata item is a collection of properties about a resource (e.g. title, author, etc.) Metadata record is a metadata item expressed in a specific syntax according to an XSD OAI focuses on metadata, with the implicit understanding that metadata contains useful links to the source information object(s) Data and Service Providers:  Data and Service Providers Data Providers (Repositories) refer to entities who possess resources & metadata and are willing to share metadata with others via well-defined OAI protocols Service Providers (Harvesters) are entities who harvest metadata from Data Providers in order to supply higher-level services to users (e.g. search & discovery) OAI uses these denotations for its client/server model (data=server, service=client) Reliance on HTTP & XML:  Reliance on HTTP & XML OAI-PMH is a REpresentational State Transfer (REST) protocol (unlike RPC, SOAP) OAI requests and responses are sent via the HTTP protocol OAI Requests are encoded as HTTP GET or POST operations OAI Responses are valid XML documents XML Namespaces and Schema:  XML Namespaces and Schema Consistency and data “quality” is ensured by using XML Schema Definitions (XSD) for all responses XML Namespaces are used where necessary to clearly define which parts of the responses are actual metadata and which support the Metadata Harvesting Protocol OAI-PMH Use of Dublin Core:  OAI-PMH Use of Dublin Core DC is OAI’s lowest common denominator OAI supports & encourages use of other, community-driven metadata schemas Typically, metadata provider stores metadata in ‘best’ schema as dictated by material & resources Crosswalk (semantic mapping) to simpler schemas Semantic mapping at metadata delivery (rather than at time of search) As with Z39.50, can’t search for what’s not there When to use OAI-PMH:  When to use OAI-PMH Metadata is sufficient for services desired Normalization, dedupping, metadata augmentation desired Content is widely distributed across small, non-Z39.50 enabled repositories OAI-PMH is more lightweight than Z39.50 Portals can use BOTH Z39.50 & OAI-PMH What OAI-PMH Is Not:  What OAI-PMH Is Not Not search & discovery on its own Not a database management system Not a single metadata schema Not OAIS How OAI Works:  How OAI Works OAI “VERBS” Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord H A R V E S T E R R E P O S I T O R Y OAI OAI Service Provider Metadata Provider HTTP Request HTTP Response (OAI Verb) (Valid XML) OAI Provider Architectures:  OAI Provider Architectures Descriptive Metadata OAI Administrative Metadata, e.g., Ids, datestamps, sets, formats OAI Harvesters A few projects using OAI-PMH:  A few projects using OAI-PMH Basic building block of the National Science Digital Library Large-scale implementations in E-Prints, OLAC, NDLTD, … Built into ENCompass, ContentDM, Michigan’s DLXS, D-Space, and other products Open Archives Forum in Europe; will be part of federation activities in the UK and EU Univ. of Illinois OAI Metadata Harvesting Project:  Univ. of Illinois OAI Metadata Harvesting Project Funded by Andrew W. Mellon Foundation (July 2001 – May 2003) Primary objectives: Develop & make available OAI harvesting tools Build search services for aggregated metadata in the domain of cultural heritage Examine metadata aggregation issues, including use of EAD in OAI context Investigate utility of aggregated metadata, including preliminary testing with end-users Type of resources:  Type of resources 39 data providers academic libraries Museums / cultural orgs digital libraries public library 1.1 million original DC records + 1.5 million derived from EAD Variations in DC element usage:  Variations in DC element usage Records containing subject & description element Many different controlled and local vocabularies in use Granularity: a record may describe a collection of coins — or one coin Excerpt of a metadata record describing a cotton coverlet:  Excerpt of a metadata record describing a cotton coverlet Description: Digital image of a single-sized cotton coverlet for a bed with embroidered butterfly design. Handmade by Anna F. Ginsberg Hayutin. Source: Materials: cotton and embroidery floss. Dimensions: 71 in. x 86 in. Markings: top right hand corner has 1 1/2 in. x 1/2 in. label cut outs at upper left and right hand side for head board; fabric is woven in a variation of a rib weave; color each of yellow and gray; hand-embroidered cotton butterflies and flowers from two shades of each color of embroidery floss - blue, pink, green and purple and single top 20 in. bordered with blue and black cotton embroidery thread; stitches used for embroidery: running stitch, chain stitch, French knot and back stitches; selvage edges left unfinished; lower edges turned under and finished with large gray running stitches made with embroidery floss. Format: Epson Expression 836 XL Scanner with Adobe Photoshop version 5.5; 300 dpi; 21-53K bytes. Available via the World Wide Web. Coverage: — Date Created: 2001-09-19 09:45:18; Updated: 20011107162451; Created: 2001-04-05; Created: 1912-1920? Type: Image Excerpt of a metadata record describing "American woven coverlet“:  Excerpt of a metadata record describing "American woven coverlet“ Description: Materials: Textile--Multi, Pigment—Dye; Manufacturing Process: Weaving--Hand, Spinning, Dyeing, Hand-loomed blue wool and white linen coverlet, worked in overshot weave in plain geometric variant of a checkerboard pattern.Coverlet is constructed from finely spun, indigo-dyed wool and undyed linen, woven with considerable skill. Although the pattern is simpler, the overall craftsmanship is higher than 1934.01.0094A. - D. Schrishuhn, 11/19/99 This coverlet is an example of early "overshot" weaving construction, probably dating to the 1820's and is not attributable to any particular weaver. -- Georgette Meredith, 10/9/1973 Source: — Format: 228 x 169 x 1.2 cm (1,629 g) Coverage: Euro-American; America, North; United States; Indiana? Illinois? Date: Early 19th c. CE Type: cultural; physical object; original Implications:  Implications Service providers Automatically normalize metadata encoding where possible (e.g., dates) Normalize for and co-locate by type / format where possible Metadata providers Create metadata for interoperability Consider more expressive schema – e.g., Qualified DC, MARC Original interface:  Original interface Portal had two search pages—simple (keyword) and advanced. Pilot study with student teachers:  Pilot study with student teachers 23 users in honors-level C&I class Assignment: Use the site in preparing a lesson plan (high school social studies) __________ Introduced to “aggregated metadata” concept Focus group interviews conducted Students’ papers examined Transaction logs analyzed Results of initial user testing:  Results of initial user testing 1. Users expected all links pointed to digital objects Some records pointed to finding aids Some records pointed to collection’s web site Some records described analog objects 2. Users unable to make use of search results Simple searches produced 1000s of unranked results Advanced search (with limits) rarely used 3. Distinction between portal and data providers unimportant to users What does “online access” mean?:  What does “online access” mean? To librarian & curator To student teacher Response to test results:  Response to test results EAD-derived records segregated Analog only collections excluded Categories of resource types reduced to 3: Images and Video Text, Sheet Music, and Websites Museums and Archival Collections Revised interface:  Revised interface Simple keyword & advanced search put on one page Clarify “online access” Natural language in Boolean operators Revised search results:  Revised search results Link goes to finding aid or collection page? “Learn more.” Link displays object? “View item.” Subj/Desc expanded IMLS Digital Collections & Content:  IMLS Digital Collections & Content Build a registry of all National Leadership Grant collections with digital content. Assist and guide NLG projects in making item-level metadata sharable using OAI. Build a repository and search & discovery tools for integrated access to the content of NLG collections (unique metadata schema?). Research best practices for sharing metadata about diverse digital content and for supporting the interests of diverse user communities. Slide34:  http://imlsdcc.grainger.uiuc.edu/ CIC OAI metadata harvesting:  CIC OAI metadata harvesting Univ. of Illinois at UC will host an OAI-PMH metadata harvesting service for 10 CIC libraries Project Goals (3 year experimentation phase) Improve access to selected resources at CIC libraries Advertise these resources (internally & externally) Prepare member institutions for future grant-mandated OAI-based resource sharing Serve as a useful testbed for experimentation with OAI-PMH, development of metadata best practices, usability and user needs testing, etc. Using OAI-PMH to Aggregate Metadata Describing Cultural Heritage Resources:  Using OAI-PMH to Aggregate Metadata Describing Cultural Heritage Resources http://dli.grainger.uiuc.edu/Publications/TWCole/ALA2003OAI/ Timothy W. Cole ([email protected]) University of Illinois at Urbana-Champaign

Related presentations


Other presentations created by Alien

Physical Security Lecture
05. 01. 2008
0 views

Physical Security Lecture

GREEK THEATRE
15. 10. 2007
0 views

GREEK THEATRE

Singapore National Symbols
14. 09. 2007
0 views

Singapore National Symbols

Origins of the Cold War
23. 12. 2007
0 views

Origins of the Cold War

CG43SlideSet
30. 04. 2008
0 views

CG43SlideSet

kaiser pres
28. 04. 2008
0 views

kaiser pres

GoldDifferences
22. 04. 2008
0 views

GoldDifferences

visn8
17. 04. 2008
0 views

visn8

Nov24 Regulatory approaches
16. 04. 2008
0 views

Nov24 Regulatory approaches

dr rom
14. 04. 2008
0 views

dr rom

file 6943
13. 04. 2008
0 views

file 6943

The Peak Oil Context Tom Petrie
10. 04. 2008
0 views

The Peak Oil Context Tom Petrie

H106g
09. 04. 2008
0 views

H106g

JapaneseGeography
07. 04. 2008
0 views

JapaneseGeography

Hamburg 2007
14. 09. 2007
0 views

Hamburg 2007

lfg
14. 09. 2007
0 views

lfg

Eddie Final Presentation
14. 09. 2007
0 views

Eddie Final Presentation

chalmers
14. 09. 2007
0 views

chalmers

The Rain Forest Final
14. 09. 2007
0 views

The Rain Forest Final

ECAKnowledgeFair
12. 10. 2007
0 views

ECAKnowledgeFair

Ch18part1
15. 10. 2007
0 views

Ch18part1

WNV AVB 02212006
21. 10. 2007
0 views

WNV AVB 02212006

giraffe pp
14. 09. 2007
0 views

giraffe pp

giraffe powerpoint
14. 09. 2007
0 views

giraffe powerpoint

giraffe
14. 09. 2007
0 views

giraffe

COOL STUFF ABOUT GIRAFFES
14. 09. 2007
0 views

COOL STUFF ABOUT GIRAFFES

ub041104
23. 10. 2007
0 views

ub041104

STORY OF THEME AND PLOT
23. 10. 2007
0 views

STORY OF THEME AND PLOT

PhiladelphiaZooPPP
14. 09. 2007
0 views

PhiladelphiaZooPPP

qu10 11
01. 12. 2007
0 views

qu10 11

Angelos CME Energetics
02. 11. 2007
0 views

Angelos CME Energetics

pptPanama s
22. 10. 2007
0 views

pptPanama s

hirotani
13. 11. 2007
0 views

hirotani

bon2003 mpls
29. 10. 2007
0 views

bon2003 mpls

PROF AZZA
23. 10. 2007
0 views

PROF AZZA

Fenton
29. 10. 2007
0 views

Fenton

Countering Offshore
29. 12. 2007
0 views

Countering Offshore

walters082902
23. 11. 2007
0 views

walters082902

razbash
26. 11. 2007
0 views

razbash

DVMRPandMOSPF
01. 01. 2008
0 views

DVMRPandMOSPF

One 783Ngupta
04. 01. 2008
0 views

One 783Ngupta

Chapter 18 PPT
22. 10. 2007
0 views

Chapter 18 PPT

History of NAIS John Wiemers
20. 08. 2007
0 views

History of NAIS John Wiemers

BSP2D
14. 09. 2007
0 views

BSP2D

costarica1 ftparraud
22. 10. 2007
0 views

costarica1 ftparraud

mcmc2000a
06. 11. 2007
0 views

mcmc2000a

050317lc
16. 11. 2007
0 views

050317lc

fwing
22. 10. 2007
0 views

fwing

acute 060727 transfusionmed
23. 10. 2007
0 views

acute 060727 transfusionmed

bckexpk3b
09. 07. 2007
0 views

bckexpk3b

anorexia
09. 07. 2007
0 views

anorexia

070207 Adjektiv
09. 07. 2007
0 views

070207 Adjektiv

A Brachmann
09. 10. 2007
0 views

A Brachmann

mueller jun07
19. 10. 2007
0 views

mueller jun07

Late Classic Maya Collapse
16. 02. 2008
0 views

Late Classic Maya Collapse

ISLAS GALAPAGOS
14. 09. 2007
0 views

ISLAS GALAPAGOS

Heatingoilwebsection ppp
24. 02. 2008
0 views

Heatingoilwebsection ppp

PIndustrialTrucks
26. 02. 2008
0 views

PIndustrialTrucks

ethanap
14. 09. 2007
0 views

ethanap

Propulsion CEV
07. 11. 2007
0 views

Propulsion CEV

MichelleWatt
20. 02. 2008
0 views

MichelleWatt

newsletterfall04
11. 03. 2008
0 views

newsletterfall04

EC T9 2008 Conference Proposal
12. 03. 2008
0 views

EC T9 2008 Conference Proposal

drugstatistics
17. 12. 2007
0 views

drugstatistics

icfa chep06
23. 10. 2007
0 views

icfa chep06

Hubert CW8
14. 09. 2007
0 views

Hubert CW8

A mi Papi 2089
19. 06. 2007
0 views

A mi Papi 2089

An ode to Mothers
19. 06. 2007
0 views

An ode to Mothers

LoffPresentation
17. 10. 2007
0 views

LoffPresentation

Maschera
19. 06. 2007
0 views

Maschera

manual
19. 06. 2007
0 views

manual

Luces De Navidad 1848
19. 06. 2007
0 views

Luces De Navidad 1848

leer
19. 06. 2007
0 views

leer

Lean Six SigmaATL011706
19. 06. 2007
0 views

Lean Six SigmaATL011706

lexisnexis
05. 10. 2007
0 views

lexisnexis

OAT Presentation v5
19. 06. 2007
0 views

OAT Presentation v5

moscatelli
19. 06. 2007
0 views

moscatelli

moon split
19. 06. 2007
0 views

moon split

money plus
19. 06. 2007
0 views

money plus

MKCL
19. 06. 2007
0 views

MKCL

Journey of the Spirit Lesson 6
01. 10. 2007
0 views

Journey of the Spirit Lesson 6

2 Jornada BISHOP
10. 10. 2007
0 views

2 Jornada BISHOP

No esperes
19. 06. 2007
0 views

No esperes

Amores locos 1992
19. 06. 2007
0 views

Amores locos 1992

College English book 2 Unit 7
24. 02. 2008
0 views

College English book 2 Unit 7

A vista de pajaro II 2109
19. 06. 2007
0 views

A vista de pajaro II 2109

Ammosov Vladimir ammosov pra
12. 10. 2007
0 views

Ammosov Vladimir ammosov pra

Amber la mejor de todas
19. 06. 2007
0 views

Amber la mejor de todas

CP317 lecture 6 Huck II 05
11. 12. 2007
0 views

CP317 lecture 6 Huck II 05

AHQA031204Mck
09. 07. 2007
0 views

AHQA031204Mck

Evergreen
03. 01. 2008
0 views

Evergreen

04 NJIT3
02. 01. 2008
0 views

04 NJIT3

Poster A4 Glasgow nov04
04. 10. 2007
0 views

Poster A4 Glasgow nov04

Ally McBeal
09. 07. 2007
0 views

Ally McBeal

sara paige
14. 09. 2007
0 views

sara paige

36181003
24. 10. 2007
0 views

36181003

MusicApprecBaroque 2
22. 11. 2007
0 views

MusicApprecBaroque 2

ELE386 Malware
20. 08. 2007
0 views

ELE386 Malware

RohanShah
12. 10. 2007
0 views

RohanShah

1022MAS net big picture
03. 01. 2008
0 views

1022MAS net big picture

Lo Suficiente 1744
19. 06. 2007
0 views

Lo Suficiente 1744

gm3 jp item14 Mangrove ITTO
22. 10. 2007
0 views

gm3 jp item14 Mangrove ITTO

2005AuditResults
09. 07. 2007
0 views

2005AuditResults

HABIC1 summary
17. 11. 2007
0 views

HABIC1 summary

aro ald informalsession
24. 10. 2007
0 views

aro ald informalsession

etu ambassadeurs juin 07 en
13. 03. 2008
0 views

etu ambassadeurs juin 07 en

Gobert poster
03. 10. 2007
0 views

Gobert poster

Kistenev
15. 11. 2007
0 views

Kistenev

6 History of Chemistry I
12. 10. 2007
0 views

6 History of Chemistry I

Jan2000report
04. 01. 2008
0 views

Jan2000report

course 4
03. 01. 2008
0 views

course 4