
|
INTRO |
Spam Indexing |
Mark Up |
MetaTags |
SGML/XML |
Dublin Core |
PICS |
Metametadata |
..RDF RDF RDF RDF RDF
|
|
What is metadata? Metametadata?
What is W3C, what do they do? What are some current metametadata applications? Given what you know of metadata, cataloging, and indexing - what are the strengths and weaknesses of the system? |
RDF is metametadata. It permits the incorporation of various metadata
schemes into a single format. RDF is written in XML and uses the html format.
Note
Table 1, the Dublin Core and metatag markup
for the index page for this course is presented. Table
2 presents the the Dublin Core but not metatag data in RDF markup.
Table
3 presents us with the triples for each metadata element. Triples restate
in a different format the metadata catagory (creator, date, format, etc),
the source URL analyzed, and the value.
|
The purpose of RDF is for the "metamessage" to carry with it enough defining information about the metadata (hence to coin a phases metametadata) for cross communication. It is therefore a crosswalk between various metadata systems in addition to a defining metadata scheme.
RDF is defined by "schemas." Dublin Core is one such schema, and note in Table 2 that Dublin Core metadata are so identified, for example the rights statement is rendered "dc:rights="Copyright Wallace Koehler 2000 All Rights". It is concievable that a second metascheme might use the same "rights" label. That second metadata language -- let's call it OUCore -- might use the "rights" field not for a copyright statement, but to define property easements. RDF would interpret the ou:rights and dc:rights statements as representing different metadata catagories and would map them appropriately. Had my Web page carried PICS classifications, RDF would interpret the ratings codes.
Much of the above is derived from: http://www.w3.org/Metadata/Activity.html
Resource Description Framework
According to the World Wide Consortium (W3Ca) "Resource Description
Framework (RDF) is a foundation for processing metadata; it provides interoperability
between applications that exchange machine-understandable information on
the Web." RDF is metametadata. It permits the incorporation of various
metadata schemes into a single format. RDF is written in XML and uses the
html format.
The purpose of RDF is for the "metamessage" to carry with it enough defining information about the metadata (hence to coin a phases metametadata) for cross communication. It is therefore a crosswalk between various metadata systems in addition to a defining metadata scheme.
RDF is defined by "schemas." Dublin Core is one such schema. For example
a rights statement might be rendered in Dublin Core: "dc:rights="Copyright
John Doe 2002 All Rights Reserved". It is conceivable that a second metascheme
might use the same "rights" label. That second metadata language -- let's
call it ACore -- might use the "rights" field not for a copyright statement,
but to define property easements. RDF would interpret the a:rights and
dc:rights statements as representing different metadata categories and
would map them appropriately.
UniSys - see http://www.unisys.com/content/default-02.asp
Prowler - see http://www.infozone-group.org/prowlerDocs/html/proposal.html
XCM (aka eXended Content Management) - a cooperative venture among corporate vendors that includes metadata standards for the B-2-C push market. see - http://www.vignette.com/CDA/Site/0,2097,1-1-30-1458-1146-1743,00.html or http://www.planetit.com/techcenters/docs/enterprise_apps_systems-enterprise_apps/news/PIT20001114S0020
RDF consists of three elements: resources, properties and statements.
The resource is the thing being described and may include Web pages
or parts of Web pages.The follow in Table 4 describes these elements for
my ethics page. The rdf and /rdf
headers are marked in red. The resource
definition is in orange. This defines the source of the information. Properties,
shown
in green, describe the resource. Statements
are the combination of properties and resources.This is everything between
the red mark up. This metadata statement can be read by looking at the
page source code (including Dublin Core mark up) at http://www.ou.edu/cas/slis/ethics/EthicsBibOrg.htm
A W3C publication describes RDF Schemas for two mark ups: XML Serialization and Dublin Core. We must therefore consider RDF a standard in progress, but one that will be more fully developed over time. If indeed it becomes a metametadata standard, it could significantly improve the transfer of metadata and therefore document description among Web systems.
An RDF converter
is available at http://www.ukoln.ac.uk/cgi-bin/dcdot.pl?biblinkmode=on
ZSTARTS is a simplified variation on the Z39.50 standard designed for
Web-based applications (Denenberg 1996). It can be used to support searches
in metasearch engines and results filtering.
The semantic Web concept carries interesting baggage - a combination of relational databases and concept thasauri interlinked via hypertext to provide very specific meanings and to interrelate Web documents. The semantic Web differs from what Berners-Lee, Hendler, and Lassila call the "traditional knowledge-representation systems." Traditional knowledge-representation systems are "stifling," they say, because these systems require specific and agreed definitions for the thesaurus terms. These systems are also inherently centralized and bureaucratized. Bureaucratic and centralized systems are inherently slow to change and frought with debate and delay. Berners-Lee et al. are right. Centralized systems are slow to change - there is an ample literature that points to the experiences of say Dublin Core as it adapts and changes. Or, consider the history of change and modification for DDC or LCSH.
Instead, the semantic Web, accoding to Berners-Lee et al., will accept "...that paradoxes and unanswerable questions are a price that must be paid to achieve versatility." However, "[t]he challenge of the Semantic Web, therefore, is to provide a language that expresses both data and rules for reasoning about the data and that allows rules from any existing knowledge-representation system to be exported onto the Web." [cite] The Semantic Web, therefore, is conceived of as a middle ground between "stifling bureaucracy" and the "Wild Wild Web."
The power of RDF and, because it is an RDF based system, the Semantic Web lies in its use of "ontologies" -- the metametadata containers. Ontologies serve as metadata and data "containers" as well as providing definitional guidance to standardized definititional crosswalks to operate. Berners-Lee et al. provide the example of differences between "zip codes" and "postal codes" as metadata and the need for a common framework to understand their commonalities. Ontologies can be used to define much broader concepts.
So, what's good about the Semantic Web? It offers further elaboration of metadata and metametadata definition and exchange. What's wrong with it? It continues to be dependent on Web creator participation and assumes an expertise and sophistication one the part of those same Web creators that may be unwarrented.
That said, this one bears watching. It might even be smart to begin
to participate in the formulation of Semantic Web standards.
Charlotte Jenkins, Mike Jackson, Peter Burden, Jon Wallis. (n.d.) Automatic RDF Metadata Generation for Resource DiscoverySchool of Computing & IT, University of Wolverhampton. Available: http://www.scit.wlv.ac.uk/~ex1253/rdf_paper/
Paepcke, Andreas, Steve B. Cousins, Hector Garcia-Molina, Scott W. Hassan, Steven P. Ketchpel, Martin Röscheisen, and Terry Winograd (1996). “Using Distributed Objects for Digital Library Interoperability.” Computer. Available: http://computer.org/computer/dli/r50061/r50061.htm
Van de Sompel, Herbert and Carl Lagoze (2001). The Open Archives Initiative Protocol for Metadata Harvesting, Protocol Version 1.1 of 2001-07-02. Available: http://www.openarchives.org/OAI/openarchivesprotocol.htm#Record
W3C, Resource Description Framework (RDF) Schema Specification, W3C Proposed Recommendation 03 March 1999, Available: http://www.w3.org/TR/1998/WD-rdf-schema/
SemanticWeb.org Portal. http://www.SemanticWeb.org/
Ontology.org. http://www.ontology.org/ (B2B Resource)
Berners-Lee, Tim, James Hendler and Ora Lassila "The Semantic Web"Scientific American, May 2001. http://www.scientificamerican.com/2001/0501issue/0501berners-lee.html
Markup Languages and Ontologies, http://www.SemanticWeb.org/knowmarkup.html
W3C, Semantic Web Activity, http://www.w3.org/2001/sw/
The SHOE [Simple HTML Ontology Extensions] FAQ. http://www.cs.umd.edu/projects/plus/SHOE/faq.html#q1.1
SHOE Ontologies. http://www.cs.umd.edu/projects/plus/SHOE/onts/index.html#base