Web of Data for E-Commerce Tutorial ISWC2009
ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief
A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey
October 25, 2009, Westfields Conference Center near Washington, DC, USA
Organizers: Martin Hepp and Richard Cyganiak
Video Recording of the Event
If you missed the tutorial at ISWC 2009, you can now watch a video recording of the full tutorial. It consists of three parts:
Part 1: Overview, motivation, technology basics (suitable for business audiences and developers).
Part 2: The GoodRelations vocabulary in detail (suitable mostly for developers and data modeling experts; developers with a good grasp of GoodRelations could start directly with this part).
Part 3: Publishing and consuming GoodRelations data (suitable mostly for developers).
Abstract
In this tutorial, we will (1) explain the immediate business benefits of joining the Web of Data for Web shops, manufacturers of commodities, and service providers of any kind, (2) show how any commercial Web site can embed details of its business and offerings as RDFa metadata using the GoodRelations ontology, and (3) demonstrate the usage of the resulting data in multiple applications, namely Yahoo! SearchMonkey, queries on Semantic Web data repositories, Mashups, and the import from and export to popular Web shop software. Participants will learn how to use the GoodRelations ontology to augment Web shops and other Web applications with metadata on business entities, products and services, prices, warranty, shop locations, terms and conditions, etc. This will improve the visibility of an offering in next generation Web search engines, allow more precise search, and support partners in the value chain to extract and reuse product model data easily. At the same time, the tutorial will explain the modeling of more complex RDF patterns in RDFa.
The tutorial will also serve as a self-contained introduction to what the Web of Data is, what benefits it will provide for businesses, and why now is the time to get involved.
Current Relevance
Three recent key developments in semantic technology create a need for broad audiences to acquire a solid understanding of the presented technology, along with the corresponding practical skills:
- RDFa has become a W3C Recommendation: This means there is now a stable, standard syntax for embedding RDF metadata into XHTML Web content, which paves the way to adoption by mainstream Web developers.
- GoodRelations ontology release and adoption: The GoodRelations ontology has been released and is experiencing strong support from major vendors and initiatives from the Semantic Web community and traditional corporations.
- Yahoo! SearchMonkey: Due to the official endorsement of GoodRelations by Yahoo! SearchMonkey, there is now an immediate, easy-to-communicate incentive for any business in the world to add such metadata.
Tutorial Description
The GoodRelations ontology is now being adopted by major technology vendors and allows more precise product and services search, and frictionless product data interchange on the Web. Unlike previous proposals, GoodRelations is stable and mature, and runs on current Semantic Web and Web infrastructure. There is also a direct business incentive to add such metadata right away, since Yahoo! SearchMonkey will crawl GoodRelations annotations and use them to display additional details of an offering. With RDFa being a W3C Recommendation, there is now a powerful standard syntax for embedding this data into existing Web content.
In this tutorial, participants will learn how to use the GoodRelations ontology to augment Web shops and other Web applications with metadata on business entities, products and services, prices, warranty, shop locations, terms and conditions, etc. This will improve the visibility of an offering in next generation Web search engines, allow more precise search, and support partners in the value chain to extract and reuse product model data easily.
We will explain the theoretical background and give hands-on, step-by-step instructions on augmenting existing static and dynamic Web sites with detailed Semantic Web metadata in RDFa. Then, we will show how Yahoo! SearchMonkey applications can use this metadata to improve the appearance, detail, and visibility of an offering for precise queries.
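As a rough illustration of what such an annotation might look like on a dynamic site, the following Python sketch renders a small XHTML+RDFa fragment for one business entity and one offering. The GoodRelations terms used (gr:BusinessEntity, gr:legalName, gr:offers, gr:Offering, gr:hasBusinessFunction, gr:hasPriceSpecification, gr:UnitPriceSpecification, gr:hasCurrency, gr:hasCurrencyValue) come from the vocabulary. The function name, the fragment identifiers, the example URI, and the price are assumptions made up for illustration, and the markup is one plausible arrangement rather than the pattern taught in the tutorial.

```python
from xml.sax.saxutils import escape, quoteattr

GR_NS = "http://purl.org/goodrelations/v1#"


def render_offer_rdfa(page_uri, legal_name, offer_label, price, currency):
    """Return an XHTML+RDFa fragment for one company and one offering.

    Hypothetical helper: the argument names and the #company / #offer
    fragment identifiers are illustrative assumptions.
    """
    company = quoteattr(page_uri + "#company")  # quoted, escaped attribute value
    offer = quoteattr(page_uri + "#offer")
    amount_text = f"{price:.2f}"                # e.g. 19.99
    amount = quoteattr(amount_text)
    # Nested elements with rel/typeof chain the resources together:
    # company -> gr:offers -> offer -> gr:hasPriceSpecification -> (blank node)
    return f"""\
<div xmlns:gr="{GR_NS}"
     xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
     xmlns:xsd="http://www.w3.org/2001/XMLSchema#">
  <div about={company} typeof="gr:BusinessEntity">
    <span property="gr:legalName">{escape(legal_name)}</span>
    <div rel="gr:offers">
      <div about={offer} typeof="gr:Offering">
        <span property="rdfs:label">{escape(offer_label)}</span>
        <span rel="gr:hasBusinessFunction" resource="{GR_NS}Sell"></span>
        <div rel="gr:hasPriceSpecification">
          <div typeof="gr:UnitPriceSpecification">
            <span property="gr:hasCurrency" datatype="xsd:string"
                  content={quoteattr(currency)}></span>
            <span property="gr:hasCurrencyValue" datatype="xsd:float"
                  content={amount}>{amount_text} {escape(currency)}</span>
          </div>
        </div>
      </div>
    </div>
  </div>
</div>
"""


# Example call with made-up values; the output can be pasted into a page
# and checked with an RDFa extractor.
print(render_offer_rdfa(
    "http://example.com/shop", "Miller Tax Consulting Inc.",
    "Office stapler", 19.99, "EUR"))
```

Generating the markup from a template like this keeps the visible text and the machine-readable values in sync, which is the main reason to embed the data in the page rather than publish it separately.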
Aims and Learning Goals
Participants will be able to use the GoodRelations conceptual structures and the RDFa syntax to augment static and dynamic Web sites with the relevant details of a commercial Web presence, e.g. the business entity, range of products and services, pricing and availability, etc. Since the GoodRelations ontology is much more sophisticated than simple vocabularies like FOAF or Dublin Core, this also introduces RDFa modeling patterns for more complex RDF structures. On the data consumption and usage side, the tutorial will explain how the resulting metadata will be considered by Semantic Web search engines, repositories, and indexing services, and how it can be usefully combined with other open data on the Web, namely sources from the LOD cloud.
Target Audience
The tutorial is suited for anybody with a basic understanding of HTML/XHTML markup languages and Web architecture. It is well suited for practitioners and researchers from adjacent fields who are seeking a self-contained, concise, and hands-on introduction to using the Semantic Web for their needs. For experienced Semantic Web researchers, the tutorial will provide proven recipes and modeling patterns for using the GoodRelations ontology for their projects, and insight into the more complex aspects of RDFa.
Presentation Method
We will use a combination of
- presentations with clearly stated learning goals,
- hands-on exercises,
- quizzes for a quick check of understanding, and
- a final group project
to develop the practical skills and theoretical background.
Technical Requirements
All participants should bring their own computer. The required software will be made available on this Web page prior to ISWC 2009.
Important: Please install at least the Twinkle tool on your computer and create bookmarks for the other tools from the software tools section below. You will need Internet access to use the tools and to complete the exercises.
Outline and Schedule
08:30-10:30 Part 1
- Overview and Motivation: Why the Web of Data is Now 30’
- Quick Review of Prerequisites 15’
- The GoodRelations Ontology: E-Commerce on the Web of Data 75’
10:30-10:45 Coffee Break
10:45-12:30 Part 2
- RDFa: Bridging the Web of Documents with the Web of Data 45’
- Expressing GoodRelations in RDFa: A Running Example 30’
- GoodRelations – Advanced Topics 30’
12:30-13:30 Lunch Break
13:30-16:00 Part 3
- Hands-on Exercise: Annotating a Web Shop 60’
- Querying the Web of Data for Offerings – SPARQL 15’
- Querying the Web of Data – Exercises 15’
16:00-16:30 Coffee Break
16:30-18:00 Part 4
- Publishing Semantic Web Data: Make Your RDF Available 30’
- Yahoo SearchMonkey and Yahoo BOSS 45’
- Discussion, Conclusion, Feedback Round 15’
Optional Topics
- Eligible Customers and Regions
- Delivery Options and Delivery Charge Specifications
- Payment Options and Payment Charge Specifications
- Warranty Promises
- Consumables, Accessories, Spare Parts, and Similar Products
- Creating GoodRelations-compliant Ontologies for Products and Services
- Reusing Catalog Group Structures
Materials
Software
- Twinkle: A SPARQL Query Tool
- Web Page: http://www.ldodds.com/projects/twinkle/
- Download: http://www.ldodds.com/projects/twinkle/twinkle-2.0-bin.zip
- Developer Info: http://code.google.com/p/twinkle-sparql-tools/
- Installation:
- Requires Java 1.5 or higher
- Download the distribution and unzip it into a new directory.
- Open a command-prompt and execute the following: java -jar twinkle.jar
- For our tutorial, please replace the file config.n3 in the "etc" subdirectory with the following file: config.n3.txt (file size: 10 KB, MIME type: text/plain). Rename it to config.n3 after downloading.
- RDF Validator (and Visualizer)
- http://www.w3.org/RDF/Validator/
- http://www.rdfabout.com/demo/validator/ (also supports N3/Turtle)
- GoodRelations Annotator
- PyRDFa: RDFa Extractor
- RDF2RDFa: turns RDF/XML content into simple RDFa snippets that can be easily pasted into any XHTML Web page
- RDF2DataRSS Conversion tool (can be used to create dataRSS for Yahoo from RDF/XML)
- Google Product Feed Converter: Converts feeds for Google Shopping into GoodRelations data for the Web
- BMEcat2GoodRelations: Converter that creates GoodRelations data from BMEcat 2005 XML catalogs
- RDF2XHTML service by Roberto Garcia: turns RDF/XML into XHTML+RDFa
Online Resources
- Wiki page
- GoodRelations Primer
- GoodRelations Documentation
- RDFa
- SPARQL
- Yahoo SearchMonkey
- Describe your data (voiD vocabulary)
Slides
- Part 1
- Slideshare: http://tr.im/griswc1
- PDF: iswc-gr1.pdf
- Part 2
- Slideshare: http://tr.im/griswc2
- PDF: iswc-gr2.pdf
- Part 3
- Slideshare: http://tr.im/griswc3
- PDF: iswc-gr3.pdf
- Part 4
- Slideshare: http://tr.im/griswc4
- PDF: iswc-gr4.pdf
Exercises
- Use the GoodRelations Annotator tool to create a basic description for a Web site of your choice.
- Optional: Paste the RDFa snippet into that page and publish it
- Optional: Extract the meta-data from the page as N3/Turtle using the PyRDFa tool and compare it with what you expected.
- Optional: Notify Sindice and Yahoo of the new page
- Optional: Download and publish the RDF/XML file variant from the Annotator.
- Optional: Notify PingTheSemanticWeb of the file once published.
- Optional: Create a DataRSS feed using the RDF2DataRSS tool, upload it, and register it using the Yahoo Site Explorer.
- Using Turtle & GoodRelations, describe the following business: Miller Tax Consulting Inc., having two stores, selling office supplies and tax consultancy.
- Convert the Turtle to RDF/XML and validate it.
- Convert the RDF/XML to RDFa using RDF2RDFa and paste it into a page of your choice.
- Create a SPARQL query that lists all business entities (gr:BusinessEntity) that have a Web page (using rdfs:seeAlso), and test the query using the LOD endpoint at http://lod.openlinksw.com/sparql
- Query the SPARQL endpoint at http://lod.openlinksw.com/sparql for the product properties and values for the product with the EAN/UPC code 8714574993836:
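# Prefix declarations; needed unless the query tool predefines them.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
PREFIX gr: <http://purl.org/goodrelations/v1#>
SELECT ?p ?v ?uom WHERE
{
# Find the product model with the given EAN/UPC code.
?uri rdf:type gr:ProductOrServiceModel .
?uri gr:hasEAN_UCC-13 "8714574993836"^^xsd:string .
# List each property that has a numeric value and a unit of measurement.
?uri ?p ?qv .
?qv gr:hasValueFloat ?v .
?qv gr:hasUnitOfMeasurement ?uom .
}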
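Outside Twinkle, queries like the one above can also be sent to an endpoint over HTTP using the SPARQL protocol. The sketch below uses only the Python standard library to build such a request for the product query, and also includes one possible answer to the gr:BusinessEntity exercise. The helper names are hypothetical, the prefix URIs are the usual ones for rdf, rdfs, xsd, and GoodRelations, and the endpoint is the LOD endpoint named in the exercises, which may no longer be reachable or may hold different data today.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "http://lod.openlinksw.com/sparql"  # LOD endpoint from the exercises

PREFIXES = """\
PREFIX rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX xsd:  <http://www.w3.org/2001/XMLSchema#>
PREFIX gr:   <http://purl.org/goodrelations/v1#>
"""

# One possible answer to the gr:BusinessEntity exercise (an assumption,
# not the tutorial's reference solution).
BUSINESS_ENTITIES = PREFIXES + """\
SELECT DISTINCT ?company ?page WHERE {
  ?company rdf:type gr:BusinessEntity .
  ?company rdfs:seeAlso ?page .
}
LIMIT 20
"""


def product_properties_query(ean):
    """Return the product query from the exercise for a given EAN/UPC code."""
    if not ean.isdigit():
        # Guards against injecting arbitrary text into the query string.
        raise ValueError("EAN/UPC codes consist of digits only")
    return PREFIXES + f"""\
SELECT ?p ?v ?uom WHERE {{
  ?uri rdf:type gr:ProductOrServiceModel .
  ?uri gr:hasEAN_UCC-13 "{ean}"^^xsd:string .
  ?uri ?p ?qv .
  ?qv gr:hasValueFloat ?v .
  ?qv gr:hasUnitOfMeasurement ?uom .
}}
"""


def build_request(query, endpoint=ENDPOINT):
    """Build a SPARQL protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def run_query(query, endpoint=ENDPOINT, timeout=30):
    """Send the query and return the result bindings (needs network access)."""
    request = build_request(query, endpoint)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)["results"]["bindings"]
```

Calling `run_query(product_properties_query("8714574993836"))` would then return one dictionary per result row, assuming the endpoint is reachable and supports JSON results. Building the request is kept separate from sending it so the generated URL can be inspected without network access.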
Additional Links and References
- http://www.w3.org/2008/Talks/1026-ISCW-RDFa/RDFa-ISWC08.html
- http://www.heppnetz.de/projects/goodrelations/primer/
- http://events.linkeddata.org/iswc2008tutorial/
- http://www.sembase.at/index.php/Uad
- http://www.slideshare.net/mediasemanticweb/quick-linked-data-introduction/
- http://www.romaframework.org/
- Google's announcement on supporting structured data (RDFa and microformats)
- Google guide on how to mark-up your site with RDFa
- submit your marked-up site to Google
Presenters
The tutorial will be delivered by Martin Hepp and Richard Cyganiak.
Martin Hepp
Martin Hepp is a professor of General Management and E-Business at Bundeswehr University Munich in Germany and a professor of Computer Science at the University of Innsbruck, Austria, where he leads the research group “Semantics in Business Information Systems”. Martin holds a Master’s degree in Business Management and Business Information Systems and a Ph.D. in Business Information Systems from the University of Würzburg (Germany). He has organized more than fifteen workshops and conference tracks on conceptual modeling, Semantic Web topics, and information systems, and has been a member of more than sixty conference and workshop program committees, including ASWC, ESWC, IEEE CEC/EEE, and ECIS.
Martin has taught more than 30 courses at the graduate and undergraduate level at universities in Germany, Austria, and in the USA.
Contact Details:
Prof. Dr. Martin Hepp
Chair of General Management and E-Business
E-Business and Web Science Research Group
Bundeswehr University Munich
Werner-Heisenberg-Weg 39
D-85579 Neubiberg, Germany
mhepp@computer.org
http://www.heppnetz.de (personal page)
http://www.unibw.de/ebusiness/ (group)
Phone: +49 89 6004-4217
Richard Cyganiak
Richard Cyganiak is a research engineer in the Linked Data Research Centre at DERI, NUI Galway, with an interest in web-scale information integration. He is a co-founder of the Linking Open Data initiative, a fellow of WSRI, and founder or major contributor to Linked Data related software projects D2RQ, Pubby, Neologism, Sigma, and Sindice.
Contact Details:
Richard Cyganiak
Linked Data Research Centre
Digital Enterprise Research Institute (DERI), NUI Galway, Ireland
Skype:richard.cyganiak
Phone: +353-91-49-5711
http://richard.cyganiak.de/
richard@cyganiak.de
