04.08.08
Google Summer Of Code LaTeX Project
The TeX Users Group will be participating as a mentoring organization in the Google Summer of Code program, and my project of a Dublin Core metadata interface was accepted. I’m hoping some enterprising youth with lots of experience in programming TeX will be interested in the project.
Here is an overview of the project: Dublin Core is a scheme for expressing metadata (data about data) in a reliable, machine-readable way. It can be expressed in various languages, including RDF, which can itself be serialized in XML. Adobe’s eXtensible Metadata Platform allows RDF+XML statements to be embedded into PDF files. This metadata can then be read by search engines in order to give context to the document besides its simple contents.
TeX and LaTeX can be used to produce PDF documents, and have rich macro-programming environments. One goal of the project is to add simple commands which can write the complex metadata statements into the PDF without the user needing to know anything about RDF, XML, or Dublin Core itself. Another goal would be for more advanced users to augment this interface with new Dublin Core element sets and vocabularies.
I’ve always taken an interest in the semantic web, and apparently there aren’t too many people in this group also interested in TeX. So if the project flies I think it could be very useful.
Tags: latex, dublin core, semantic web