I have left Harvard as of July 1, 2008 to take a position at NYU. This website has been cached and left static. Feel free to browse my new website, aka "What the heck is a Clinical Associate Professor?"

04.08.08

Google Summer Of Code LaTeX Project

Posted in LaTeX, Web at 8:48 am by leingang

The TeX Users Group will be participating as a mentoring organization in the Google Summer of Code program, and the project I proposed, a Dublin Core metadata interface, was accepted. I’m hoping some enterprising youth with lots of experience in programming TeX will be interested in the project.

Here is an overview of the project: Dublin Core is a scheme for expressing metadata (data about data) in a reliable, machine-readable way. It can be expressed in various languages, including RDF, which can itself be serialized in XML. Adobe’s Extensible Metadata Platform (XMP) allows RDF/XML statements to be embedded into PDF files. Search engines can then read this metadata to give the document context beyond its raw contents.
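To give a feel for what those statements look like, here is a rough sketch in Python (not part of the project itself) that builds a small Dublin Core description in RDF/XML. The function name `build_dc_rdf` and the sample values are made up for illustration. The container choices (`rdf:Alt` for the title, `rdf:Seq` for creators, `rdf:Bag` for subjects) follow my reading of the XMP conventions, and a real XMP packet would also wrap the result in `xpacket` processing instructions, which I leave out here.

```python
import xml.etree.ElementTree as ET

# Namespace URIs used by RDF, Dublin Core, and the XMP wrapper element.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"
X_NS = "adobe:ns:meta/"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_dc_rdf(title, creators, subjects):
    """Return an RDF/XML string carrying a few Dublin Core elements.

    Hypothetical helper for illustration only; it covers just
    dc:title, dc:creator, and dc:subject.
    """
    # Ask ElementTree to emit readable prefixes instead of ns0, ns1, ...
    ET.register_namespace("x", X_NS)
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)

    xmpmeta = ET.Element(f"{{{X_NS}}}xmpmeta")
    rdf = ET.SubElement(xmpmeta, f"{{{RDF_NS}}}RDF")
    # An empty rdf:about means "the document this metadata is embedded in".
    desc = ET.SubElement(
        rdf, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": ""}
    )

    # Title: a language alternative with a single default entry.
    title_el = ET.SubElement(desc, f"{{{DC_NS}}}title")
    alt = ET.SubElement(title_el, f"{{{RDF_NS}}}Alt")
    li = ET.SubElement(alt, f"{{{RDF_NS}}}li", {f"{{{XML_NS}}}lang": "x-default"})
    li.text = title

    # Creators: an ordered list, since author order matters.
    creator_el = ET.SubElement(desc, f"{{{DC_NS}}}creator")
    seq = ET.SubElement(creator_el, f"{{{RDF_NS}}}Seq")
    for name in creators:
        ET.SubElement(seq, f"{{{RDF_NS}}}li").text = name

    # Subjects: an unordered collection of keywords.
    subject_el = ET.SubElement(desc, f"{{{DC_NS}}}subject")
    bag = ET.SubElement(subject_el, f"{{{RDF_NS}}}Bag")
    for keyword in subjects:
        ET.SubElement(bag, f"{{{RDF_NS}}}li").text = keyword

    return ET.tostring(xmpmeta, encoding="unicode")


if __name__ == "__main__":
    print(build_dc_rdf("A Sample Paper", ["A. Author"], ["TeX", "metadata"]))
```

Even for three fields the output is a fair amount of nested markup, which is the kind of thing an author should not have to type by hand.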

TeX and LaTeX can be used to produce PDF documents, and have rich macro-programming environments. One goal of the project is to add simple commands that write the complex metadata statements into the PDF without the user needing to know anything about RDF, XML, or Dublin Core itself. Another goal is to let more advanced users extend this interface with new Dublin Core element sets and vocabularies.

I’ve always taken an interest in the semantic web, and apparently not many people in that community are also interested in TeX. So if the project flies I think it could be very useful.
