User:Jeff Hammerbacher
Contents
User Info
Jeff Hammerbacher used to work as a quant in Manhattan. He now works as a Research Scientist at Facebook in Palo Alto. He's very interested in assisting the CC Developer community. You can reach him at jeff dot hammerbacher at gmail dot com.
To Do
- Educate myself about technology used for CC licensing.
- Information Sharing on the Semantic Web, by Heiner Stuckenschmidt and Frank van Harmelen.
- Creating the Semantic Web with RDF: Professional Developer's Guide by Johan Hjelm.
- Visualizing the Semantic Web, edited by Vladimir Geroimenko and Chaomei Chen.
- Ontological Engineering, by Asuncion Gomez-Perez, Oscar Corcho, and Mariano Fernandez-Lopez.
- Understand ccPublisher codebase, related technologies
- Via Nathan: get a first extension written that would do something like post to a user-specifed blog whenever you upload something (ala' "ping" ing)
- Get ccPublisher2 running from source
- Check the Extending ccPublisher 2 documentation
- Detailed understanding of ccPublisher2 API
Progress
Read the Information Sharing and RDF books, both were atrocious. W3C and Wikipedia content much more informative. Halfway through "Visualizing the Semantic Web", and obtained second edition. Began posting notes and links below from the Information Sharing book.
Polished my Python and identified first project to understand: ccPublisher. Read through wiki page and Nathan's talk.
Notes and Links
Information Sharing on the Semantic Web
This book's one good idea is that ontologies should be modular and have a separate layer to allow for easy integration with other ontologies. Also has interesting section on the ontology of statistics. The rest of the book lays out platitudes and mainly references other German researchers. The beginning of my notes (to be continued):
- Preface
- The main thesis of this book is that the problem of information sharing (finding and integration) is only solvable by giving the computer better access to the semantics of the information.
- Part 1: Information Sharing and Ontologies
- Chapter 1: Semantic Integration
- Explicit representations of information semantics are needed in a weakly structured environment.
- Focus on semantic integration and content-based filtering.
- Heterogeneity studied by intelligent information integration and distributed database communities.
- Heterogeneity problems can be divided into three categories: syntax, structure, and semantics.
- Syntax problems solved by standards: ODBC, HTML, SGML->XML (XML Schema, DTD), RDF (RDF Schema), etc.
- XML provides the framework but not the semantics.
- XML Schema constrains the structure of XML documents while RDF Schema defines the vocabulary used in RDF data models.
- Further Reading:
- The Semantic Web - On the respective Roles of XML and RDF
- Intelligent Integration of Information
- Information Retrieval: Data Structures & Algorithms
- Ontologies: Silver Bullet for Knowledge Management and Electronic Commerce
- The Design Space of Frame Knowledge Representation Systems
- The Description Logic Handbook
- Chapter 1: Semantic Integration