Difference between revisions of "Summer of Code 2013"

From Creative Commons
Jump to: navigation, search
(Ideas for GSoC 2013)
Line 1: Line 1:
 
Creative Commons is participates in Google's Summer of Code as a mentoring organization. Student submissions for SoC 2013 will take place April-May; see the [http://www.google-melange.com/gsoc/homepage/google/gsoc2013 GSoC 2013 homepage] for more details.
 
Creative Commons is participates in Google's Summer of Code as a mentoring organization. Student submissions for SoC 2013 will take place April-May; see the [http://www.google-melange.com/gsoc/homepage/google/gsoc2013 GSoC 2013 homepage] for more details.
  
This year Creative Commons is planning to launch new products and tools to help authors and consumers use, share, and remix content. Of interest are ideas that revolve around:
+
This year Creative Commons is planning to launch new products and tools to help authors and consumers use, share, and remix content. Here are some ideas to get you started.
 +
 
 +
=== Project: Media Widget ===
 +
 
 +
; Brief explanation
 +
: The CC media widget will allow content from a user's CC homepage (a product we're working on) to be embedded into other sites, like Tumblr, WordPress, etc. Our main interest is in specialized widgets that excel at displaying particular media types, not a generic "file list" widget. For example, a great image widget probably looks and feels different from an academic paper widget.
 +
 
 +
; Expected results
 +
: A widget that is able to take CC licensed files and their metadata and visualize it in neat ways that users find compelling. The widget must also include the licensing metadata/attribution information: which license it's released under, the author's name/handle, and so on. Clever ways of displaying this information in a way that is accessible but not annoying will be a plus.
 +
 
 +
; Knowledge Prerequisite
 +
: JavaScript, node.js
 +
 
 +
; Skill level
 +
: medium to low.
 +
 
 +
; Mentor
 +
: Dan Mills or other CC tech staff member
 +
 
 +
=== Project: Media Fingerprinting Library ===
 +
 
 +
; Brief explanation
 +
: CC would prefer that all content on the Web include correct licensing metadata. Alas, that is not the case. So we're interested in code that will allow us to identify a given item across the Web, even if there's no metadata alongside it. The tricky part is: people often crop or resize images, clip videos, re-encode content, or quote only pieces of text. So a simple hash is not sufficient: we need more intelligent fuzzy matching. That's what this project is about.
 +
 
 +
; Expected results
 +
: A library that provides two methods: 1) Given a media file, output a fingerprint, and 2) Given a file and a fingerprint, return the likelihood of the file matching the original file. You can focus your efforts on only one or two media types, or you can do more if it's possible.
 +
 
 +
: The library can be in a low-level language (C/C++) or you can use a higher-level language (JavaScript) if it's feasible. Speed is not a major concern at this point.
 +
 
 +
: Bonus: An additional API/method to detect content inside other files (e.g., a PowerPoint file that includes a CC licensed image, or a still image inside a video).
 +
 
 +
; Knowledge Prerequisite
 +
: Media formats/encodings, JavaScript, C/C++.
 +
 
 +
; Skill level
 +
: high.
 +
 
 +
; Mentor
 +
: Dan Mills or other CC tech staff member
 +
 
 +
=== Project: JS Attribution Helper ===
  
* Widgets that embed CC content across sites.
 
* Fingerprinting code that is able to identify the same content across the Web, even if changes have been applied or the content is embedded inside other content.
 
 
* Code to dynamically detect CC content on Web pages and add CC buttons/notices.
 
* Code to dynamically detect CC content on Web pages and add CC buttons/notices.
 +
 +
=== Project: Metadata Embedding Library ===
 +
 
* Code to embed licensing metadata inside various file formats (e.g., .png).
 
* Code to embed licensing metadata inside various file formats (e.g., .png).
 +
 +
=== Project: Donation Widget ===
 +
 
* Systems that enable donations to content authors.
 
* Systems that enable donations to content authors.

Revision as of 03:34, 2 April 2013

Creative Commons is participates in Google's Summer of Code as a mentoring organization. Student submissions for SoC 2013 will take place April-May; see the GSoC 2013 homepage for more details.

This year Creative Commons is planning to launch new products and tools to help authors and consumers use, share, and remix content. Here are some ideas to get you started.

Project: Media Widget

Brief explanation
The CC media widget will allow content from a user's CC homepage (a product we're working on) to be embedded into other sites, like Tumblr, WordPress, etc. Our main interest is in specialized widgets that excel at displaying particular media types, not a generic "file list" widget. For example, a great image widget probably looks and feels different from an academic paper widget.
Expected results
A widget that is able to take CC licensed files and their metadata and visualize it in neat ways that users find compelling. The widget must also include the licensing metadata/attribution information: which license it's released under, the author's name/handle, and so on. Clever ways of displaying this information in a way that is accessible but not annoying will be a plus.
Knowledge Prerequisite
JavaScript, node.js
Skill level
medium to low.
Mentor
Dan Mills or other CC tech staff member

Project: Media Fingerprinting Library

Brief explanation
CC would prefer that all content on the Web include correct licensing metadata. Alas, that is not the case. So we're interested in code that will allow us to identify a given item across the Web, even if there's no metadata alongside it. The tricky part is: people often crop or resize images, clip videos, re-encode content, or quote only pieces of text. So a simple hash is not sufficient: we need more intelligent fuzzy matching. That's what this project is about.
Expected results
A library that provides two methods: 1) Given a media file, output a fingerprint, and 2) Given a file and a fingerprint, return the likelihood of the file matching the original file. You can focus your efforts on only one or two media types, or you can do more if it's possible.
The library can be in a low-level language (C/C++) or you can use a higher-level language (JavaScript) if it's feasible. Speed is not a major concern at this point.
Bonus: An additional API/method to detect content inside other files (e.g., a PowerPoint file that includes a CC licensed image, or a still image inside a video).
Knowledge Prerequisite
Media formats/encodings, JavaScript, C/C++.
Skill level
high.
Mentor
Dan Mills or other CC tech staff member

Project: JS Attribution Helper

  • Code to dynamically detect CC content on Web pages and add CC buttons/notices.

Project: Metadata Embedding Library

  • Code to embed licensing metadata inside various file formats (e.g., .png).

Project: Donation Widget

  • Systems that enable donations to content authors.