Difference between revisions of "Metrics"
Paulproteus (talk | contribs) (→Quantitative metrics being prepared by Ankit as part of Summer of Code 2008) |
Paulproteus (talk | contribs) (→Quantitative metrics being prepared by Ankit as part of Summer of Code 2008) |
||
Line 26: | Line 26: | ||
== Quantitative metrics being prepared by Ankit as part of Summer of Code 2008 == | == Quantitative metrics being prepared by Ankit as part of Summer of Code 2008 == | ||
− | By the end of the summer, we hope to have a number of quantitative data from Ankit's Summer of Code 2008 project. These will hopefully include: | + | By the end of the summer, we hope to have a number of quantitative data from Ankit's Summer of Code 2008 project of analyzing our web server logs. These will hopefully include: |
* Historical data on license choices at the [http://creativecommons.org/license/ "License your work"] CC license chooser | * Historical data on license choices at the [http://creativecommons.org/license/ "License your work"] CC license chooser | ||
* Indications based on referrers to our image icons to see if people change the license on web pages or other licensed documents | * Indications based on referrers to our image icons to see if people change the license on web pages or other licensed documents | ||
− | * Indications based on image icon loading frequency as to which licenses are widely seen by users | + | * Indications based on image icon loading frequency as to which licenses are used by content widely seen by users |
− | * Analysis of search.creativecommons.org logs to see | + | * Analysis of search.creativecommons.org logs to see what sorts of licenses and items people search for - and if that has changed over time |
+ | * Analysis of license deeds to see if deed information affects choices in /license/ | ||
* Geographic sub-analysis of the above | * Geographic sub-analysis of the above | ||
* Code to reproducibly and consistently generate the above statistics. | * Code to reproducibly and consistently generate the above statistics. |
Revision as of 23:43, 30 June 2008
Approximate Minimum Total CC Licensed Works as of December 2007: 90 million |
The Metrics Portal is about gathering, processing and visualizing metrics about Creative Commons' related projects, with particular emphasis on the adoption and usage of Creative Commons licenses internationally. Join us!
The best place to communicate on this is through the cc-community mailing list for general discussion, cc-devel for technical discussion, and the #cc chat channel on irc.freenode.net. If really deep into this, please consider joining into the commons-research community. Join in the fun!
Contents
Aggregate web pages licensed
License statistics has basic information about linkback/page/search-engine-query-based license adoption statistics, as well as the code used to gather this data and dumps of the data itself.
Site-specific metrics
Content curators lists licensed works counts for many sites that collect CC-licensed works, e.g., about 69 million at Flickr.
Counts for selected top sites over time may be found on the license statistics page.
Eventually code above should be expanded to collect this and finer-grained information (e.g., per site license breakdown) from these sites.
Quantitative metrics being prepared by Ankit as part of Summer of Code 2008
By the end of the summer, we hope to have a number of quantitative data from Ankit's Summer of Code 2008 project of analyzing our web server logs. These will hopefully include:
- Historical data on license choices at the "License your work" CC license chooser
- Indications based on referrers to our image icons to see if people change the license on web pages or other licensed documents
- Indications based on image icon loading frequency as to which licenses are used by content widely seen by users
- Analysis of search.creativecommons.org logs to see what sorts of licenses and items people search for - and if that has changed over time
- Analysis of license deeds to see if deed information affects choices in /license/
- Geographic sub-analysis of the above
- Code to reproducibly and consistently generate the above statistics.
Other quantitative metrics
- creativecommons.org traffic:
- Number of journals, publishers (e.g., PLoS)
- Feature films that have been released in theaters.
- Universities?
- Press agencies?
- Add ideas and methodology for collecting metrics here.
Qualitative metrics
- Case Studies
- Awards Received (grammys, oscars, etc)
- "bestsellers" for some definition of best-seller from Books
- Add ideas here...
Data
- Queries against search engines for linkbacks to our licenses
- Scrubbed Creative Commons Apache logs
- Any other data sets? Add them here...
Tools
- ccTools used for license statistic gathering
- Add your tools here!
Numbers Explanation
The Approximate Minimum Total CC Licensed Works is based on licenses reported by Yahoo search queries and Flickr and is the minimum number of licensed works that would satisfy the distribution of license types across both of them.