Thursday, March 14, 2013

Candidate projects and data sources for course projects (in progress)

Bioquest
http://bioquest.org/2013schedule/

NCBI GEO data
 GEO includes gene expression, and NGS data
John Snow's cholera data
http://www.r-bloggers.com/john-snows-cholera-data-in-more-formats/

GHO Raw Data Download Web Service 

http://apps.who.int/gho/athena/

WHO childhood hunger data, used by Jeff Leek's course  
http://datadryad.org/
http://datadryad.org/resource/doi:10.5061/dryad.jr4dc oxidative stress and mutation

The loan data from Coursea.org data analysis course: 
For this analysis you will use the loans data available from here:

https://spark-public.s3.amazonaws.com/dataanalysis/loansData.csv
https://spark-public.s3.amazonaws.com/dataanalysis/loansData.rda

There is a code book for the variables in the data set available here:

https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf


http://cancergenome.nih.gov/

TCGA

linkedlifedata

data.gov

cancer genome atlas
http://cancergenome.nih.gov/

TCGA
https://github.com/tcga/tcga.github.com

http://imagejs.org/

Scientific survey data


No comments:

Post a Comment