TCGA Cancer Genomics Data in the Cloud

The properly rendered version of this document can be found at Read the Docs. If you are reading this on GitHub, read it there instead.

Use the power of BigQuery to analyze the wealth of data created by The Cancer Genome Atlas (TCGA) project!

The Institute for Systems Biology (ISB) has created and made public a dataset based on the open-access TCGA data, including somatic mutation calls, clinical data, mRNA and miRNA expression, DNA methylation, and protein expression from 33 different tumor types. It’s part of their Cancer Genomics Cloud, funded by the National Cancer Institute. They’ve also created public GitHub repositories so you can try out sample queries and analyses in R or Google Cloud Datalab.
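
As a rough illustration of what a sample query against such a dataset might look like, here is a minimal Python sketch that builds a BigQuery standard-SQL query counting rows per distinct value of a column. The table name `your-project.your_dataset.clinical_data` and the column name `tumor_type` are hypothetical placeholders, not names taken from the ISB dataset; substitute the real ones from the ISB documentation. The helper names `build_count_query` and `run_query` are also made up for this example. Running the query assumes the `google-cloud-bigquery` package is installed and Google Cloud credentials are configured.

```python
def build_count_query(table: str, column: str, limit: int = 10) -> str:
    """Return a standard-SQL query counting rows per distinct value of `column`."""
    return (
        f"SELECT {column}, COUNT(*) AS n\n"
        f"FROM `{table}`\n"
        f"GROUP BY {column}\n"
        f"ORDER BY n DESC\n"
        f"LIMIT {limit}"
    )


def run_query(sql: str):
    """Run `sql` in BigQuery and return the result rows as a list.

    Assumes google-cloud-bigquery is installed and credentials are set up.
    """
    # Imported lazily so the query builder above works without the package.
    from google.cloud import bigquery

    client = bigquery.Client()
    return list(client.query(sql).result())


# Hypothetical placeholders: replace with the real table and column names.
EXAMPLE_TABLE = "your-project.your_dataset.clinical_data"
EXAMPLE_COLUMN = "tumor_type"

sql = build_count_query(EXAMPLE_TABLE, EXAMPLE_COLUMN)
print(sql)
```

Calling `run_query(sql)` would submit the query under your own Google Cloud project. Building the SQL separately from running it makes it easy to inspect the query text before incurring any BigQuery usage.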

Google Cloud Platform data locations


Have feedback or corrections? All improvements to these docs are welcome! You can click on the “Edit on GitHub” link at the top right corner of this page or file an issue.

Need more help? Please see https://cloud.google.com/genomics/support.