Skip to Main Content

Depositing Data in Repositories: cBioPortal

How to submit to cBioPortal

Unlike other repositories on this list, cBioPortal does not allow users to make submissions of their own files directly to the repository software. Instead, they offer a variety of ways to use the software's capacity for your own data, such as by hosting your own instance of cBioPortal, or to suggest your own data for public viewing. 

The boxes below clarify your options for using cBioPortal. For additional support using cBioPortal, take a look at their tutorials page, which includes how-to videos, explanatory slide decks, webinar recordings, and more. 

Download and install a local version of cBioPortal

You can install an instance of cBioPortal using the code and documentation on GitHub, which is available under a Affero GPL V3 license. The frontend and backend have separate GitHub repositories. The full installation consists of the following components: 

  • backend
    • MySQL database
    • REST API written in Java Spring
    • Redis cache for storing frequently used queries (optional)
  • validator checks file formats before importing data into the database
  • frontend built with React, Mobx and Bootstrap
  • session service for storing user saved data such as virtual studies and groups
    • REST API written in Java Spring enabling retrieval and writing to the database
    • MongoDB database
  • cBioPortal also uses the APIs from various external services to provide more information about a variant

Installing a local version of cBioPortal requires system administration skills, such as installing and configuring Tomcat and MySQL. Consult the GitHub repositories and documentation for more assistance; however, the cBioPortal team does not have the capacity to offer technical support beyond the existing public documentation. 

Submit your public datasets for curation

You can suggest your datasets for curation by contacting cBioPortal administration. All data featured on cBioPortal is completely public and clinical data must be de-identified. 

You can view the list of datasets currently submitted for consideration at cBioPortal's DataHub. cBioPortal also actively curates datasets from the literature, curated from the data published with the manuscripts. Admin occasionally reaches out to investigators to acquire additional data, such as clinical attributes. All mutation calls (in VCF or MAF format) are processed through an internal pipeline to annotate the variant effects in a consistent way across studies.

Visualize and analysis your data with cBioPortal and 3rd party tools

cBioPortal suggests three additional web-based tools for data analysis and visualization:

  • OncoPrinter: a tool for generating oncoprints and performing mutual exclusivity analysis from your own data. Input genomic alteration, clinical, or heatmap data to generate a print. 
  • MutationMapper: a tool for interpreting mutations with protein annotations. Input tab-delimited or space-delimited mutation data and select a reference gene. Read more about data formatting guidelines and view example data on the MutationMapper site.  
  • AVM for cBioPortal (maintained by Aqumin Medical, available for Windows users): Windows software that loads data into 3D Landscapes for interactive visualization and pathway analysis. Use your own data, or test it out with cBioPortal's current datasets. 

Commercial support

The Hyve is an open source software company that provides commercial support for cBioPortal. They can help with deployment, data loading, development, consulting and training. You can learn more about their services on their website, as well as explore the other softwares they support.