Dataset
Cancer genomics data for circulating cell-free DNA
University of the Sunshine Coast
2022
DOI:
https://doi.org/10.25907/00145
Appears in UniSC Research Data Collection
Abstract
We searched the NCBI BioProject database and downloaded 1,012 experiments with original sequences from 14 projects, involving 7 major types of head and neck cancer, lung cancer, breast cancer, prostate cancer, gastric cancer, colon cancer, and liver cancer. For sequence reading, we performed preprocessing steps and variant calling, followed by a series of filtering steps to remove non-functional variants and minimize false positives, which gave us a refined list of 6981 variants.
All the raw data are download from NCBI bioproject database at https://www.ncbi.nlm.nih.gov/bioproject/
The BioProject IDs are as below:
PRJNA485408
PRJNA448888
PRJEB15399
PRJNA281253
PRJEB4979
PRJNA343124
PRJNA603789
PRJNA603782
PRJNA575243
PRJNA475218
PRJNA281419
PRJEB32931
PRJNA307236
PRJNA407354
Details
- Title
- Cancer genomics data for circulating cell-free DNA
- Authors
- Min Zhao (Data Collector) - University of the Sunshine Coast, Queensland, GeneCology Research Centre - Legacy
- Format
- 10 TB
- Publisher
- University of the Sunshine Coast
- Date published
- 2022
- DOI
- 10.25907/00145
- Identifiers
- Q3300
- Copyright note
- This is public data, we can use it but do not have the right to distribute it. Please see Description for details of where the data can be downloaded from.
- Organisation Unit
- Cancer Research Cluster; School of Science, Technology and Engineering; Centre for Bioinnovation
- Language
- English
- Record Identifier
- 99640879002621
- Output Type
- Dataset
Metrics
179 Record Views