Innovative science

Technology

Big data platform

Introduction

Features

  • Extensive data accumulated over 10 years of national and international collaborative research
  • The largest database of its kind in the country, containing clinical and experimental data on 7 cancer types (liver, breast, gastric, and colorectal cancer, among others)

Big data retained by CBS

  • Specimen & clinical info.: 144,411 (7 cancer types)
  • Experimental data: 3,300,275 (7 cancer types)
  • Public data: 158,098,078 (7 cancer types)
  • Standard transcriptomics gene expression atlas: 4 cancers: HCC (2), HNSCC (1), BC (1). Expanding to other cancers.

Retained genomic data by cancer type

Type of cancer  Assay platform    No. of target genes  Retained data
HCC             RT-PCR            1,200                2,381,904
                nCounter          770                  138,600
                MS                2,085                2,085
                MS/MS             1,429                1,429
RC              nCounter          770                  120,120
GC              NGS (DNA seq.)    In progress          In progress
                FACS / Bioplex    In progress          In progress
SCLC            NGS (DNA seq.)    62                   744
HNSCC           NGS (DNA seq.)    244                  46,360
                nCounter          99                   9,603
PSC             NGS (DNA seq.)    In progress          In progress
TNBC            NGS (DNA seq.)    27,685               599,430
Total                             34,344               3,300,275

Standardized database of public data

Database              Types of data                                           No. of data
DNA                   DNA sequence data for 504 cell lines                    1,159,663
                      Response data for 24 drugs in 504 cell lines            11,670
RNA                   DNA sequence data for 60 liver cancer specimens         60,000
                      DNA sequence data for 59 breast cancer specimens        30,000
Protein               Amino acid sequence and function data                   550,000
                      Protein matching data (protein name)                    130,000,000
PPI Info.             Protein-protein interaction information                 25,000,000
Signal pathway Info.  Data on 320 signal transduction pathways                1,145,020
Drug Info.            Drug data                                               9,588
                      Drug details (target gene, regulatory approval, etc.)   132,137
Total                                                                         158,098,078

Standard cancer transcriptomics gene expression atlas by cancer type