Supercharge your Data Discovery with Kids First Data Resource

Supercharge your data discovery.

Illustration of data processed in a cloud platform and a battery charging

Icon of a hand holding a heavy dumbbell weight

Stronger

Combine Kids First data with your own
Consistent clinical terms across conditions
Harmonized genomic variants available immediately

Faster

Build cohorts and explore variants rapidly
Cloud-based analysis in CAVATICA
Investigate outcomes in PedcBioPortal

Greater

Uncover genetic predisposition for conditions in familial trios
Identify tumor oncogenes for drug development
Validate discoveries in model organisms to human health

Kids First

About the Data

Kids First studies on the portal are ready for analysis, having been harmonized and curated by the Kids First Data Resource Center team. These experts apply deep experience with pediatric data and are considerate of community feedback. Kids First data is functionally equivalent to other extensive genomic efforts such as GTeX and NCI Genomic Data Commons.

Variants of interest and genomic data are paired with clinical data for quick analysis startup.

Robust data includes SNVs, CNVs, and SVs annotated with Cancer Hotspots. Trio-structured germline data allows for de novo discoveries within families. These are produced by publicly available bioinformatic pipelines that are tested, documented, and accessible. Conditions are harmonized to be easily searched, regardless of the data language you speak. The Human Phenotype Ontology (HPO) for phenotypes and MONDO Ontology for diagnoses enable cohort discovery on the Kids First Portal.

Data Features

Genomic

Germline variants for each participant in gVCF format for rapid joint genotyping

Somatic variant pipeline calling SNVs, CNVs, and SVs for cancer studies

Trio-based joint-called variants to identify de novo mutations in congenital disorder studies

Clinical

Phenotypes mapped to the Human Phenotype Ontology (HPO)

Diagnoses mapped to the MONDO Disease Ontology (MONDO)

Search and build a cohort on the Kids First Data Resource Portal

Data Modalities

Whole Genome Sequencing

RNA Sequencing

Whole Exome Sequencing

Linked-Read Whole Genome Sequencing

Long Reads Sequencing

Access the Studies

The Kids First Data Resource Portal is a collection of studies from various investigators who perform disease-specific research. Originally part of separate research studies, the goal of collecting and sharing these studies to enable other investigators to combine and create new studies and research based on the data already collected.

Explore the many disease areas represented in datasets currently available through the portal. Additional conditions are anticipated to be added through future Kids First opportunities. Explore the many disease areas represented in studies currently available through the portal. Additional conditions are anticipated to be added through future Kids First opportunities.

Kids First Data

VIEW STUDIES

Your Questions Answered

What can you find on this website?

You can find out about the Gabriella Miller Kids First Data Resource Center, and sign up to access data, tools, and resources offered through Kids First.

Access pediatric cancer and congenital disorder data here.

How is Kids First Data Collected?

Researchers contribute tens of thousands of patient DNA samples collected from blood, tissue, and saliva to be sequenced and integrated with patient clinical data in the Kids First DRC.

In addition, patient families can partner with researchers by participating in studies seeking cures for childhood cancer and congenital disorders.

How to attribute Kids First Data in Publications

In addition to listing the PHS Accession Number(s) of the datasets used for a particular analysis and the databases from which they are accessible to the research community, X01 investigator teams (i.e., “Contributing Investigator(s)”) are asked to describe support for the project, including NIH grant numbers.

Secondary users, or “end users,” must acknowledge all datasets used in a publication or analysis by listing all relevant dbGaP PHS Accession Numbers and the URLs of the databases where the datasets were accessed. The Data Use Certification (DUC) agreed to by secondary users outlines how to use and acknowledge each approved dataset.

Is there a sample statement for the acknowledgment?

Yes! See below.

The results analyzed and here are based in whole or in part upon data generated by Gabriella Miller Kids First Pediatric Research Program (Kids First) projects and are accessible through from the Kids First Data Resource Portal (kidsfirstdrc.org) and/or dbGaP (www.ncbi.nlm.nih.gov/gap). Kids First was supported by the Common Fund of the Office of the Director of the National Institutes of Health (www.commonfund.nih.gov/KidsFirst). The was awarded a U24 () to sequence [childhood cancer and/or structural birth defect cohort samples] submitted by investigators through the Kids First program (). Additional funds supported assembling the cohorts, collecting the phenotypic data and samples, and/or data analysis.

Contributing investigators include: *.

*If there are many collaborators/consortium members, you can use a ‘corporate authorship’ with a link to a website that lists everyone.

Kids First requires that researchers share genomic data generated by NIH funds. Learn more about the transformative Genomic Data Sharing Policy here.

VIEW FAQs

The Kids First Data Resource Center (“DRC”) comprises partnered institutions supported by the NIH Common Fund under Award Number U2CHL138346 as part of the Common Fund’s Gabriella Miller Kids First Pediatric Research Program (“Kids First”). All content, terms and conditions and policies associated with the DRC Portal and Website (the “Services”) are produced by the DRC. The views and opinions of authors expressed on the Services do not necessarily state or reflect those of the National Institutes of Health (“NIH”) or the U.S. government. Furthermore, the NIH does not endorse or promote any DRC entity or any of its products or services nor guarantees the products, services, or information provided by the DRC.

Kids First: Congenital Diaphragmatic Hernia
Kids First: Congenital Heart Defects
Kids First: Ewing Sarcoma - Genetic Risk
Kids First: Orofacial Cleft - European Ancestry
Kids First: Syndromic Cranial Dysinnervation
Kids First: Adolescent Idiopathic Scoliosis
Kids First: Disorders of Sex Development
Kids First: Orofacial Cleft - Latin American
Kids First: Neuroblastoma
Kids First: Enchondromatoses
Kids First: Familial Leukemia
Kids First: Orofacial Cleft - African and Asian Ancestry
Kids First: Novel Cancer Susceptibility in Families (from BASIC3)
Kids First: Osteosarcoma
Kids First: Craniofacial Microsomia
Kids First: Kidney and Urinary Tract Defects
Kids First: Microtia - Hispanic
Kids First: Intersections of Cancer & SBD
Kids First: Esophageal Atresia and Tracheoesophageal Fistulas
Kid First: Hemangiomas (PHACE)
Kids First: Nonsyndromic Craniosynostosis
Kids First: Myeloid Malignancies
Kids First: Leukemia & Heart Defects in Down Syndrome
Kids First: T-Cell ALL
Kids First: Cornelia de Lange Syndrome
Kids First: Bladder extrophy, Epispadias, Complex
Kids First: Laterality Birth Defects
Kids First: CHARGE Syndrome
Kids First: Orofacial Clefts - Philippines
Kids First: Fetal Alcohol Spectrum Disorders
Kids First: Intracranial Germ Cell Tumors
Kids First: Structural Defects of The Neural Tube
Kids First: Recessive Structural Brain Defects
Kids First: Chromosome 18 Structural Birth Defects
Children's Brain Tumor Network (CBTN)
Kids First: Whole genome sequencing studies of multiplex nonsyndromic cleft lip/palate families