Presenter Information

Jinmiao CHEN, Duke-NUS and A*STAR

Loading...

Media is loading
 

Event Website

https://libguides.ntu.edu.sg/SGORconference2024

Description

Since its inception in 2009, single-cell RNA-seq techniques have evolved, increasing throughput and reducing costs. With a growing number of published studies, efficient data retrieval is crucial. DISCO was created as a comprehensive database of single-cell RNA-seq data, enabling exploration of cell types and gene expressions in various tissues. Now, DISCO hosts over 100 million single-cell profiles from 16,734 samples, reflecting a fivefold increase since its first version. We have curated metadata, categorized samples, and refined cell type annotations using a harmonized reference. DISCO platform provides online tools for data integration, cell type annotation, projecting query dataset to atlases, and gene set enrichment analysis. The DISCO R toolkit supports offline analyses. Our data also aids in training foundation AI models like scGPT and scFoundation, enhancing hypothesis generation and data mining. DISCO’s continued updates and extensive dataset underscore its role as a key resource in the field.

Document Type

Presentation

Publication Date

2024

Share

COinS
 
Nov 12th, 2:40 PM Nov 12th, 3:00 PM

Rediscovering Publicly Available Single-cell Data with the DISCO Platform

Since its inception in 2009, single-cell RNA-seq techniques have evolved, increasing throughput and reducing costs. With a growing number of published studies, efficient data retrieval is crucial. DISCO was created as a comprehensive database of single-cell RNA-seq data, enabling exploration of cell types and gene expressions in various tissues. Now, DISCO hosts over 100 million single-cell profiles from 16,734 samples, reflecting a fivefold increase since its first version. We have curated metadata, categorized samples, and refined cell type annotations using a harmonized reference. DISCO platform provides online tools for data integration, cell type annotation, projecting query dataset to atlases, and gene set enrichment analysis. The DISCO R toolkit supports offline analyses. Our data also aids in training foundation AI models like scGPT and scFoundation, enhancing hypothesis generation and data mining. DISCO’s continued updates and extensive dataset underscore its role as a key resource in the field.

https://ink.library.smu.edu.sg/sgor2024/programme/schedule/13

 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.