The rapid growth of single-cell RNA-seq studies (scRNA-seq) demands efficient data storage, processing, and analysis. Big data technology provides a framework that facilitates the comprehensive discovery of biological signals from inter-institutional scRNA-seq datasets. The strategies to solve the stochastic and heterogeneous single-cell transcriptome signal are discussed in this article. After extensively reviewing the big data applications of next-generation sequencing (NGS)-based studies available, we propose a workflow that accounts for the unique characteristics of scRNA-seq data and primary objectives of single-cell studies.
Copyright © 2016. Production and hosting by Elsevier Ltd.