scientificprotocols authored about 8 years ago
Authors: Wei Shi
We describe a powerful and easy-to-use RNA-seq analysis pipeline that can be used for complete analysis of RNA-seq data. It starts with raw read output of an sequencing instrument and reports lists of genes that are found to be differentially expressed in the comparison of different cell types. It consists of several analysis modules including Subread read alignment [1], featureCounts read summarization [2], voom normalization [3] and statistical testing of differential expression using empirical Bayes moderated t-statistic [4]. The entire pipeline mainly makes use of two R packages, Rsubread and limma, both available from the popular Bioconductor project.
The Rsubread package can be downloaded from http://bioconductor.org/packages/release/bioc/html/Rsubread.html. The limma package can be downloaded from http://bioconductor.org/packages/release/bioc/html/limma.html
The whole pipeline will take less than 8 hours to complete the analysis of an RNA-seq dataset including 100 million reads in total, on a computer with >=4 CPUs and >=8GB of memory.
Lists of genes whose expression levels are found to be statistically significantly changed in different conditions (eg. different cell types or different treatments).
Transcriptional profiling of mouse B cell terminal differentiation defines a signature for antibody-secreting plasma cells, Wei Shi, Yang Liao, Simon N Willis, Nadine Taubenheim, Michael Inouye, David M Tarlinton, Gordon K Smyth, Philip D Hodgkin, Stephen L Nutt, and Lynn M Corcoran, Nature Immunology doi:10.1038/ni.3154
Wei Shi, Gordon Smyth's Lab, Walter and Eliza Hall Institute, Melbourne
Correspondence to: Wei Shi ([email protected])
Source: Protocol Exchange (2015) doi:10.1038/protex.2015.039. Originally published online 18 May 2015.