RNA-seq
Updated by Hongjiang on 09/17/2022
Bulk RNA-seq is a common sequencing technique for transcriptome analysis.
mRNA were enriched using ploy-T tails and reverse transcripted to cDNA.
Upstream analysis
Upstream means the QC & mapping of raw sequcening data.
fastp
seqkit
STAR
Salmon
Salmon is a novel, fast, accuarte sofeware published recently. It's developed for TPM/counts calculation.
To install Salmon using conda:
Salmon uses transcriptome FASTA for analysis. You can download cdna
files directly from ensembl.org.
If you have access to the genome FASTA and GTF used for alignment consider generating a transcriptome FASTA using a command like:
you can find the gffread utility at (http://ccb.jhu.edu/software/stringtie/gff.shtml), which could be installed by conda install gffread
.
Rsem
Downstream analysis
edgeR
ClusterProfiler
Install
BiocManager::install("clusterProfiler")
BiocManager::install("pathview")
BiocManager::install("org.Hs.eg.db") # Homo sapiens
org.xx.eg.db
You can search org.xx.eg.db on Bioconductor, below are some frequently used ones.
BiocManager::install("org.Hs.eg.db") # Homo sapiens
BiocManager::install("org.Mm.eg.db") # Mus musculus
BiocManager::install("org.Rn.eg.db") # Rattus norvegicus