Cost of short read RNA sequencing

4 minute read

Tags: , , ,

Published:

Why do you do this experiment?

Sequencing RNA enables the identification and quantification of RNA expressed in a cell or a sample (the transcriptome).

Input 100k-1M Live cells, FFPE, frozen cells or 25-250ng RNA

Output Fastq file (20-100M PE reads) -> Gene expression

Strategic Value

By comparing multiple samples, we know the effect of perturbations (drug, disease, knock-out, etc) on the transcriptome of the cell. This can be used to understand gene regulation, how a drug works, or which processes a disease affects.

RNAseq provides the sequence of all expressed genes, meaning variants (e.g. SNPs, gene fusions) can be called but coverage will be biased towards highly expressed genes. In the context of cancer and with deep enough RNAseq, sub-clonal exonic mutations can be detected for most genes.

Cost & Scale

  • Variable per run: \$150/sample. Range: \$118 - \$236
  • Cost breakdown:
    • RNA extraction: \$56
    • Short read library preparation: \$50
    • Sequencing (20-100M reads, 4-30Gb): \$12-\$120
  • Capex: Thermocycler (\$10-20k), TapeStation (\$6-30k), NGS Sequencer (\$50k-1M)

Experimental Modules

  1. RNA extraction (2h30, 40’ hands-on)
  2. Sequencing library preparation (6h, 2h hands-on)
  3. Sequencing run (4-24h depending on the sequencer)

Ops & Throughput

Turnaround: 3+ days (day 1 extraction, day 2 library prep, day 3 or later sequencing)

Hands-on time: 4h

Parallelizability: High. All steps can be done in parallel for as many samples as needed.

Bottlenecks: availability of Tapestation (16 lanes) and thermocycler (96 wells).

Batching: 1 to 16 samples per technician.

Automation readiness: Full, with commercial solutions available.

Outsourceability: Yes.

Data scale: 20-100M reads/sample, ~30Gb/sample

Data API

Raw format: FASTQ Processed format: count matrix Resolution: gene-level expression, single nucleotide variant

Analysis Ecosystem

  1. QC and cleaning
    • fastqc: Quality control of the run
    • cutadapt: Trimming of sequencing adapters from the reads
  2. Alignement:
  3. Gene expression quantification:
  4. Differential expression

Public datasets

Pitfalls & Failure Modes

  • Don’t skip the ribo-depletion or polyA enrichment step, they represent most of the extration cost but are there for a reason. >90%[^1] of RNA in a cell are rRNA or tRNA. Sequencing total RNA from a cell without size selection with short read sequencing would yield around 70% of rRNA reads and 15% of tRNA reads which are not very interesting populations (unless you look at base modifications, which is not done in short read). With the cheap cost of sequencing nowdays you should systematically go for ribo-depletion over polyA. Batch correction can integrate your ribo-depleted data with a polyA cohorts without problems.
  • Most protocols for RNAseq are optimized for the extraction of RNA longer than 20bp and will size select the sequencing library to 300-500bp. This will exclude small RNA populations (tRNA, miRNA, snoRNA, etc). If you are interested in those populations use dedicated kit (e.g Qiagen miRNAeasy) and remove the size selection steps.

[^1] https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2015.00002/full

Order list

Plenty of suppliers exist for this kind of protocol and you can mostly mix an match suppliers to your liking for each step. I used NEB as a convenient example as their documentation is quite clear.

ItemCostNumber of experimentsLink
 Monarch® Total RNA Miniprep Kit30050https://www.neb.com/en/products/t2010-monarch-total-rna-miniprep-kit?srsltid=AfmBOopSZmPKF4Cfc-PLtnsJVH3Cw5xaUBpW1I56u-Zhhk1bdz_qEuKi
NEBNext® rRNA Depletion Kit117024https://www.neb.com/en/products/e7400-nebnext-rrna-depletion-kit-v2-human-mouse-rat
NEBNext Ultra II Directional RNA Library Prep Kit Illumina110024https://www.neb.com/en/products/e7760-nebnext-ultra-ii-directional-rna-library-prep-kit-for-illumina?srsltid=AfmBOooPomu_ib-QTTzKump5qvf8Tz8iLRobH3FuSFLhvdkatczjhqMW
NEBNext® Multiplex Oligos for Illumina®12024https://www.neb.com/en/products/e7335-nebnext-multiplex-oligos-for-illumina-index-primers-set-1
    
Total per xp\$2001 

Protocol variations

  • RNA extraction should yield 10-30pg of RNA/cell
  • Ultra-low-input protocols based on direct reverse transcription enable RNAseq from as low as 10 cells input (e.g from Thermo-Fischer).

This post is part of a series on the cost of experiments. All costs are orders of magnitude and are susceptible to have changed between the post and your order date. All costs assume you perform the whole pipeline in house and do not include labor costs. For outsourcing a decent first estimate is to double the indicated costs. Cheap consumables are not always included if they affect less than 1% of the cost. Always check the protocols coming with the kits for the complete list of consumables to order.