Cost of single cell RNA sequencing

8 minute read

Tags: , , ,

Published:

Why do you do this experiment?

Long-read RNA sequencing enables the identification and quantification of RNA expressed in a cell or a sample (the transcriptome) at the isoform resolution.

Input 100-10M cells

Output Fastq file (100M-25B PE reads) -> Single cell gene expression

Strategic Value

  • Characterise cell type and cell state in a complex sample (developing embryo, tumor sample or simply a healthy tissue).
  • Massive parallel perturbation with CROPseq, PerturbSeq or pooled cell culture.

Cost & Scale

  • Variable per run: \$2700 for 20k cells, \$190 (96 cells) - \$36,500 (1M cells)
  • Cost breakdown:
    • Cell barcoding of RNA: \$90 (96 cells) - \$10k (1M cells)
    • Sequencing: \$100 (10M, 1Gb) - \$16,500 (25B reads, 7.5Tb)
  • Capex: Magnetic Stand 96 (\$800), Thermocycler (\$10-20k), TapeStation (\$6-30k), Chromium Controller (\$20k, needed for 10x Genomics only), Illumina NovaseqX or MGI T20 (\$1M)

Experimental Modules

  1. Cell barcoding of RNA (8h, 4h hands-on)
  2. Sequencing library preparation (2h15, 30’ hands-on)
  3. Sequencing run (48h, 30’ hands-on)

Ops & Throughput

Turnaround: 4+ days (day 1 single cell RNA barcoding, day 2 library prep, day 3 or later sequencing >40h)

Hands-on time: 5h

Parallelizability: High. All steps can be done in parallel for as many samples as needed.

Bottlenecks: availability of sequencer (2-4 flowcells per sequencer fully occupied).

Batching: 1 preparation per technician, number of samples up to 96 depending on the protocol multiplexing possibilities.

Automation readiness: Partial. Custom solution via automation specialists for Parse Bioscience and Scale Bioscience. Partially released Chromium Connect by 10x Genomics. Worth mentionning is the Cellen One X1 Neo which can easily be adapted for SmartSeq2 automation.

Outsourceability: Yes, most CROs offer it.

Data scale: 100M-25B reads/sample, 1Gb-7.5Tb/sample

Data API

Raw format: FASTQ

Processed format: sparse count matrix -> cell type (with RNA velocity if relevant)

Resolution: 3’-biased polyA gene products expression for individual cells

Analysis Ecosystem

  1. QC and cleaning
    • fastqc: Quality control of the run
    • cutadapt: Trimming of sequencing adapters from the reads
  2. Alignement and cell barcode attribution pipelines (most use STAR under the hood):
  3. (optional) RNA velocity
  4. Count processing and cell clustering
    • scanpy in python, faster and better suited for large dataset (>100k)
    • Seurat in R
  5. Cell type annotation (many tools exist, including foundation models)
  6. Differential expression

Public datasets

Pitfalls & Failure Modes

  • Ambient RNA contamination is a prime noise factor in single cell RNA-seq. Ambiant RNA is release by dead cells when they loose membrane integrity and can be barcoded with cells barcode. This problem is most present with encapsulation methods such as 10x or
  • Most protocols rely on polyA oligos to barcode the RNAs, leading to only mRNA and lncRNA being captured. Parse Bioscience Evercode takes an intermediate route with a mix of polyA and random hexamers and 10x offers capture sequence on their beads. If you are interested mainly in non-polyA transcripts at the single cell resolution, there are protocols but they are usually lower throughput.
  • polyA capture followed by fragmentation induces a 3’ bias, limiting the resolution to the gene level. SmartSeq2 notably uses tagmentation to insert barcodes, providing reads covering the full length of the transcript. A protocol variation with 10x to perform long read sequencing strongly decreases the 3’ bias for short transcripts (<10kb) but requires using long read sequencing technologies. Takara also provides a long read variation of SmartSeq.
  • Picelli2014: SmartSeq2 foundational paper
  • Rosenberg2018: Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding (Parse Bioscience foundational paper)
  • Zheng2017: Massively parallel digital transcriptional profiling of single cells (10x genomics foundational paper)
  • Gaisser2024: High-throughput single-cell transcriptomics of bacteria using combinatorial barcoding
  • Pan2024: Single Cell Atlas: a single-cell multi-omics human cell encyclopedia
  • Heimberg2024: A cell atlas foundation model for scalable search of similar human cells
  • Peidli2024: scPerturb: harmonized single-cell perturbation data
  • Replogle2022: Mapping information-rich genotype-phenotype landscapes with genome-scale Perturb-seq
  • Bergen2020: Generalizing RNA velocity to transient cell states through dynamical modeling
  • Clarke2021: Tutorial: guidelines for annotating single-cell transcriptomic maps using automated and manual methods
  • Ianevski2022: Fully-automated and ultra-fast cell-type identification using specific marker combinations from single-cell transcriptomic data
  • Fu2024: A comparison of scRNA-seq annotation methods based on experimentally labeled immune cell subtype dataset

Order list

Single well barcoding: 1 - 384 cells (SmartSeq2 or FLASHseq)

ItemCostNumber of experimentsLink
SMART-Seq® Single Cell Kit\$440048https://www.takarabio.com/products/next-generation-sequencing/rna-seq/legacy-rna-seq-kits/smart-seq-single-cell-for-scrna-seq)
10M 2x150 reads (200k/cell) with NextSeq2000 XBS P1 or Aviti Low Output flowcell\$1001https://www.elementbiosciences.com/products/aviti/specs
Total per xp\$190196 cells
Cost per cell\$2  

Droplet-based barcoding: 100 - 100k cells (10x genomics)

ItemCostNumber of experimentsLink
GEM-X Universal 3’ Gene Expression v4, 16 samples\$2450016https://www.10xgenomics.com/store/experiment-builder?assay=ThreePrime&version=V40&step=form
Chromium GEM-X Single Cell 3’ Chip Kit v4, 4 chips\$140032https://www.10xgenomics.com/store/experiment-builder?assay=ThreePrime&version=V40&step=form
Dual Index Kit TT Set A, 96 rxn\$110096https://www.10xgenomics.com/store/experiment-builder?assay=ThreePrime&version=V40&step=form
500M 2x150 reads (25k/cell) on Aviti Medium Output\$11001https://www.elementbiosciences.com/products/aviti/specs
Total per xp\$2700120k cells
Cost per cell\$0.14  

Split-pool barcoding: 100k - 10M cells (Parse Bioscience or Scale Bioscience)

ItemCostNumber of experimentsLink
Parse Bioscience Evercode WT v3\$100001https://www.parsebiosciences.com/products/evercode-wt/
25B 2x150 reads (25k/cell) on NovaseqX 25B or MGI T20\$165001https://emea.illumina.com/products/by-type/sequencing-kits/cluster-gen-sequencing-reagents/novaseq-x-series-reagent-kits.html#tabs-80eb4f32eb-item-f8cd845d52-order
Total per xp\$2650011M cells
Cost per cell\$0.03  

Protocol variations

  • CROPseq/PerturbSeq: Perturb cells with CRISPR technologies and read which guide RNA is present in each single cell, either via a polyA-sgRNA (Datlinger2017) or a dedicated capture sequence (Dixit2016, Replogle2020)
  • Single cell RNAseq with long read sequencing. This usually simply requires a longer RT step to produce a full cDNA copy of the transcript, skipping cDNA fragmentation and using a long read technology.
  • Demultiplexing via SNPs with Souporcell enables the multiplexing or an arbitrary number of samples of different genetic origin (patient or cell line). <!–
  • Single cell methylation can be obtained with bisulfite transformation of the barcoded DNA. –>

Bonus: Main metrics for common kits

TechologyYearCells per runCost per runCost/cellMultiplexingMin Cost per SampleLink
10X Genomics GEM-X Universal 3' Gene Expression 2024 20k 1573 0.07 No 1573 https://www.10xgenomics.com/store/product-catalog
Parse Bioscience Evercode WT v3 2024 100k 10000 0.1 48 208 https://www.parsebiosciences.com/products/evercode-wt/
Parse Bioscience Evercode Mega v3 2024 1M 20000 0.02 384 52 https://www.parsebiosciences.com/products/evercode-wt-mega/
Illumina Single Cell 3' RNA Prep T10 2025 10k 625 0.06 No 625 https://emea.illumina.com/products/by-type/sequencing-kits/library-prep-kits/single-cell-rna-prep.html#tabs-2442e1bdc3-item-1ecee5b249-order
Illumina Single Cell 3' RNA Prep T100 2025 100k 3425 0.03 No 3425 https://emea.illumina.com/products/by-type/sequencing-kits/library-prep-kits/single-cell-rna-prep.html#tabs-2442e1bdc3-item-1ecee5b249-order
Scale Bioscience QuantumScale Modular 2024 160k 4800 0.03 16 300 https://scale.bio/single-cell-rna-sequencing-kit/
Scale Bioscience QuantumScale Large 2024 2M 28000 0.015 384 73 https://scale.bio/single-cell-rna-sequencing-kit/
SmartSeq2 2014 96 90 2 96 2 https://www.takarabio.com/products/next-generation-sequencing/rna-seq/legacy-rna-seq-kits/smart-seq-single-cell-for-scrna-seq
TechologyYearCells per runCost per runCost/cellMultiplexingMin Cost per SampleLink

This post is part of a series on the cost of experiments. All costs are orders of magnitude and are susceptible to have changed between the post and your order date. All costs assume you perform the whole pipeline in house and do not include labor costs. For outsourcing a decent first estimate is to double the indicated costs. Cheap consumables are not always included if they affect less than 1% of the cost. Always check the protocols coming with the kits for the complete list of consumables to order.