Single-cell technologies such as scRNA-seq, scATAC-seq and CITE-seq allow researchers to profile gene expression, chromatin accessibility and surface proteins at single-cell resolution.
However, analysis workflows are complex, require multiple tools and are often locked into proprietary vendor platforms.
This page presents an open, reproducible alternative built with Viash components.
Computational biologists and platform leads working on single-cell or multi-modal projects.
Whether you are analyzing 10x Genomics data, Smart-seq libraries or custom protocols, the pipeline can be adapted to your needs.
The workflows provide end-to-end support for:
The demultiplexing components convert raw BCL files from Illumina sequencers into FASTQ format by separating multiplexed reads based on their cell barcodes and UMIs. Both Illumina's demultiplexing solution as well as 10X Genomics wrappers are available:
The ingestion workflows aligns FASTQ files to reference genomes to generate count matrices in H5MU format. Both BD Genomics and 10X Genomics protocols are supported:
Note that a workflow for generating the transcriptome reference is also available.
After having generated count matrices in H5MU format, single-cell data require comprehensive pre-processing, which is supported by the pre-processing workflow. This workflow enables processing of multiple modalities, including Gene Expression, Antibody Capture, VDJ and ATAC. The following pre-processing steps are executed:
Multiple workflows with various computational approaches are available to remove unwanted technical variation across batches while preserving biological differences:
All of the above mentioned integration workflows also include steps to perform neighbor detection, Leiden clustering, and UMAP dimensionality reduction to facilitate exploration of the integrated data.
The platform provides multiple approaches for cell type annotation, assigning biological identities to cells based on their expression profiles: