openpipeline
comprehensive collection of extensible, best-practice analysis pipelines
for processing single-cell multi-omics data. These workflows handle various omics modalities,
including gene expression, antibody capture, VDJ, and ATAC data, from diverse experimental set-ups
such as CITE-seq, ATAC-seq, and Drop-seq. The platform provides reproducible and scalable solutions across
the entire analysis journey, from raw sequencing data processing to functional interpretation.
The workflows provide end-to-end support for:
The demultiplexing components convert raw BCL files from Illumina sequencers into FASTQ format by separating multiplexed reads based on their cell barcodes and UMIs. Both Illumina's demultiplexing solution as well as 10X Genomics wrappers are available:
The ingestion workflows alligns FASTQ files to reference genomes to generate count matrices in H5MU format. Both BD Genomics and 10X Genomics protocols are supported:
Note that a workflow for generating the transcriptome reference is also available.
After having generated count matrices in H5MU format, single-cell data require comprehensive pre-processing, which is supported by the pre-processing workflow. This workflow enables processing of multiple modalities, including Gene Expression, Antibody Capture, VDJ and ATAC. The following pre-processing steps are executed:
Multiple workflows with various computational approaches are available to remove unwanted technical variation across batches while preserving biological differences:
All of the above mentioned integration workflows also include steps to perform neighbor detection, Leiden clustering, and UMAP dimensionality reduction to facilitate exploration of the integrated data.
The platform provides multiple approaches for cell type annotation, assigning biological identities to cells based on their expression profiles:
Viash workflows are built on a modular architecture where components and workflows are fully equivalent. Each component can be executed as a stand-alone workflow, while any workflow can be seamlessly integrated as a dependency of another workflow. This design enables flexible customization and recombination to address diverse analytical needs.