flashseq_demo_human

Run Summary

Summary	Value
Run ID	flashseq_demo_human
Genome Assembly	homo_sapiens_GRCh38.104
# Plates Detected	1
# Groups	1
# Total Wells Processed	384
# Excluded Wells (empty FASTQs)	155
# Cells (Before QC)	229
# Cells (After QC)	191
% Cells Filtered Out	16.59 %
Seurat Version	5.1.0
R Version	R version 4.2.3 (2023-03-15)
Analysis Date	2025-09-18

Introduction

This file contains an overview of the results from flashseq_demo_human.

Whenever possible we compare the reference group to the other groups using a two-sided Wilcoxon rank sum test. Only P-values inferior to 0.05 are displayed and encoded as follows:

p-value < 0.05: “*”
p-value < 0.01: “**”
p-value < 0.001: “***”
p-value < 0.0001: “****”

Raw Reads

Number of raw reads per cell.

A uniform distribution of the reads between cells is expected when following the standard FLASH-seq protocol.

Raw Reads - Plate View

Drop outs originating from FACS sorting issues are often characterized by a low number of associated reads.

Mapping Stats

From STAR mapping statistics.

Uniquely Mapped reads: Reads associated to a single position in the reference genome. Multimapped reads: Reads mapping at multiple positions onto the reference. Unmapped reads: Reads that do not map to the reference genome or present a too high number of multimapped positions.

The percentage of uniquely mapped reads can vary between cell types. Good quality cells typically display values >70%.

Sequencing Depth Saturation

Read Distribution

The read distribution between 3’/5’UTR, exonic coding sequence, intronic and intergenic regions are estimated using ReSQC. Read tag percentages are displayed.

The majority of the reads will be distributed between CDS exon and intronic categories.

Mitochondrial Genes

Gene-body Coverage

Distribution of the reads along the gene-body. As a full length protocol, FLASH-seq produces uniform distribution from 5’ to 3’.

Estimated using ReSQC.

Population Overview

First glance at the single-cell populations. This analysis is provided for reference only. Due to its automated nature, we highly recommend fine-tuning the parameters before annotating your cell populations.

Filtering

Cells are considered as outliers and filtered out of the population analysis if they don’t meet the following criteria::

Mitochondria reads: 0.1%-10%
Number of detected genes: >1000 and <(median + 2x median absolte deviation)
Number of associated reads (exonic): median +/- 3x median absolte deviation

Clustering

Cell populations are identified following the standard Seurat (V5) pipeline. Briefly, the data are log-normalized and then rescaled, regressing out the read counts. The top 2000 most variable features are extracted using ‘vst’ function and used to run a principal component analysis. The number of dimensions is chosen as follows:

Number of principal component (PC) that explain >80% of the variability
Number of principal component where the difference between two consecutive PC is <0.25%

Both metrics are compared and the smallest number of PC is selected. These dimensions are used to cluster cells into groups (shared nearest neighbor) and to display the results (UMAP).

Feature Distribution

Markers

Wilcoxon-test - LogFC > |1| - minimal population expression > 0.5 - Bonferroni correction for multiple testing.

Only genes with adjusted p-values <0.05 are displayed.

Failure Rate

Percentage of low quality cells after sequencing. Defined as:

less than 100K raw reads
more than 25% unmapped reads
less than 75K uniquely mapped reads