Libre Biotech

ONT Long-Read Transcriptomics (IsoQuant)

C57BL/6 combined (bc19+bc20+bc21) — GENCODE M35, complete_genedb

Type
CWL
Status
succeeded
Engine
cwltool
Duration
0.6 h
Source Data
Study Strain-specific cortex gene expression and isoform usage
Sample prep PCR-cDNA barcoding library preparation for C57/DBA mouse RNA (SQK-PCB111.24) 2023-03-13
Sample prep AMPure XP 1.5 kb size selection of C57/DBA PCR-cDNA libraries 2023-05-01
Sample prep PCR-cDNA barcoding library preparation for C57/DBA mouse RNA (run 3, bc10-15) 2023-05-09
Sequencing Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA (run 1) 2023-04-21
Sequencing Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA 1.5kb (run 2) 2023-05-04
Sequencing Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA (run 3) 2023-05-12
Run Data Run #61 (6 samples)
Run Data Run #62 (6 samples)
Run Data Run #63 (6 samples)
Samples C57_rep1_bc10 C57_rep1_bc19 C57_rep1_bc19_1.5kb C57_rep2_bc11 C57_rep2_bc20 C57_rep2_bc20_1.5kb C57_rep3_bc12 C57_rep3_bc21 C57_rep3_bc21_1.5kb DBA_rep1_bc13 DBA_rep1_bc22 DBA_rep1_bc22_1.5kb DBA_rep2_bc14 DBA_rep2_bc23 DBA_rep2_bc23_1.5kb DBA_rep3_bc15 DBA_rep3_bc24 DBA_rep3_bc24_1.5kb

Workflow

ONT Transcriptomics (IsoQuant + TransDecoder)

#cwl

Software Tools

ToolVersionURL
cwltool - https://github.com/common-workflow-language/cwltool
isoquant_3.6.0--hdfd78af_0.sif - -

Results Summary

Total Reads
1,726,210
Expressed Genes (CPM ≥ 1)
13,315
Novel Transcripts (CPM ≥ 1)
1,709
PolyA Detected
85.7%

Expressed Transcript Sources (CPM ≥ 1, unique reads)

Novel Transcript Categories (CPM ≥ 1, unique reads)

Top Expressed Genes (CPM, unique reads)

Gene Expression Distribution (CPM, unique reads)

Read Assignment Quality

Read Structural Classification

Chromosomal Distribution (unique reads)

Transcript Length Distribution (expressed transcripts, CPM ≥ 1)

14,134 transcripts | Median: 1,828 nt | Mean: 2,248 nt | N50: 2,810 nt

Exons per Transcript (expressed transcripts, CPM ≥ 1)

Isoforms per Gene (expressed transcripts, CPM ≥ 1)

TransDecoder ORF Types (expressed transcripts)

Peptide Length Distribution (expressed transcripts)

14,709 ORFs | Median: 278 aa | Mean: 362 aa | Max: 6,299 aa

Output Files

OUT.extended_annotation.gtf HPC 314.9 MB OUT.gene_counts.tsv HPC 1.4 MB OUT.read_assignments.tsv.gz HPC 46.4 MB OUT.transcript_counts.tsv HPC 3.7 MB OUT.transcript_model_counts.tsv HPC 389.1 KB OUT.transcript_models.gtf HPC 45.4 MB job.yml HPC 304 B results_summary.json HPC 6.1 KB transcripts.fa.transdecoder.cds HPC 18.4 MB transcripts.fa.transdecoder.gff3 HPC 11.7 MB transcripts.fa.transdecoder.pep HPC 7.6 MB

Input Data

FileFormatDescription
OUT.transcript_models.gtf text/plain -
OUT.extended_annotation.gtf text/plain -
OUT.read_assignments.tsv.gz application/octet-stream -
OUT.transcript_counts.tsv text/tab-separated-values -
OUT.gene_counts.tsv text/tab-separated-values -

Provenance

Execution Expression quantification summary
Completed 2026-03-01T07:41:56+00:00
RO-Crate 1.1 Workflow RO-Crate 1.0 FAIR
This analysis is packaged as a Research Object Crate with machine-readable provenance and FAIR metadata.

RO-Crate Metadata (JSON-LD)

Show/hide raw JSON-LD
{
    "@context": "https://w3id.org/ro/crate/1.1/context",
    "@graph": [
        {
            "@id": "ro-crate-metadata.json",
            "@type": "CreativeWork",
            "about": {
                "@id": "./"
            },
            "conformsTo": [
                {
                    "@id": "https://w3id.org/ro/crate/1.1"
                },
                {
                    "@id": "https://w3id.org/workflowhub/workflow-ro-crate/1.0"
                }
            ]
        },
        {
            "@id": "./",
            "@type": "Dataset",
            "name": "ONT Transcriptomics (IsoQuant + TransDecoder) \u2014 Run #54",
            "description": "Long-read transcriptomics: IsoQuant for transcript discovery and quantification, gffread for FASTA extraction, TransDecoder for ORF prediction.",
            "datePublished": "2026-03-01",
            "license": {
                "@id": "https://creativecommons.org/licenses/by/4.0/"
            },
            "mainEntity": {
                "@id": "ont_transcriptomics.cwl"
            },
            "hasPart": [
                {
                    "@id": "ont_transcriptomics.cwl"
                },
                {
                    "@id": "job.yml"
                },
                {
                    "@id": "OUT.transcript_models.gtf"
                },
                {
                    "@id": "transcripts.fa.transdecoder.gff3"
                },
                {
                    "@id": "transcripts.fa.transdecoder.pep"
                },
                {
                    "@id": "OUT.transcript_model_counts.tsv"
                },
                {
                    "@id": "OUT.extended_annotation.gtf"
                },
                {
                    "@id": "OUT.read_assignments.tsv.gz"
                },
                {
                    "@id": "OUT.transcript_counts.tsv"
                },
                {
                    "@id": "transcripts.fa.transdecoder.cds"
                },
                {
                    "@id": "OUT.gene_counts.tsv"
                },
                {
                    "@id": "results_summary.json"
                },
                {
                    "@id": "summary_extractor.py"
                }
            ],
            "mentions": [
                {
                    "@id": "#execution"
                },
                {
                    "@id": "#summary-extraction"
                }
            ]
        },
        {
            "@id": "ont_transcriptomics.cwl",
            "@type": [
                "File",
                "SoftwareSourceCode",
                "ComputationalWorkflow"
            ],
            "name": "ONT Transcriptomics (IsoQuant + TransDecoder)",
            "description": "#cwl",
            "programmingLanguage": {
                "@id": "Long-read transcriptomics: IsoQuant for transcript discovery and quantification, gffread for FASTA extraction, TransDecoder for ORF prediction."
            },
            "contentSize": "1.7 KB",
            "sha256": "c3b82f1cff216a3f2d95fa91eef186e33edc1a646b2f06a254984b1c1bd29b96"
        },
        {
            "@id": "#cwl",
            "@type": "ComputerLanguage",
            "name": "Common Workflow Language",
            "url": {
                "@id": "https://www.commonwl.org/"
            },
            "version": "1.2"
        },
        {
            "@id": "#cwltool",
            "@type": "SoftwareApplication",
            "name": "cwltool",
            "url": {
                "@id": "https://github.com/common-workflow-language/cwltool"
            }
        },
        {
            "@id": "#singularity-container",
            "@type": "SoftwareApplication",
            "name": "isoquant_3.6.0--hdfd78af_0.sif"
        },
        {
            "@id": "job.yml",
            "@type": "File",
            "name": "job.yml",
            "description": "CWL job input parameters",
            "encodingFormat": "text/yaml",
            "contentSize": "304 B",
            "sha256": "0433474ed0ae3a47d41b71211be5e402ea8cfe19d025cb26046b765a7be004a7"
        },
        {
            "@id": "OUT.transcript_models.gtf",
            "@type": "File",
            "name": "OUT.transcript_models.gtf",
            "encodingFormat": "text/plain",
            "contentSize": "45.4 MB",
            "sha256": "8c250a851e31c3eee0829a22b5573501d147a4a2bc9803ab12cadb66c0511d80"
        },
        {
            "@id": "transcripts.fa.transdecoder.gff3",
            "@type": "File",
            "name": "transcripts.fa.transdecoder.gff3",
            "encodingFormat": "text/plain",
            "contentSize": "11.7 MB",
            "sha256": "9f3cb3a1714d2ccbdd5d4819644da0d64cc31ad41ea0586cde764928eb3bc344"
        },
        {
            "@id": "transcripts.fa.transdecoder.pep",
            "@type": "File",
            "name": "transcripts.fa.transdecoder.pep",
            "encodingFormat": "application/octet-stream",
            "contentSize": "7.6 MB",
            "sha256": "4d57a21e6b10c41bfeb7f08389eb9e0ee1b11ac24f059315acade251bbc1e6b2"
        },
        {
            "@id": "OUT.transcript_model_counts.tsv",
            "@type": "File",
            "name": "OUT.transcript_model_counts.tsv",
            "encodingFormat": "text/tab-separated-values",
            "contentSize": "389.1 KB",
            "sha256": "4307a8fd48d3a6f7dd0196f23cdfc7dc2ed4b8edfcebbbcf829e7c0a2df987cf"
        },
        {
            "@id": "OUT.extended_annotation.gtf",
            "@type": "File",
            "name": "OUT.extended_annotation.gtf",
            "encodingFormat": "text/plain",
            "contentSize": "314.9 MB",
            "sha256": "42d5935c6f2d2b1f666182e57eb0e447e9d7ec7a754be560e4144ed999616553"
        },
        {
            "@id": "OUT.read_assignments.tsv.gz",
            "@type": "File",
            "name": "OUT.read_assignments.tsv.gz",
            "encodingFormat": "application/octet-stream",
            "contentSize": "46.4 MB",
            "sha256": "ef030e2573518f0272fab1168edd9cfc33f115094a13e4c7bbf4cb89bf42562f"
        },
        {
            "@id": "OUT.transcript_counts.tsv",
            "@type": "File",
            "name": "OUT.transcript_counts.tsv",
            "encodingFormat": "text/tab-separated-values",
            "contentSize": "3.7 MB",
            "sha256": "deefac2df78464eb3c56009d55b118b42c1dd018eb16bb491e73e8e0cfe5d79f"
        },
        {
            "@id": "transcripts.fa.transdecoder.cds",
            "@type": "File",
            "name": "transcripts.fa.transdecoder.cds",
            "encodingFormat": "application/octet-stream",
            "contentSize": "18.4 MB",
            "sha256": "4822baa7b338d512544410a2b8e4415d3afafa78d529306010d2b8370663850d"
        },
        {
            "@id": "OUT.gene_counts.tsv",
            "@type": "File",
            "name": "OUT.gene_counts.tsv",
            "encodingFormat": "text/tab-separated-values",
            "contentSize": "1.4 MB",
            "sha256": "a5842812bc296d341d2db6f2357afba571ee63a5b9b3e0c05a8d99c4575ee516"
        },
        {
            "@id": "#execution",
            "@type": "CreateAction",
            "name": "ONT Transcriptomics (IsoQuant + TransDecoder) execution",
            "instrument": {
                "@id": "ont_transcriptomics.cwl"
            },
            "startTime": "2026-03-01T17:08:13+00:00",
            "endTime": "2026-03-01T07:41:33+00:00",
            "object": [
                {
                    "@id": "job.yml"
                }
            ],
            "result": [
                {
                    "@id": "OUT.transcript_models.gtf"
                },
                {
                    "@id": "transcripts.fa.transdecoder.gff3"
                },
                {
                    "@id": "transcripts.fa.transdecoder.pep"
                },
                {
                    "@id": "OUT.transcript_model_counts.tsv"
                },
                {
                    "@id": "OUT.extended_annotation.gtf"
                },
                {
                    "@id": "OUT.read_assignments.tsv.gz"
                },
                {
                    "@id": "OUT.transcript_counts.tsv"
                },
                {
                    "@id": "transcripts.fa.transdecoder.cds"
                },
                {
                    "@id": "OUT.gene_counts.tsv"
                }
            ]
        },
        {
            "@id": "results_summary.json",
            "@type": "File",
            "name": "results_summary.json",
            "description": "Derived summary statistics from pipeline outputs (CPM >= 1, uniquely mapped reads)",
            "encodingFormat": "application/json",
            "contentSize": "6.1 KB",
            "sha256": "3226180b53bb8c2515641d5f796ddfd155f49d6bbbb41e6866148dbc8c795a35"
        },
        {
            "@id": "summary_extractor.py",
            "@type": [
                "File",
                "SoftwareSourceCode"
            ],
            "name": "Summary extraction script",
            "description": "Python script that computed results_summary.json from pipeline outputs",
            "programmingLanguage": {
                "@id": "#python3"
            }
        },
        {
            "@id": "#python3",
            "@type": "ComputerLanguage",
            "name": "Python",
            "url": {
                "@id": "https://www.python.org/"
            },
            "version": "3"
        },
        {
            "@id": "#summary-extraction",
            "@type": "CreateAction",
            "name": "Expression quantification summary",
            "instrument": {
                "@id": "summary_extractor.py"
            },
            "endTime": "2026-03-01T07:41:56+00:00",
            "object": [
                {
                    "@id": "OUT.read_assignments.tsv.gz"
                },
                {
                    "@id": "OUT.gene_counts.tsv"
                },
                {
                    "@id": "OUT.transcript_counts.tsv"
                },
                {
                    "@id": "OUT.extended_annotation.gtf"
                },
                {
                    "@id": "OUT.transcript_models.gtf"
                }
            ],
            "result": [
                {
                    "@id": "results_summary.json"
                }
            ]
        }
    ]
}