Libre Biotech

ONT Long-Read Transcriptomics (IsoQuant)

DBA/2 combined (bc22+bc23+bc24) — GENCODE M35, complete_genedb

Type
CWL
Status
succeeded
Engine
cwltool
Duration
0.7 h
Source Data
Study Strain-specific cortex gene expression and isoform usage
Sample prep PCR-cDNA barcoding library preparation for C57/DBA mouse RNA (SQK-PCB111.24) 2023-03-13
Sample prep AMPure XP 1.5 kb size selection of C57/DBA PCR-cDNA libraries 2023-05-01
Sample prep PCR-cDNA barcoding library preparation for C57/DBA mouse RNA (run 3, bc10-15) 2023-05-09
Sequencing Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA (run 1) 2023-04-21
Sequencing Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA 1.5kb (run 2) 2023-05-04
Sequencing Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA (run 3) 2023-05-12
Run Data Run #61 (6 samples)
Run Data Run #62 (6 samples)
Run Data Run #63 (6 samples)
Samples C57_rep1_bc10 C57_rep1_bc19 C57_rep1_bc19_1.5kb C57_rep2_bc11 C57_rep2_bc20 C57_rep2_bc20_1.5kb C57_rep3_bc12 C57_rep3_bc21 C57_rep3_bc21_1.5kb DBA_rep1_bc13 DBA_rep1_bc22 DBA_rep1_bc22_1.5kb DBA_rep2_bc14 DBA_rep2_bc23 DBA_rep2_bc23_1.5kb DBA_rep3_bc15 DBA_rep3_bc24 DBA_rep3_bc24_1.5kb

Workflow

ONT Transcriptomics (IsoQuant + TransDecoder)

#cwl

Software Tools

ToolVersionURL
cwltool - https://github.com/common-workflow-language/cwltool
isoquant_3.6.0--hdfd78af_0.sif - -

Results Summary

Total Reads
2,674,365
Expressed Genes (CPM ≥ 1)
12,095
Novel Transcripts (CPM ≥ 1)
2,457
PolyA Detected
86.4%

Expressed Transcript Sources (CPM ≥ 1, unique reads)

Novel Transcript Categories (CPM ≥ 1, unique reads)

Top Expressed Genes (CPM, unique reads)

Gene Expression Distribution (CPM, unique reads)

Read Assignment Quality

Read Structural Classification

Chromosomal Distribution (unique reads)

Transcript Length Distribution (expressed transcripts, CPM ≥ 1)

15,496 transcripts | Median: 1,841 nt | Mean: 2,255 nt | N50: 2,792 nt

Exons per Transcript (expressed transcripts, CPM ≥ 1)

Isoforms per Gene (expressed transcripts, CPM ≥ 1)

TransDecoder ORF Types (expressed transcripts)

Peptide Length Distribution (expressed transcripts)

16,301 ORFs | Median: 276 aa | Mean: 358 aa | Max: 6,299 aa

Output Files

OUT.extended_annotation.gtf HPC 316 MB OUT.gene_counts.tsv HPC 1.4 MB OUT.read_assignments.tsv.gz HPC 70.8 MB OUT.transcript_counts.tsv HPC 3.7 MB OUT.transcript_model_counts.tsv HPC 458.2 KB OUT.transcript_models.gtf HPC 51.2 MB job.yml HPC 304 B results_summary.json HPC 6 KB transcripts.fa.transdecoder.cds HPC 21.3 MB transcripts.fa.transdecoder.gff3 HPC 13.7 MB transcripts.fa.transdecoder.pep HPC 8.9 MB

Input Data

FileFormatDescription
OUT.transcript_models.gtf text/plain -
OUT.extended_annotation.gtf text/plain -
OUT.read_assignments.tsv.gz application/octet-stream -
OUT.transcript_counts.tsv text/tab-separated-values -
OUT.gene_counts.tsv text/tab-separated-values -

Provenance

Execution Expression quantification summary
Completed 2026-03-01T07:51:27+00:00
RO-Crate 1.1 Workflow RO-Crate 1.0 FAIR
This analysis is packaged as a Research Object Crate with machine-readable provenance and FAIR metadata.

RO-Crate Metadata (JSON-LD)

Show/hide raw JSON-LD
{
    "@context": "https://w3id.org/ro/crate/1.1/context",
    "@graph": [
        {
            "@id": "ro-crate-metadata.json",
            "@type": "CreativeWork",
            "about": {
                "@id": "./"
            },
            "conformsTo": [
                {
                    "@id": "https://w3id.org/ro/crate/1.1"
                },
                {
                    "@id": "https://w3id.org/workflowhub/workflow-ro-crate/1.0"
                }
            ]
        },
        {
            "@id": "./",
            "@type": "Dataset",
            "name": "ONT Transcriptomics (IsoQuant + TransDecoder) \u2014 Run #55",
            "description": "Long-read transcriptomics: IsoQuant for transcript discovery and quantification, gffread for FASTA extraction, TransDecoder for ORF prediction.",
            "datePublished": "2026-03-01",
            "license": {
                "@id": "https://creativecommons.org/licenses/by/4.0/"
            },
            "mainEntity": {
                "@id": "ont_transcriptomics.cwl"
            },
            "hasPart": [
                {
                    "@id": "ont_transcriptomics.cwl"
                },
                {
                    "@id": "job.yml"
                },
                {
                    "@id": "OUT.transcript_models.gtf"
                },
                {
                    "@id": "transcripts.fa.transdecoder.gff3"
                },
                {
                    "@id": "transcripts.fa.transdecoder.pep"
                },
                {
                    "@id": "OUT.transcript_model_counts.tsv"
                },
                {
                    "@id": "OUT.extended_annotation.gtf"
                },
                {
                    "@id": "OUT.read_assignments.tsv.gz"
                },
                {
                    "@id": "OUT.transcript_counts.tsv"
                },
                {
                    "@id": "transcripts.fa.transdecoder.cds"
                },
                {
                    "@id": "OUT.gene_counts.tsv"
                },
                {
                    "@id": "results_summary.json"
                },
                {
                    "@id": "summary_extractor.py"
                }
            ],
            "mentions": [
                {
                    "@id": "#execution"
                },
                {
                    "@id": "#summary-extraction"
                }
            ]
        },
        {
            "@id": "ont_transcriptomics.cwl",
            "@type": [
                "File",
                "SoftwareSourceCode",
                "ComputationalWorkflow"
            ],
            "name": "ONT Transcriptomics (IsoQuant + TransDecoder)",
            "description": "#cwl",
            "programmingLanguage": {
                "@id": "Long-read transcriptomics: IsoQuant for transcript discovery and quantification, gffread for FASTA extraction, TransDecoder for ORF prediction."
            },
            "contentSize": "1.7 KB",
            "sha256": "c3b82f1cff216a3f2d95fa91eef186e33edc1a646b2f06a254984b1c1bd29b96"
        },
        {
            "@id": "#cwl",
            "@type": "ComputerLanguage",
            "name": "Common Workflow Language",
            "url": {
                "@id": "https://www.commonwl.org/"
            },
            "version": "1.2"
        },
        {
            "@id": "#cwltool",
            "@type": "SoftwareApplication",
            "name": "cwltool",
            "url": {
                "@id": "https://github.com/common-workflow-language/cwltool"
            }
        },
        {
            "@id": "#singularity-container",
            "@type": "SoftwareApplication",
            "name": "isoquant_3.6.0--hdfd78af_0.sif"
        },
        {
            "@id": "job.yml",
            "@type": "File",
            "name": "job.yml",
            "description": "CWL job input parameters",
            "encodingFormat": "text/yaml",
            "contentSize": "304 B",
            "sha256": "54c4117f28b53f9a52905399ae840c3976ff048c6ed727f6bbd4ebb97cad0b3c"
        },
        {
            "@id": "OUT.transcript_models.gtf",
            "@type": "File",
            "name": "OUT.transcript_models.gtf",
            "encodingFormat": "text/plain",
            "contentSize": "51.2 MB",
            "sha256": "17a6e5387c9dda139d634956318b72f0b6e37f885888ac380ebfb7ab17370542"
        },
        {
            "@id": "transcripts.fa.transdecoder.gff3",
            "@type": "File",
            "name": "transcripts.fa.transdecoder.gff3",
            "encodingFormat": "text/plain",
            "contentSize": "13.7 MB",
            "sha256": "bc496deb4498fdd0b838bac716f07c91936ba8477f73bb4660080d6f34859159"
        },
        {
            "@id": "transcripts.fa.transdecoder.pep",
            "@type": "File",
            "name": "transcripts.fa.transdecoder.pep",
            "encodingFormat": "application/octet-stream",
            "contentSize": "8.9 MB",
            "sha256": "c3e05731ccc1445ef357ad4f8aa0cafcf2188f7892b061ad1d212c0de3d0ff20"
        },
        {
            "@id": "OUT.transcript_model_counts.tsv",
            "@type": "File",
            "name": "OUT.transcript_model_counts.tsv",
            "encodingFormat": "text/tab-separated-values",
            "contentSize": "458.2 KB",
            "sha256": "30ddad9524f453021f08e2bc832a176ef8e4353981827535d54917e59aeb5a58"
        },
        {
            "@id": "OUT.extended_annotation.gtf",
            "@type": "File",
            "name": "OUT.extended_annotation.gtf",
            "encodingFormat": "text/plain",
            "contentSize": "316 MB",
            "sha256": "cc884c4a50015152fc948f1ad006108f1b967c63576d90c7c31cb610457301a2"
        },
        {
            "@id": "OUT.read_assignments.tsv.gz",
            "@type": "File",
            "name": "OUT.read_assignments.tsv.gz",
            "encodingFormat": "application/octet-stream",
            "contentSize": "70.8 MB",
            "sha256": "74f1ec517d48fb923982d7b4baffc33c2772d6d984f03865a326b88290fb710d"
        },
        {
            "@id": "OUT.transcript_counts.tsv",
            "@type": "File",
            "name": "OUT.transcript_counts.tsv",
            "encodingFormat": "text/tab-separated-values",
            "contentSize": "3.7 MB",
            "sha256": "d618d2c725b41a7216f9d33b6fdc298f3412e76d7b21cc23a11fcdfa1ed3ee45"
        },
        {
            "@id": "transcripts.fa.transdecoder.cds",
            "@type": "File",
            "name": "transcripts.fa.transdecoder.cds",
            "encodingFormat": "application/octet-stream",
            "contentSize": "21.3 MB",
            "sha256": "c624eb18577d462f4d26fa0c3f553ca19e326490c264b8b0c142677db241f175"
        },
        {
            "@id": "OUT.gene_counts.tsv",
            "@type": "File",
            "name": "OUT.gene_counts.tsv",
            "encodingFormat": "text/tab-separated-values",
            "contentSize": "1.4 MB",
            "sha256": "a21afd4488044eef771a131fe0386cf506a7f39c34326ade4b4216a764440493"
        },
        {
            "@id": "#execution",
            "@type": "CreateAction",
            "name": "ONT Transcriptomics (IsoQuant + TransDecoder) execution",
            "instrument": {
                "@id": "ont_transcriptomics.cwl"
            },
            "startTime": "2026-03-01T17:08:24+00:00",
            "endTime": "2026-03-01T07:50:50+00:00",
            "object": [
                {
                    "@id": "job.yml"
                }
            ],
            "result": [
                {
                    "@id": "OUT.transcript_models.gtf"
                },
                {
                    "@id": "transcripts.fa.transdecoder.gff3"
                },
                {
                    "@id": "transcripts.fa.transdecoder.pep"
                },
                {
                    "@id": "OUT.transcript_model_counts.tsv"
                },
                {
                    "@id": "OUT.extended_annotation.gtf"
                },
                {
                    "@id": "OUT.read_assignments.tsv.gz"
                },
                {
                    "@id": "OUT.transcript_counts.tsv"
                },
                {
                    "@id": "transcripts.fa.transdecoder.cds"
                },
                {
                    "@id": "OUT.gene_counts.tsv"
                }
            ]
        },
        {
            "@id": "results_summary.json",
            "@type": "File",
            "name": "results_summary.json",
            "description": "Derived summary statistics from pipeline outputs (CPM >= 1, uniquely mapped reads)",
            "encodingFormat": "application/json",
            "contentSize": "6 KB",
            "sha256": "15d9820ec2e811fa4009704d2ee635784840c171d9ec8df78595efc23205114b"
        },
        {
            "@id": "summary_extractor.py",
            "@type": [
                "File",
                "SoftwareSourceCode"
            ],
            "name": "Summary extraction script",
            "description": "Python script that computed results_summary.json from pipeline outputs",
            "programmingLanguage": {
                "@id": "#python3"
            }
        },
        {
            "@id": "#python3",
            "@type": "ComputerLanguage",
            "name": "Python",
            "url": {
                "@id": "https://www.python.org/"
            },
            "version": "3"
        },
        {
            "@id": "#summary-extraction",
            "@type": "CreateAction",
            "name": "Expression quantification summary",
            "instrument": {
                "@id": "summary_extractor.py"
            },
            "endTime": "2026-03-01T07:51:27+00:00",
            "object": [
                {
                    "@id": "OUT.read_assignments.tsv.gz"
                },
                {
                    "@id": "OUT.gene_counts.tsv"
                },
                {
                    "@id": "OUT.transcript_counts.tsv"
                },
                {
                    "@id": "OUT.extended_annotation.gtf"
                },
                {
                    "@id": "OUT.transcript_models.gtf"
                }
            ],
            "result": [
                {
                    "@id": "results_summary.json"
                }
            ]
        }
    ]
}