ONT Long-Read Transcriptomics (IsoQuant)
DBA/2 combined (bc22+bc23+bc24) — GENCODE M35, complete_genedb
Type
CWL
Status
succeeded
Engine
cwltool
Duration
0.7 h
Source Data
| Study | Strain-specific cortex gene expression and isoform usage |
| Sample prep | PCR-cDNA barcoding library preparation for C57/DBA mouse RNA (SQK-PCB111.24) 2023-03-13 |
| Sample prep | AMPure XP 1.5 kb size selection of C57/DBA PCR-cDNA libraries 2023-05-01 |
| Sample prep | PCR-cDNA barcoding library preparation for C57/DBA mouse RNA (run 3, bc10-15) 2023-05-09 |
| Sequencing | Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA (run 1) 2023-04-21 |
| Sequencing | Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA 1.5kb (run 2) 2023-05-04 |
| Sequencing | Nanopore PCR-cDNA sequencing of C57/DBA mouse RNA (run 3) 2023-05-12 |
| Run Data | Run #61 (6 samples) |
| Run Data | Run #62 (6 samples) |
| Run Data | Run #63 (6 samples) |
| Samples | C57_rep1_bc10 C57_rep1_bc19 C57_rep1_bc19_1.5kb C57_rep2_bc11 C57_rep2_bc20 C57_rep2_bc20_1.5kb C57_rep3_bc12 C57_rep3_bc21 C57_rep3_bc21_1.5kb DBA_rep1_bc13 DBA_rep1_bc22 DBA_rep1_bc22_1.5kb DBA_rep2_bc14 DBA_rep2_bc23 DBA_rep2_bc23_1.5kb DBA_rep3_bc15 DBA_rep3_bc24 DBA_rep3_bc24_1.5kb |
Workflow
ONT Transcriptomics (IsoQuant + TransDecoder)
#cwl
Software Tools
| Tool | Version | URL |
|---|---|---|
| cwltool | - | https://github.com/common-workflow-language/cwltool |
| isoquant_3.6.0--hdfd78af_0.sif | - | - |
Results Summary
Total Reads
2,674,365
Expressed Genes (CPM ≥ 1)
12,095
Novel Transcripts (CPM ≥ 1)
2,457
PolyA Detected
86.4%
Expressed Transcript Sources (CPM ≥ 1, unique reads)
Novel Transcript Categories (CPM ≥ 1, unique reads)
Top Expressed Genes (CPM, unique reads)
Gene Expression Distribution (CPM, unique reads)
Read Assignment Quality
Read Structural Classification
Chromosomal Distribution (unique reads)
Transcript Length Distribution (expressed transcripts, CPM ≥ 1)
15,496 transcripts | Median: 1,841 nt | Mean: 2,255 nt | N50: 2,792 nt
Exons per Transcript (expressed transcripts, CPM ≥ 1)
Isoforms per Gene (expressed transcripts, CPM ≥ 1)
TransDecoder ORF Types (expressed transcripts)
Peptide Length Distribution (expressed transcripts)
16,301 ORFs | Median: 276 aa | Mean: 358 aa | Max: 6,299 aa
Output Files
Input Data
| File | Format | Description |
|---|---|---|
OUT.transcript_models.gtf |
text/plain | - |
OUT.extended_annotation.gtf |
text/plain | - |
OUT.read_assignments.tsv.gz |
application/octet-stream | - |
OUT.transcript_counts.tsv |
text/tab-separated-values | - |
OUT.gene_counts.tsv |
text/tab-separated-values | - |
Provenance
| Execution | Expression quantification summary |
| Completed | 2026-03-01T07:51:27+00:00 |
RO-Crate 1.1
Workflow RO-Crate 1.0
FAIR
This analysis is packaged as a Research Object Crate
with machine-readable provenance and FAIR metadata.
RO-Crate Metadata (JSON-LD)
Show/hide raw JSON-LD
{
"@context": "https://w3id.org/ro/crate/1.1/context",
"@graph": [
{
"@id": "ro-crate-metadata.json",
"@type": "CreativeWork",
"about": {
"@id": "./"
},
"conformsTo": [
{
"@id": "https://w3id.org/ro/crate/1.1"
},
{
"@id": "https://w3id.org/workflowhub/workflow-ro-crate/1.0"
}
]
},
{
"@id": "./",
"@type": "Dataset",
"name": "ONT Transcriptomics (IsoQuant + TransDecoder) \u2014 Run #55",
"description": "Long-read transcriptomics: IsoQuant for transcript discovery and quantification, gffread for FASTA extraction, TransDecoder for ORF prediction.",
"datePublished": "2026-03-01",
"license": {
"@id": "https://creativecommons.org/licenses/by/4.0/"
},
"mainEntity": {
"@id": "ont_transcriptomics.cwl"
},
"hasPart": [
{
"@id": "ont_transcriptomics.cwl"
},
{
"@id": "job.yml"
},
{
"@id": "OUT.transcript_models.gtf"
},
{
"@id": "transcripts.fa.transdecoder.gff3"
},
{
"@id": "transcripts.fa.transdecoder.pep"
},
{
"@id": "OUT.transcript_model_counts.tsv"
},
{
"@id": "OUT.extended_annotation.gtf"
},
{
"@id": "OUT.read_assignments.tsv.gz"
},
{
"@id": "OUT.transcript_counts.tsv"
},
{
"@id": "transcripts.fa.transdecoder.cds"
},
{
"@id": "OUT.gene_counts.tsv"
},
{
"@id": "results_summary.json"
},
{
"@id": "summary_extractor.py"
}
],
"mentions": [
{
"@id": "#execution"
},
{
"@id": "#summary-extraction"
}
]
},
{
"@id": "ont_transcriptomics.cwl",
"@type": [
"File",
"SoftwareSourceCode",
"ComputationalWorkflow"
],
"name": "ONT Transcriptomics (IsoQuant + TransDecoder)",
"description": "#cwl",
"programmingLanguage": {
"@id": "Long-read transcriptomics: IsoQuant for transcript discovery and quantification, gffread for FASTA extraction, TransDecoder for ORF prediction."
},
"contentSize": "1.7 KB",
"sha256": "c3b82f1cff216a3f2d95fa91eef186e33edc1a646b2f06a254984b1c1bd29b96"
},
{
"@id": "#cwl",
"@type": "ComputerLanguage",
"name": "Common Workflow Language",
"url": {
"@id": "https://www.commonwl.org/"
},
"version": "1.2"
},
{
"@id": "#cwltool",
"@type": "SoftwareApplication",
"name": "cwltool",
"url": {
"@id": "https://github.com/common-workflow-language/cwltool"
}
},
{
"@id": "#singularity-container",
"@type": "SoftwareApplication",
"name": "isoquant_3.6.0--hdfd78af_0.sif"
},
{
"@id": "job.yml",
"@type": "File",
"name": "job.yml",
"description": "CWL job input parameters",
"encodingFormat": "text/yaml",
"contentSize": "304 B",
"sha256": "54c4117f28b53f9a52905399ae840c3976ff048c6ed727f6bbd4ebb97cad0b3c"
},
{
"@id": "OUT.transcript_models.gtf",
"@type": "File",
"name": "OUT.transcript_models.gtf",
"encodingFormat": "text/plain",
"contentSize": "51.2 MB",
"sha256": "17a6e5387c9dda139d634956318b72f0b6e37f885888ac380ebfb7ab17370542"
},
{
"@id": "transcripts.fa.transdecoder.gff3",
"@type": "File",
"name": "transcripts.fa.transdecoder.gff3",
"encodingFormat": "text/plain",
"contentSize": "13.7 MB",
"sha256": "bc496deb4498fdd0b838bac716f07c91936ba8477f73bb4660080d6f34859159"
},
{
"@id": "transcripts.fa.transdecoder.pep",
"@type": "File",
"name": "transcripts.fa.transdecoder.pep",
"encodingFormat": "application/octet-stream",
"contentSize": "8.9 MB",
"sha256": "c3e05731ccc1445ef357ad4f8aa0cafcf2188f7892b061ad1d212c0de3d0ff20"
},
{
"@id": "OUT.transcript_model_counts.tsv",
"@type": "File",
"name": "OUT.transcript_model_counts.tsv",
"encodingFormat": "text/tab-separated-values",
"contentSize": "458.2 KB",
"sha256": "30ddad9524f453021f08e2bc832a176ef8e4353981827535d54917e59aeb5a58"
},
{
"@id": "OUT.extended_annotation.gtf",
"@type": "File",
"name": "OUT.extended_annotation.gtf",
"encodingFormat": "text/plain",
"contentSize": "316 MB",
"sha256": "cc884c4a50015152fc948f1ad006108f1b967c63576d90c7c31cb610457301a2"
},
{
"@id": "OUT.read_assignments.tsv.gz",
"@type": "File",
"name": "OUT.read_assignments.tsv.gz",
"encodingFormat": "application/octet-stream",
"contentSize": "70.8 MB",
"sha256": "74f1ec517d48fb923982d7b4baffc33c2772d6d984f03865a326b88290fb710d"
},
{
"@id": "OUT.transcript_counts.tsv",
"@type": "File",
"name": "OUT.transcript_counts.tsv",
"encodingFormat": "text/tab-separated-values",
"contentSize": "3.7 MB",
"sha256": "d618d2c725b41a7216f9d33b6fdc298f3412e76d7b21cc23a11fcdfa1ed3ee45"
},
{
"@id": "transcripts.fa.transdecoder.cds",
"@type": "File",
"name": "transcripts.fa.transdecoder.cds",
"encodingFormat": "application/octet-stream",
"contentSize": "21.3 MB",
"sha256": "c624eb18577d462f4d26fa0c3f553ca19e326490c264b8b0c142677db241f175"
},
{
"@id": "OUT.gene_counts.tsv",
"@type": "File",
"name": "OUT.gene_counts.tsv",
"encodingFormat": "text/tab-separated-values",
"contentSize": "1.4 MB",
"sha256": "a21afd4488044eef771a131fe0386cf506a7f39c34326ade4b4216a764440493"
},
{
"@id": "#execution",
"@type": "CreateAction",
"name": "ONT Transcriptomics (IsoQuant + TransDecoder) execution",
"instrument": {
"@id": "ont_transcriptomics.cwl"
},
"startTime": "2026-03-01T17:08:24+00:00",
"endTime": "2026-03-01T07:50:50+00:00",
"object": [
{
"@id": "job.yml"
}
],
"result": [
{
"@id": "OUT.transcript_models.gtf"
},
{
"@id": "transcripts.fa.transdecoder.gff3"
},
{
"@id": "transcripts.fa.transdecoder.pep"
},
{
"@id": "OUT.transcript_model_counts.tsv"
},
{
"@id": "OUT.extended_annotation.gtf"
},
{
"@id": "OUT.read_assignments.tsv.gz"
},
{
"@id": "OUT.transcript_counts.tsv"
},
{
"@id": "transcripts.fa.transdecoder.cds"
},
{
"@id": "OUT.gene_counts.tsv"
}
]
},
{
"@id": "results_summary.json",
"@type": "File",
"name": "results_summary.json",
"description": "Derived summary statistics from pipeline outputs (CPM >= 1, uniquely mapped reads)",
"encodingFormat": "application/json",
"contentSize": "6 KB",
"sha256": "15d9820ec2e811fa4009704d2ee635784840c171d9ec8df78595efc23205114b"
},
{
"@id": "summary_extractor.py",
"@type": [
"File",
"SoftwareSourceCode"
],
"name": "Summary extraction script",
"description": "Python script that computed results_summary.json from pipeline outputs",
"programmingLanguage": {
"@id": "#python3"
}
},
{
"@id": "#python3",
"@type": "ComputerLanguage",
"name": "Python",
"url": {
"@id": "https://www.python.org/"
},
"version": "3"
},
{
"@id": "#summary-extraction",
"@type": "CreateAction",
"name": "Expression quantification summary",
"instrument": {
"@id": "summary_extractor.py"
},
"endTime": "2026-03-01T07:51:27+00:00",
"object": [
{
"@id": "OUT.read_assignments.tsv.gz"
},
{
"@id": "OUT.gene_counts.tsv"
},
{
"@id": "OUT.transcript_counts.tsv"
},
{
"@id": "OUT.extended_annotation.gtf"
},
{
"@id": "OUT.transcript_models.gtf"
}
],
"result": [
{
"@id": "results_summary.json"
}
]
}
]
}