Libre Biotech — AI-Ready FAIR Data Platform for Genomics

Machine-Readable Metadata

ISA-structured data with ontology annotations exports to ISA-JSON and RO-Crate — formats ML pipelines can consume directly.

Reproducible Provenance

Every CWL workflow run records parameters, containers, and outputs — giving AI models the lineage context they need.

Quality-Controlled Inputs

QC dashboards across three sequencing platforms flag issues before data enters your analysis — cleaner inputs, better models.

What You Can Do

FAIR + AI-Ready Metadata in Minutes

ISA-structured metadata with ontology annotations means your data is FAIR and machine-readable from day one. Export to ISA-JSON, RO-Crate, or CSV — formats that ML pipelines can consume directly.

Never Lose Track of a Pipeline Run

CWL workflow execution with full provenance. Every parameter, container version, and output file is recorded and linked back to the samples that produced it.

Trace Any Result Back to Its Source

Answering "where did this data come from?" takes seconds, not hours. Complete audit trail from sample collection through library prep, sequencing, and analysis.

Spot Quality Issues Before They Cost You

Unified dashboards for Illumina, Oxford Nanopore, and PacBio. See yield, quality scores, and instrument trends across platforms so you catch problems early.

Build on Proven Methods

Versioned, forkable SOPs with step-by-step instructions. Find a protocol that works, adapt it for your organism, and share improvements back to the community.

Explore Genomes Visually

Integrated JBrowse 2 with custom tracks for coverage, junctions, and gene models. Go from a pipeline result to a visual view of your data in one click.

Built for Working Scientists

Free public projects with discoverable landing pages — get cited, not buried
Low-friction publishing: your protocols and data become citable, forkable resources
Training courses in genomics, bioinformatics, and open science practices

Templates for biodiversity, eDNA, and environmental genomics projects
Community-driven protocol improvement — build on what others have tested
A bridge between community participation and professional data practice

Data Structured for Machine Learning

Machine-readable exports (ISA-JSON, RO-Crate) that feed directly into ML pipelines
Complete provenance chains — every sample, process, and analysis linked with full parameter tracking
Quality-controlled inputs via sequencing QC dashboards — catch data issues before they corrupt your training set

Ontology-annotated metadata using standard vocabularies — no custom parsing required
CWL workflows with containerised tools — reproduce any analysis, audit any result
REST API with programmatic access to all metadata, samples, and analysis outputs

API Docs, Code Examples & Data Formats

Infrastructure Your Facility Can Rely On

FAIR + AI-ready metadata capture for projects, samples, assays, and workflows
Role-based access control with group permissions and data embargoes
Reproducibility and provenance tracking across your organisation
Data structured for both compliance and computational reuse from day one

Open APIs and exportable metadata — no vendor lock-in, ever
Managed hosting, onboarding, training, and support available
Data sovereignty guaranteed — your data stays yours, full stop
Video training courses for FAIR data management and platform onboarding

See Libre Biotech in Action

These research projects were designed, tracked, analysed, and published using Libre Biotech's open infrastructure.

C57BL/6 and DBA/2 Mouse Cortex Transcriptomics

1 study

Comparative long-read transcriptomics of cortex tissue from C57BL/6J and DBA/2J inbred mouse strains using Oxford Nanopore PCR-cDNA seque...

Mouse transcriptomics

Decoding DNA Methylation Dynamics in the Cotton Bollworm (Helicoverpa armigera)

3 studies

Helicoverpa armigera is a globally significant, polyphagous pest responsible for multi-billion-dollar crop losses each year. Emerging eviden...

Capstone students (2025)

Platform at a Glance

2

Research Projects

29

Protocols

11

Courses

3

Discussions

11

Groups

AI-Ready FAIR Data Platform for Genomics