AI-Ready FAIR Data Platform for Genomics
Research data management designed so your data is structured for machine learning and automated analysis from day one. FAIR-compliant metadata, reproducible workflows, and quality-controlled outputs — ready for AI pipelines, not just archives.
Open source (AGPL-3.0). Self-hostable. No vendor lock-in.
Machine-Readable Metadata
ISA-structured data with ontology annotations exports to ISA-JSON and RO-Crate — formats ML pipelines can consume directly.
Reproducible Provenance
Every CWL workflow run records parameters, containers, and outputs — giving AI models the lineage context they need.
Quality-Controlled Inputs
QC dashboards across three sequencing platforms flag issues before data enters your analysis — cleaner inputs, better models.
What You Can Do
FAIR + AI-Ready Metadata in Minutes
ISA-structured metadata with ontology annotations means your data is FAIR and machine-readable from day one. Export to ISA-JSON, RO-Crate, or CSV — formats that ML pipelines can consume directly.
Never Lose Track of a Pipeline Run
CWL workflow execution with full provenance. Every parameter, container version, and output file is recorded and linked back to the samples that produced it.
Trace Any Result Back to Its Source
Answering "where did this data come from?" takes seconds, not hours. Complete audit trail from sample collection through library prep, sequencing, and analysis.
Spot Quality Issues Before They Cost You
Unified dashboards for Illumina, Oxford Nanopore, and PacBio. See yield, quality scores, and instrument trends across platforms so you catch problems early.
Build on Proven Methods
Versioned, forkable SOPs with step-by-step instructions. Find a protocol that works, adapt it for your organism, and share improvements back to the community.
Explore Genomes Visually
Integrated JBrowse 2 with custom tracks for coverage, junctions, and gene models. Go from a pipeline result to a visual view of your data in one click.
Built for Working Scientists
- Free public projects with discoverable landing pages — get cited, not buried
- Low-friction publishing: your protocols and data become citable, forkable resources
- Training courses in genomics, bioinformatics, and open science practices
- Templates for biodiversity, eDNA, and environmental genomics projects
- Community-driven protocol improvement — build on what others have tested
- A bridge between community participation and professional data practice
See Libre Biotech in Action
These research projects were designed, tracked, analysed, and published using Libre Biotech's open infrastructure.
Discussions
View allContributors
Platform at a Glance
Built for Open Science, Ready for AI
Libre Biotech is open-source research infrastructure that combines laboratory information management, protocol versioning, sequencing QC, compute pipelines, and a genome browser into a single platform — designed so data is FAIR-compliant and AI-ready from the start. No other open tool does all of this.
Licensed under AGPL-3.0. Free as in freedom. Your data stays yours — full export, no lock-in, data sovereignty guaranteed.