Libre Biotech

AI-Ready FAIR Data Platform for Genomics

Research data management designed so your data is structured for machine learning and automated analysis from day one. FAIR-compliant metadata, reproducible workflows, and quality-controlled outputs — ready for AI pipelines, not just archives.

Open source (AGPL-3.0). Self-hostable. No vendor lock-in.

Machine-Readable Metadata

ISA-structured data with ontology annotations exports to ISA-JSON and RO-Crate — formats ML pipelines can consume directly.

Reproducible Provenance

Every CWL workflow run records parameters, containers, and outputs — giving AI models the lineage context they need.

Quality-Controlled Inputs

QC dashboards across three sequencing platforms flag issues before data enters your analysis — cleaner inputs, better models.

What You Can Do

FAIR + AI-Ready Metadata in Minutes

ISA-structured metadata with ontology annotations means your data is FAIR and machine-readable from day one. Export to ISA-JSON, RO-Crate, or CSV — formats that ML pipelines can consume directly.

Never Lose Track of a Pipeline Run

CWL workflow execution with full provenance. Every parameter, container version, and output file is recorded and linked back to the samples that produced it.

Trace Any Result Back to Its Source

Answering "where did this data come from?" takes seconds, not hours. Complete audit trail from sample collection through library prep, sequencing, and analysis.

Spot Quality Issues Before They Cost You

Unified dashboards for Illumina, Oxford Nanopore, and PacBio. See yield, quality scores, and instrument trends across platforms so you catch problems early.

Build on Proven Methods

Versioned, forkable SOPs with step-by-step instructions. Find a protocol that works, adapt it for your organism, and share improvements back to the community.

Explore Genomes Visually

Integrated JBrowse 2 with custom tracks for coverage, junctions, and gene models. Go from a pipeline result to a visual view of your data in one click.

Built for Working Scientists

  • Free public projects with discoverable landing pages — get cited, not buried
  • Low-friction publishing: your protocols and data become citable, forkable resources
  • Training courses in genomics, bioinformatics, and open science practices
  • Templates for biodiversity, eDNA, and environmental genomics projects
  • Community-driven protocol improvement — build on what others have tested
  • A bridge between community participation and professional data practice

Platform at a Glance

2
Research Projects
29
Protocols
11
Courses
3
Discussions
11
Groups

Built for Open Science, Ready for AI

Libre Biotech is open-source research infrastructure that combines laboratory information management, protocol versioning, sequencing QC, compute pipelines, and a genome browser into a single platform — designed so data is FAIR-compliant and AI-ready from the start. No other open tool does all of this.

Licensed under AGPL-3.0. Free as in freedom. Your data stays yours — full export, no lock-in, data sovereignty guaranteed.