Libre Biotech

Key Concepts

Understand the data model, FAIR principles, and access control that underpin the platform.

The ISA framework

Libre Biotech organises research data using the ISA (Investigation–Study–Assay) framework, an internationally recognised standard for structuring experimental metadata. ISA is maintained by the ISA Tools community and is widely adopted in life sciences.

Hierarchy

Investigation          (research project)
  └── Study            (experimental campaign)
       └── Process     (a specific activity)
            ├── Input Samples
            ├── Output Samples
            ├── Files
            ├── People
            └── Assays / Measurements
LevelWhat it representsExample
InvestigationA research project or programme"Moth transcriptome characterisation"
StudyAn experimental campaign within the investigation"Larval tissue RNA-Seq"
ProcessA specific activity that transforms inputs to outputs"RNA extraction from fat body tissue"
SampleA biological or digital material"HaFB_RNA_01" (extracted RNA sample)

Process categories

Processes are classified by category, which determines the metadata fields available and how they connect in the workflow:

CategoryPurposeExamples
ExtractionIsolating material from a sourceDNA extraction, RNA extraction, tissue dissection
Sample PreparationPreparing samples for analysisLibrary preparation, bead selection, pooling
SequencingGenerating sequence dataNanopore sequencing run, Illumina run submission
MeasurementQuantitative assessmentFragment Analyzer QC, Qubit quantification
Data TransformationComputational analysis of dataAlignment, assembly, annotation pipeline

Process chains

Processes link together through their input and output samples to form process chains — a complete provenance trail from raw material to final result.

Dissection → RNA Extraction → Library Prep → Sequencing → Analysis
  (tissue)    (RNA sample)     (library)     (FASTQ)      (GFF3, BAM)

Each step in the chain records who performed it, which protocol was followed, what equipment was used, and what files were produced. This gives you full reproducibility and audit capability.

FAIR principles

Libre Biotech is built on the FAIR principles — guidelines to make research data Findable, Accessible, Interoperable, and Reusable.

PrincipleHow Libre Biotech implements it
FindableRich metadata with ISA structure, ontology annotations from 3M+ OBO terms, searchable public API, public discovery pages
AccessibleOpen access to public content without login, REST API with CORS, share links for controlled access
InteroperableISA-JSON and ISA-Tab export, CWL workflow definitions, RO-Crate packaging, standard file formats (FASTQ, BAM, VCF, GFF3)
ReusableSample provenance chains, protocol versioning with changelogs, per-entity licensing, community review
FAIR score: Each investigation displays a FAIR self-assessment score showing how well it meets each principle (Findable, Accessible, Interoperable, Reusable), with actionable suggestions for improvement.

Ontology annotations

Libre Biotech includes over 3 million ontology terms from the OBO Foundry, enabling precise, machine-readable annotations on any entity. Common ontologies include:

  • NCBI Taxonomy — Species and organism classification
  • EFO — Experimental Factor Ontology
  • OBI — Ontology for Biomedical Investigations
  • UBERON — Anatomical structures
  • ENVO — Environmental conditions
  • SO — Sequence Ontology

The term picker (available on process, sample, and investigation pages) provides type-ahead search across all loaded ontologies. Terms are stored with their CURIE identifiers (e.g., NCBI:7529) for machine readability.

Access control

Access control operates at three levels:

Visibility levels

LevelWho can see it
PrivateMembers of the owning group only
GroupMembers of associated groups (e.g., a research group + the core facility)
PublicAnyone, including non-logged-in visitors and search engines

Group roles

Within a group, each member has a role that determines what they can do:

  • Leader — Full control: create/delete investigations, manage members, change visibility settings
  • Manager — Create and edit content, manage processes, invite members
  • Member — View content, create processes and samples, upload files

Admin role

Platform administrators have visibility across all groups and can manage users, instruments, and system-wide settings. Admin status is separate from group roles.

Data ownership

You own your data. Libre Biotech has a licence to host and display it — nothing more. Key guarantees:

  • Full export at any time in standard formats
  • No proprietary schemas — ISA, CWL, and RO-Crate throughout
  • Account deletion removes your personal data
  • 90-day notice before any service changes

See the Data Sovereignty Statement for the complete commitment.