Open data platform for biology
Enable learning at scale with an open-source framework. One API: lakehouse, lineage, feature store, ontologies, LIMS, and ELN.

Trace data & code
Know where a dataset came from and what it's used for. Track data lineage with a single line of code.
Manage datasets at scale
Query flexibly across storage and databases with a biology-aware lakehouse that supports AnnData, SpatialData, zarr, and more.

Manage flexible metadata
One Python class for your LIMS & ELN: experiments, samples, datasets, models, notes, reports, and more. Built on the Django ORM with ontology support.

Validate & annotate datasets
Use schemas to enforce consistency. Annotate datasets with a single line of code.
Administrate with ease while staying in control
Manage fine-grained access with SaaS-like simplicity while maintaining direct admin control through Postgres and AWS S3.

Build your organization's long-term memory
Your data, models, and reports are auto-connected during regular operations so that your team and agents can learn & improve.
