Drop a PRML manifest in your repo. Every MLflow run gets cryptographically bound to it. No workflow changes.
pip install mlflow-falsifyThe plugin is discovered automatically through MLflow's mlflow.run_context_provider entry point.
import mlflow
# .prml.yaml exists in CWD or any parent directory — that's all you need.
with mlflow.start_run():
mlflow.log_metric("accuracy", 0.873)
# The run now carries prml.manifest_hash and friends as tags.When a .prml.yaml or prml.yaml is found in the current directory or any ancestor, every run is tagged with:
prml.manifest_hash— SHA-256 of the canonical manifest bytes (PRML v0.1 §3)prml.manifest_path— relative path to the discovered manifestprml.version— manifest schema version (e.g.prml/0.1)prml.metric— the pre-registered metric (e.g.accuracy)prml.comparator— one of>=,>,==,<=,<prml.threshold— the numeric threshold, as a stringprml.dataset_id— the pre-registered dataset identifier
Missing or malformed fields are silently skipped. The provider never raises into your run.
In an HPO sweep the same PRML claim repeats across thousands of runs, so emitting the 5 descriptive tags per-run becomes pure tag noise. As of v0.2.0 the plugin supports lifting them to experiment level:
export MLFLOW_FALSIFY_TAG_SCOPE=experimentimport mlflow_falsify
mlflow.set_experiment("credit-scorer-hpo")
mlflow_falsify.tag_experiment() # idempotent; sets metric/comparator/threshold/dataset_id/version once
for params in hpo_grid:
with mlflow.start_run():
... # only prml.manifest_hash and prml.manifest_path attach per-runDefault behaviour (MLFLOW_FALSIFY_TAG_SCOPE=run or unset) is unchanged: all 7 tags attach per-run, backward-compatible with v0.1.x.
- EU AI Act Article 12 evidence layer. Every logged run carries a tamper-evident pointer to the claim it was meant to test.
- Eval reproducibility by default. The hash freezes metric, threshold, dataset, and seed before the experiment runs.
- Audit trails for free. Reviewers can recompute the manifest hash from the YAML and compare it against your tracked runs.
- No workflow change. Existing MLflow code is untouched — the plugin attaches via entry points.
- PRML specification: spec.falsify.dev/v0.1
- Falsify reference implementation: github.com/studio-11-co/falsify
- Zenodo (this plugin): doi.org/10.5281/zenodo.20235451
- Zenodo (PRML spec): doi.org/10.5281/zenodo.20177839
- Registry: registry.falsify.dev
Where the manifest hash this plugin attaches fits in major AI governance frameworks (FULL / PARTIAL / NONE tagged):
- EU AI Act Article 12 — automated-logging pattern for the 2 December 2027 high-risk deadline
- NIST AI RMF 1.0 — GOVERN / MAP / MEASURE / MANAGE subcategory map
- ISO/IEC 42001:2023 — AI Management System clause-by-clause evidence map
MIT. Copyright 2026 Cüneyt Öztürk.