FAQs

What is data provenance?

Updated: September 16, 2025

Data provenance is the documented record of a dataset's origin, history, and transformations throughout its lifecycle. In research and bioinformatics, provenance makes it possible to validate results, trace errors back to the step that introduced them, and support regulatory compliance by providing a complete audit trail of data handling. Foundry captures provenance through Metadata Tracker audit trails and exported run parameters (e.g., run_params.json).
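
To make the idea concrete, here is a minimal Python sketch of what a provenance record for a single run might contain. It is illustrative only: the field names, the `build_provenance_record` and `write_provenance_record` helpers, and the tool name are assumptions made for this example, not Foundry's actual run_params.json schema or API.

```python
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_provenance_record(input_path: Path, tool: str, params: dict) -> dict:
    """Assemble a provenance record.

    Hypothetical schema for illustration; not Foundry's real format.
    """
    return {
        # Origin: which file was used, identified by content hash
        "input": {"path": str(input_path), "sha256": sha256_of(input_path)},
        # Transformation: what was run and with which settings
        "tool": tool,
        "parameters": params,
        # History: when the record was created (UTC, ISO 8601)
        "recorded_at": datetime.now(timezone.utc).isoformat(),
    }


def write_provenance_record(record: dict, out_path: Path) -> None:
    """Write the record as sorted, indented JSON so diffs stay readable."""
    out_path.write_text(json.dumps(record, indent=2, sort_keys=True))
```

For example, calling `build_provenance_record(Path("reads.fastq"), "example-aligner", {"threads": 4})` and passing the result to `write_provenance_record` would produce a JSON file that records the input's content hash, the tool, and its parameters. Hashing the input rather than relying on its file name means a later reader can confirm that the exact same data was used.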