What is data provenance?
Updated:
September 16, 2025
Data provenance is the documentation of the origin, history, and transformations applied to a dataset throughout its lifecycle. In research and bioinformatics, provenance enables validation of results, facilitates error detection, and supports regulatory compliance by providing a complete audit trail of data handling. Foundry captures provenance through Metadata Tracker audit trails and exported run parameters (e.g., run_params.json).