What is data lineage?
Updated:
September 16, 2025
Data lineage is the detailed tracking of data’s journey from its origin through all processing, transformations, and storage steps to its final form. In bioinformatics, it helps ensure reproducibility, validate analyses, and manage complex pipelines across datasets and tools. In Foundry, lineage appears via audit trails, versioned processes, and run reports that record method revisions and parameters.