Data provenance (robotics datasets)
Data provenance is the documented record of where each piece of training data came from - who produced it, with what consent, and how it was processed. For robotics datasets it includes a provenance log tracing every clip to its source, plus a dataset card describing scope and limitations.
Provenance is increasingly a buying requirement under data-governance rules such as the EU AI Act and India’s DPDP Act.