Glossary / Synthetic Media

Provenance Metadata

Provenance metadata is the embedded data carrying a media asset's origin record — the capturing device or generating tool, the edit history, timestamps, and whether AI was involved.

Provenance metadata is the payload that makes content provenance verifiable. Under C2PA, it is structured as a cryptographically signed manifest bound to the file, so any modification to the content invalidates the signature — making tampering detectable.

The structural weakness is portability. Standard upload and transcoding pipelines often strip embedded metadata, so signed content can reach a viewer with its manifest removed. The absence of provenance metadata does not prove content is fake — only that it lacks a verifiable record. This is why durable disclosure pairs metadata with watermarking and fingerprinting.

FAQ

What is provenance metadata?

It is the embedded data carrying a media asset's origin record — device or tool, edit history, timestamps, and AI involvement.

Why does provenance metadata get stripped?

Standard upload and transcoding pipelines often remove embedded metadata as a byproduct of processing, not deliberate suppression.