Black Hat2024

Tracing Origins: Navigating Content Authenticity in the Deepfake Era

Black Hat940 views39:23about 1 year ago

This talk explores the implementation of the Coalition for Content Provenance and Authenticity (C2PA) standard to combat the proliferation of deepfakes and AI-generated disinformation. It details how cryptographic manifests are embedded into media files to provide verifiable provenance and authenticity metadata, allowing users to trace the origin and modification history of digital assets. The presentation highlights the role of hardware-level signing and open-source SDKs in establishing a chain of trust for digital content. The speaker also demonstrates how these tools can detect unauthorized edits to AI-generated images.

How Cryptographic Manifests Are Changing the Game for Content Provenance

TLDR: The C2PA standard introduces a cryptographic layer to digital media, allowing creators to embed verifiable provenance and modification history directly into files. While this doesn't stop deepfakes, it provides a standardized way to detect unauthorized edits and verify the origin of digital assets. Pentesters should start looking at how these manifests are handled in their target applications, as they represent a new, critical attack surface for data manipulation.

Deepfakes are no longer a theoretical threat. They are actively being used to manipulate public perception, facilitate financial fraud, and erode trust in digital communication. The industry response has largely been reactive, relying on detection models that are constantly being outpaced by generative AI. The Coalition for Content Provenance and Authenticity (C2PA) is shifting the focus from detection to verification. By embedding cryptographic manifests into media files, the C2PA standard provides a way to trace the origin and modification history of an asset, effectively creating a chain of trust that is difficult to forge without detection.

The Mechanics of C2PA Manifests

At its core, the C2PA standard is about transparency. When a file is created or modified, the software generates a manifest that includes metadata about the source, the tools used, and any subsequent edits. This manifest is cryptographically signed, ensuring that any unauthorized modification to the file or its metadata will break the signature.

The process is straightforward. When you create an asset, the software generates a hash of the file and creates a series of assertions. These assertions might include the GPS coordinates of where a photo was taken, the camera model, or the specific AI model used to generate the image. These assertions are bundled into a claim, which is then signed by the creator's private key.

For a pentester, this is a new target. If an application claims to support C2PA, you need to test how it handles these manifests. Does the application strip the metadata upon upload? Does it validate the signature against a trusted root certificate? If the application allows for editing, does it correctly update the manifest to reflect the changes, or does it leave the original signature intact, creating a potential for data manipulation that could mislead users?

Technical Implementation and Testing

The C2PA ecosystem relies on open-source tools to manage these manifests. You can start exploring this by looking at the c2pa-tool, which allows you to inspect and validate the manifests embedded in images and videos.

If you are testing an application that processes these files, you should attempt to inject or modify the manifest data. A simple way to start is by using the c2pa-attacks-tool, which is designed to test the robustness of C2PA implementations. This tool can help you identify if an application is properly validating the integrity of the manifest or if it is susceptible to stripping attacks where the metadata is removed but the file remains functional.

# Example of inspecting a C2PA-enabled image
c2pa-tool inspect image.jpg

When you are on an engagement, look for the presence of these manifests in the file headers. If you find them, your goal is to determine if the application treats them as authoritative. If you can modify the image content without updating the manifest, you have successfully bypassed the integrity check. This is a high-impact finding, especially in environments where content authenticity is a regulatory requirement.

Real-World Applicability

The risk here is not just about fake news. It is about the integrity of the entire digital supply chain. Imagine a scenario where a financial institution uses C2PA to verify the identity of a customer submitting a document. If an attacker can manipulate the document and maintain a valid-looking, albeit forged, manifest, they could bypass identity verification controls.

During a red team engagement, you should treat C2PA-enabled files as part of the application's security boundary. If the application uses these files to make authorization decisions, you need to understand the underlying trust model. Does the application rely on a centralized list of trusted CAs, or does it allow for custom, local CAs? If the latter, you might be able to inject your own certificates to sign malicious content that the application will then trust as authentic.

The Defensive Perspective

Defenders need to ensure that their infrastructure is capable of validating these signatures at scale. This means maintaining a robust Public Key Infrastructure (PKI) and ensuring that all ingestion pipelines are configured to reject files with broken or missing signatures. It is not enough to simply display the metadata; the application must actively validate the cryptographic chain back to a trusted root.

The C2PA standard is a significant step forward in the fight against disinformation, but it is not a silver bullet. It provides the tools for verification, but the security of the system depends entirely on the implementation. As researchers, we need to stress-test these implementations, identify the edge cases, and ensure that the promise of content authenticity doesn't become a false sense of security. Start by auditing how your organization's applications handle these manifests, and don't assume that a valid signature is proof of truth. Always verify the context, the source, and the history of the data you are processing.

Talk Type

research presentation

Difficulty

intermediate