OpenAI Inc.

05/19/2026 | News release | Distributed by Public on 05/19/2026 11:47

Advancing content provenance for a safer, more transparent AI ecosystem

May 19, 2026

Safety

Advancing content provenance for a safer, more transparent AI ecosystem

Helping people understand the origin of AI-generated content through Content Credentials, SynthID, and an early public verification tool.

Loading…
Share

People are using OpenAI's tools everyday to create and edit images and audio in ways that make communication more expressive, useful, and accessible. As these tools become a part of how people build, imagine, and share, it's important that people can understand and verify where the media comes from so they can interpret it with more confidence. Provenance signals can help by giving people context about where content came from, how it was created or edited, and whether it is what it claims to be.

Today we're strengthening our approach to content provenance with a multi-layered, ecosystem-driven model to building trust online. We are making our provenance signals easier for other tools and platforms to recognize through C2PA conformance, adding durable cross-platform SynthID watermarking to images through a partnership with Google, and sharing a preview of a tool the public can use to verify whether images came from OpenAI.

Together these updates build on our earlier work to support open standards, make OpenAI-generated content easier to identify, and collaborate across the industry to support a more trustworthy information ecosystem.

Building the trust ecosystem through C2PA conformance

OpenAI has been engaged in the development and adoption of provenance standards since 2024, when we began adding Content Credentials to images generated by DALL·E 3 (opens in a new window) and later to ImageGen (opens in a new window) and Sora (opens in a new window). We also joined the Steering Committee of the Coalition for Content Provenance and Authenticity (C2PA), the cross-industry group behind the open technical standard for content provenance. C2PA's technical approach uses metadata and cryptographic signatures to help information about a piece of media securely travel with the content itself. This information includes context that helps journalists evaluating a source, platforms making integrity decisions, and people trying to understand what they are seeing online.

We recently took the step of making OpenAI a C2PA Conforming Generator Product (opens in a new window). By becoming C2PA conformant, we are giving platforms a trusted way to read, preserve, and pass along the provenance information we attach to our content. This matters because provenance only works if it survives beyond the first platform where content is created, and conformance makes that possible.

A multi-layered approach to provenance with Google SynthID for images

C2PA metadata is an important foundation for provenance. It helps content carry information about where it came from, how it was created or edited, and who signed that information. But metadata is not foolproof. It can be stripped, lost through uploads and downloads, or broken by transformations like file format changes, resizing, or screenshots.

To make provenance more resilient, we are taking a multi-layered approach and incorporating watermarking through Google DeepMind's SynthID (opens in a new window), starting with images generated through ChatGPT , Codex, or the OpenAI API. SynthID embeds an invisible watermarking layer that complements C2PA metadata-based approaches.

We've been building toward this for some time. We have used visible watermarks in Sora and an audio watermark in Voice Engine, and have continued to test and research accuracy and reliability over time. through deployment.

These two systems reinforce each other. C2PA helps content carry detailed context; SynthID helps preserve a signal when metadata does not survive. Watermarking can be more durable through transformations like screenshots, while metadata can provide more information than a watermark alone. Together, they make provenance more resilient than either layer would be on its own.

Detection and a preview of our public verification tool

Trusted metadata and watermarks that resist most modifications can make provenance signals more durable. But people need a way to detect these signals. We are now previewing a public verification tool that will help people verify whether an uploaded image was generated on ChatGPT, the OpenAI API, or Codex, by checking if it contains provenance signals, including Content Credentials and SynthID.

We believe provenance should be easier for people to verify and interpret, and that our tool can help people play a role in answering the question, "Was this generated with AI?" by integrating multiple signals. This builds on learnings from the initial research preview of our image detection classifier in 2024 and enables people to reliably detect whether a SynthID watermark originating from OpenAI is present in the media, as well as surface C2PA metadata when it is found.

No detection method is foolproof, so we take a cautious approach in cases when detection fails. If no metadata or watermark is detected, for example, the tool will not make a definitive conclusion about whether the image was generated with OpenAI tools since provenance signals can in some cases be stripped.

At launch, the tool is limited to content generated by OpenAI. In the upcoming months, we aim to support cross-industry efforts to make verification possible across platforms. Over time, we also expect to support more types of content that people may encounter online.

Looking ahead

No single provenance technique is enough on its own. We believe a strong approach combines shared standards, durable watermarking signals, and public verification. By building on our long-standing support for Content Credentials, becoming conformant with C2PA, adopting SynthID, and previewing public verification tooling, we hope to contribute in the long run to a more interoperable provenance ecosystem.

Author

OpenAI
OpenAI Inc. published this content on May 19, 2026, and is solely responsible for the information contained herein. Distributed via Public Technologies (PUBT), unedited and unaltered, on May 19, 2026 at 17:47 UTC. If you believe the information included in the content is inaccurate or outdated and requires editing or removal, please contact us at [email protected]