Google Open-Sources SynthID Text Watermarking Technology AI NEWS

Home
AInews
Google Open-Sources SynthID Text Watermarking Technology

Google Open-Sources SynthID Text Watermarking Technology

2024-10-24

Google has announced the open-source release of SynthID, its proprietary text watermarking technology, through the company's Responsible Generative AI Toolkit. The technology is designed to enhance the detectability of AI-generated text.

Pushmit Koli, Vice President of Research at Google DeepMind, told MIT Technology Review, "Now, other generative AI developers will be able to utilize this technology to detect whether text outputs are generated by their own large language models, enabling more developers to build AI responsibly."

As large language models are increasingly used to disseminate political disinformation, generate non-consensual explicit content, and for other malicious intents, watermarking has become an essential tool. California is contemplating mandatory AI watermarking, and the Chinese government began enforcing this requirement last year. Nonetheless, these tools remain under development.

Introduced last August, SynthID embeds invisible watermarks during the generation of images, audio, video, and text, enabling the detection of AI-produced outputs. Google explains that the text version of SynthID operates by subtly decreasing the likelihood of certain text outputs, in a way that is detectable by software but remains unnoticed by humans.

Large language models generate text tokens sequentially, where each token can represent an individual character, word, or a segment of a phrase. To create coherent text sequences, the model predicts the next most probable token based on preceding words and their associated probability scores.

For instance, in the sentence "My favorite tropical fruit is __," a large language model might begin completing it with tokens such as "mango," "lychee," "papaya," or "durian," assigning each a probability score. When multiple tokens are available for selection, SynthID can adjust the probability scores of each predicted token in a way that does not affect the quality, accuracy, or creativity of the output.

This procedure is applied consistently across the entire generated text, resulting in a single sentence potentially containing ten or more modified probability scores, and a full page may include hundreds. The final pattern formed by the selected words and their adjusted probability scores constitutes the watermark.

Google asserts that SynthID has been integrated into its Gemini chatbot without sacrificing the quality, accuracy, creativity, or speed of the generated text—a common challenge with watermark systems. The company states that the technology is effective for texts comprising at least three sentences and for content that has been cropped, rewritten, or modified. However, it faces difficulties when dealing with short texts, rewritten or translated material, and even factual responses.

In a blog post released in May, Google stated, "SynthID is not a cure-all for identifying AI-generated content. Nevertheless, it serves as a crucial foundation for developing more dependable AI detection tools and aids millions in making informed decisions about interacting with AI-generated material."

Face Detector

Face Detector - Analyze face shape from uploaded photos

Glambase

Glambase - Create and monetize AI influencers.

Aider Chat

Aider Chat - Pair program with AI in terminal.

Tidio Chat

Tidio Chat - Manage customer communications through live chat, email, and chatbots.

Botpress

Botpress - Build and manage AI chatbots.

Theee AI

Theee AI - Use 50,000 AI tools for free online

Tarotap

Tarotap - Personalized AI tarot readings and predictions

RECENT AI TOOLS

MathGPT

Face Detector

Glambase

Aider Chat

Tidio Chat

RECENT AI NEWS

El Capitan Tops Supercomputer Rankings, Powered by AMD Instinct Chips

Logo Creator: New AI-Powered Design Tool Simplifies Logo Creation Process

AWS Launches Multi-Agent Orchestrator for Managing AI Agents

Microsoft Ignite Conference Unveils Copilot Actions and Multiple AI Enhancements

Microsoft Launches Windows 365 Link, a New Option for Cloud Mini PCs

Niantic Develops Large-Scale Geospatial Models to Redefine Real-World Interactions

Google Gemini Update: Personalized Memory Feature Launched

OpenAI Launches Advanced Voice Mode for ChatGPT Web Version

RECENT AI TOOLS