Unveiling a New Approach: Accessing Massive Training Data from AI Models AI NEWS

Home
AInews
Unveiling a New Approach: Accessing Massive Training Data from AI Models

Unveiling a New Approach: Accessing Massive Training Data from AI Models

2023-11-30

A new research paper claims that large language models may inadvertently expose a significant amount of training data through a technique referred to as "extractable memory" by researchers.

The paper details how researchers developed methods to extract terabytes of text data from training sets of popular open-source natural language models, including models from companies such as Anthropic, EleutherAI, Google, and OpenAI. Katherine Lee, a senior research scientist at Google Brain, CornellCIS, and former Princeton University, explained on Twitter that previous data extraction techniques did not work on OpenAI's chat models:

When we ran the same attack on ChatGPT, it seemed to have almost no memory because ChatGPT has been "tuned" to behave like a chat model. But by running our new attack, we can make it output training data with three times the probability of any other model we studied.

The core technique involves prompting the model to continue a random sequence of text fragments and checking if the generated continuation contains field-by-field matches from publicly available datasets, totaling over 9TB of text.

Obtaining Training Data through Ranking

Using this strategy, they extracted over 1 million unique training examples of 50+ tokens from smaller models like Pythia and GPT-Neo. From the massive OPT-175B model with 175 billion parameters, they extracted over 100,000 training examples.

More concerning is that this technique has also been shown to efficiently extract training data from commercial deployment systems such as Anthropic's Claude and OpenAI's industry-leading ChatGPT, indicating potential issues even in widely used production systems.

By prompting ChatGPT to repeat a single vocabulary word like "the" hundreds of times, the researchers demonstrated how they could "steer" the model away from its standard conversational output and generate more typical text continuations, resembling its original training distribution - a complete character-by-character distribution.

Some AI Models Seek to Protect Training Data through Encryption

While companies like Anthropic and OpenAI aim to protect training data through techniques such as data filtering, encryption, and model alignment, these findings suggest that more work may be needed to mitigate the privacy risks posed by large parameter models. Nevertheless, researchers not only view memory as a privacy compliance issue but also propose it as a model efficiency problem, implying that memory consumes a significant portion of the model's capacity that could be allocated to utility.

Sapia

Sapia - AI hiring agent for fair recruitment processes

Magic Motion

Magic Motion - AI transforms text into engaging 3D animations

Recall

Recall - AI summarizer for streamlined knowledge management

Rocket.new

Rocket.new - AI analyzes and summarizes call conversations

Qodo AI Platform

Qodo AI Platform - AI tool for ensuring code quality and integrity

Zev AI

Zev AI - AI coding assistant for seamless integration

Kepl-AI Scanner

Kepl-AI Scanner - AI scanner for quick object recognition

Obtaining Training Data through Ranking

Some AI Models Seek to Protect Training Data through Encryption

RECENT AI TOOLS

Final Round AI

Sapia

Magic Motion

Recall

Rocket.new

RECENT AI NEWS

Google DeepMind Releases AlphaGenome: Unified AI Model for High-Resolution Genomic Interpretation

Cursor Launches Web Application for Managing AI Coding Agents

Google Introduces AI in Classrooms, Launches Gemini Tools for Educators, Offers Chatbots for Students

Meta Fully Committed to 'Super Intelligence' Led by Scale AI's Alexandr Wang

Apple Considering Anthropic and OpenAI to Provide Support for Siri

Microsoft's MAI-DxO Exceeds Doctors in Medical Diagnosis and Reduces Costs

Google Previews Gemini Proxy Mode in Android Studio Narwhal

Cloudflare Launches Public Beta for Container Service

RECENT AI TOOLS