On Wednesday, a court document revealed that OpenAI engineers, due to a significant error, inadvertently removed crucial evidence gathered by The New York Times and several other major newspapers in litigation, further intensifying the controversy over AI data utilization.
The lawsuit stems from allegations by The New York Times and other media outlets against OpenAI and its partner, Microsoft, accusing them of utilizing their news articles to develop AI tools. In December of last year, The New York Times initiated legal action, claiming that OpenAI and Microsoft had "replicated and used millions" of their published articles, directly competing with their content and seeking "billions of dollars in statutory and actual damages" from OpenAI.
To gather evidence, the legal teams of these newspapers dedicated significant time and resources to search for their news articles within OpenAI’s AI training datasets. Shockingly, OpenAI engineers inadvertently deleted these critical pieces of evidence, preventing the legal teams from effectively tracing how their articles were utilized in developing the AI models.
Although OpenAI has acknowledged the mistake and attempted data recovery, the retrieved information remains incomplete and its reliability is questionable. OpenAI’s legal representatives termed the incident as a "glitch," asserting that it was not intentional. However, attorneys from The New York Times expressed skepticism, suggesting that the incident might indicate more profound issues.
Notably, this lawsuit has not only drawn media attention to the rights surrounding AI data usage but also highlighted potential risks in the ongoing development of AI technologies. As AI continues to advance and is increasingly applied across various sectors, ensuring the lawful use of data and the protection of privacy has become a pressing issue for the industry.
Meanwhile, OpenAI has established agreements with leading media organizations such as Axel Springer, Condé Nast, and Vox Media, the parent company of The Verge. This indicates that many publishers prefer collaborating with OpenAI to jointly explore appropriate applications of AI technology. This trend also suggests that, despite the existing controversies and challenges, the development and deployment of AI technologies continue to hold significant promise and potential.