OpenAI is pushing back against a court ruling mandating it to hand over 20 million user chat logs to The New York Times and other media plaintiffs in an ongoing lawsuit related to alleged copyright infringement. Originally, OpenAI proposed providing these 20 million logs as an alternative to the NYT’s initial demand for 120 million. However, OpenAI argues that complying with the court’s order is excessively broad and risks exposing private user information.
In its recent filing with the U.S. District Court for the Southern District of New York, OpenAI emphasized that the logs in question represent entire conversations between users and ChatGPT, not just isolated input-output pairs. The company contends that sharing full conversations increases the likelihood of revealing sensitive information, comparing it to the difference between overhearing an entire conversation and just a brief five-second snippet.
According to OpenAI, over 99.99% of the chats are unrelated to the case, which centers around potential copyright violations. OpenAI is seeking the court’s agreement to vacate the existing order and is pushing for the news plaintiffs to engage with its proposal to identify relevant conversations. The company has indicated it may further pursue the matter in a federal court of appeals if necessary. Additional details about this ongoing legal battle can be explored here.
The legal community is closely watching the case as it raises significant concerns about user privacy, data protection, and the proportionality of discovery orders in litigation, particularly concerning emerging technologies like artificial intelligence. The decision could set a precedent on how confidential user data is treated in legal contexts, especially when weighed against claims of copyright infringement. The outcome may influence future cases where technology companies face demands to disclose large volumes of user-generated data.