Business

Is OpenAI using your content without permission?

The number of organizations accusing OpenAI of stealing their work continues to grow like extra patties on a burger, with a prominent news organization now joining the fray with its own set of claims against the Microsoft-backed artificial intelligence startup.

In a lawsuit filed against OpenAI, the Center for Investigative Reporting, the oldest nonprofit newsroom in the US, claims the ChatGPT maker used its investigative journalism to train and enhance its generative AI product without permission or compensation.

It’s a tale as old as time.

Ever since ChatGPT hit the scene, different quarters of the internet have been raising alarm bells over the data used to train generative AI, often, without permission. You’ve got artistsmusic labelsauthors, heck, even programmers, who have either sued or complained against the company for allegedly using their work to build ChatGPT and its derivatives.

“This free rider behavior is not only unfair, it is a violation of copyright,” Monika Bauerlein, CEO of the Center for Investigative Reporting, said in a statement.

Free rider behavior is perhaps the best way to describe what companies developing AI are doing.

Take Meta, for example. The social media giant admitted to using users’ Facebook and Instagram posts to develop an AI assistant. Meanwhile, ChatGPT has been found to produce verbatim paragraphs from novels, complete verbatim copies of poems, and even articles from The New York Times!

In fact, CopyLeaks estimates that nearly 60% of the responses provided by GPT-3.5 (which is the model behind ChatGPT) contain some form of plagiarized content, the Center for Investigative Reporting says.

Grim, isn’t it?

At this point, the entire output of humanity, creative or otherwise, is apparently a valid target for AI companies. The question then is, are gen AI companies just profiteering off of our work? Evidence seems to suggest so.

Reddit, for example, has already struck a deal with both OpenAI and Google to let them use content from its platform to make their AI products better. There’s an age old adage: the rich get richer, while the poor get poorer. That seems to fit with Reddit’s partnership with OpenAI and Google, as the company will earn millions of dollars off of the deals but will likely never share its earnings with the users whose posts are gobbled up by OpenAI and Google to fine tune their AI models.

OpenAI also has similar arrangements with the Associated PressAxel Springer, and TIME magazine to use up journalists’ work to (probably) make ChatGPT even better. Other tech companies probably have something lined up with major publications as well.

This means that people who create will be left to do the heavy lifting while some tech bro is going to feed all that raw material to produce more powerful generative AI products, likely without permission or compensation.

The Center for Investigative Reporting is one of a handful of organizations that have taken OpenAI to court, joining the likes of The New York Times and others like it for allegedly infringing on its copyrights.

Suing OpenAI is not cheap, though. As The Verge reports, The NYT has raked up $1 million in legal costs during Q1 after it began its legal action, and there’s no telling how long this entire saga will play out — assuming both parties don’t end up settling out of court.

However, the case(s) are perhaps significant in that they could determine how AI operates within the bounds of copyright. Until then, I guess OpenAI is going to be sailing the high seas. 🏴‍☠️ 🏴‍☠️ 🏴‍☠️ ☠️☠️☠️ #IYKYK 😉

OpenAI backer Microsoft topped HackerNoon’s Tech Company Rankings this week.


In Other News.. 📰

  • Crypto Industry Is About to Boom, Is Outperforming the Internet: Architect Partners — via CoinDesk
  • Figma disables its AI design feature that appeared to be ripping off Apple’s Weather app — via TechCrunch
  • Meta accused of breaking European law with its ‘pay or consent’ model — via CNN
  • OnlyFans vows it’s a safe space. Predators are exploiting kids there. — via Reuters
  • Meta’s Threads turns one, has more than 175 million active users — via Axios
  • China’s BYD is set to take Tesla’s crown as the world’s No. 1 producer of battery electric vehicles — via CNBC

And that’s a wrap! Don’t forget to share this newsletter with your family and friends

See y’all next week. PEACE! ☮️


This article was originally published by Sheharyar Khan on HackerNoon.

HackerNoon

Recent Posts

WEF ‘Summer Davos’ in China to tackle transhumanism, AI & One Health agendas

The program agenda for the World Economic Forum's (WEF) 16th Annual Meeting of the New…

2 days ago

10 design and architecture writers to put on your radar this year

It’s easy to get caught up in the visuals—perfectly styled rooms, dramatic before-and-afters, bold architectural…

4 days ago

Elon Musk Turns News Into a Bet — Is This the Future of Honest Media?

Polymarket and xAI have created a feedback loop where headlines aren’t written - they’re traded.…

4 days ago

10 thoughtful gifts for the man who says he wants nothing, but deserves everything: Dad.

Father’s Day is just around the corner, and so is the age-old question: what do…

5 days ago

Why software release speeds are being throttled 

As the race for innovation continues, experts have flagged that how well an enterprise is…

5 days ago

As both recruiters and candidates suffer from fatigue, SF-based Goldbridge.ai has a solution 

Last week the Bureau of Labor Statistics released its latest U.S. employment figures. On one…

6 days ago