Arts·Commotion

Are tech giants ignoring their own policies in order to train their AI systems?

New York Times technology reporter Cade Metz shares what he's learned from investigating how companies like Google, Meta and OpenAI are harvesting everything on the Internet.

Cade Metz explains how OpenAI, Google and Meta are scraping data using highly questionable methods

CBC Arts · Posted: Apr 11, 2024 4:10 PM EDT | Last Updated: April 11

The OpenAI logo appears on a mobile phone in front of a screen showing a portion of the company website, Tuesday, Nov. 21, 2023, in New York. (AP Photo/Peter Morgan, File) (Peter Morgan/The Associated Press)

In the race to develop the smartest AI model, big companies like Google, Meta and OpenAI are going to great lengths to harvest everything already on the Internet — lengths that may violate their own policies, and even copyright law.

Host Elamin Abdelmahmoud speaks with Cade Metz, a New York Times technology reporter who's been investigating this issue, about what this means for online creators and the future of AI, and what protections could come into place.

LISTEN | Today's episode on YouTube:

You can listen to the full discussion from today's show on CBC Listen or on our podcast, Commotion with Elamin Abdelmahmoud, available wherever you get your podcasts.

Interview with Cade Metz produced by Jess Low.

Are tech giants ignoring their own policies in order to train their AI systems?

Cade Metz explains how OpenAI, Google and Meta are scraping data using highly questionable methods

Social Sharing

Related Stories