Welcome to WarBulletin - your new best friend in the world of gaming. We're all about bringing you the hottest updates and juicy insights from across the gaming universe. Are you into epic RPG adventures or fast-paced eSports? We've got you covered with the latest scoop on everything from next-level PC gaming rigs to the coolest game releases. But hey, we're more than just news! Ever wondered what goes on behind the scenes of your favorite games? We're talking exclusive interviews with the brains behind the games, fresh off-the-press photos and videos straight from gaming conventions, and, of course, breaking news that you just can't miss. We know you love gaming 24/7, and that's why we're here round the clock, updating you on all things gaming. Whether it's the lowdown on a new patch or the buzz about the next big gaming celeb, we're on it.

Contacts

  • Owner: SNOWLAND s.r.o.
  • Registration certificate 06691200
  • 16200, Na okraji 381/41, Veleslavín, 162 00 Praha 6
  • Czech Republic

A leaked document indicates Runway's Gen-3 AI video generation tool may have been trained on YouTube videos and copyrighted content without permission

Here's a question that can throw a generative AI company into a twist: «What content has been used to train your models?» While some opt to dodge the question, and others bullishly front out the issue entirely, the question of whether an AI company has scraped content for its own business purposes without permission is a thorny one. 

At best, you're likely to get a mealy-mouthed explanation of «curated datasets», and at worst, a polemic about whether everything on the internet is essentially fair game.

Now a document obtained by 404media appears to show that part of the data used to train Runway's latest AI video generation tool, Gen-3, may have come from the YouTube channels of thousands of popular media companies, including Pixar, Netflix, Disney and Sony.

While 404media doesn't go into details as to how the document was obtained, nor could it verify that every video mentioned within was used to train Gen-3, it's potentially an insight into the sort of practices that an AI company might use to scrape copyrighted material to train its models.

A former Runway employee spoke to 404media about the methodology involved. The 14 spreadsheets contained within the leaked document are said to feature terms like «beach» or «rain», with the names of Runway employees next to them. 

According to the source, these names were said to be employees tasked with finding videos or channels related to these keywords, who would then go on to use a YouTube video downloader tool via a proxy to scrape them from the site without being blocked by Google.

It's not just YouTube content that looks to have been scraped, either. A spreadsheet containing 14 links to non-YouTube sources, including a link to a website dedicated to streaming popular cartoons and animated movies, with thousands of copyright complaints logged against it. 

Keep up to date with the most important stories and the best deals, as picked by the PC Gamer team.

Essentially, pirated media looks to have been at least under

Read more on pcgamer.com