Reddit Strikes $60 Million Licensing Deal to Reportedly Train Google's AI Models

Martina Bretous

Published: February 27, 2024

About a week ago, Bloomberg reported that Reddit had just signed a huge licensing deal ahead of its IPO, allowing an unnamed company to train its AI models on their data.

Reddit Strikes $60 Million Licensing Deal to Reportedly Train Google's AI Models

A new report says that company is Google, although neither party has confirmed it. If true, this would be Reddit's first content deal.

Why is every AI company looking for licensing deals?

Since the AI race started, getting access to large, quality datasets has been a top priority.

AI models are trained on data – the more data it’s trained on, the better the output. In addition to quantity, there’s also a quality perspective. AI models want access to high-quality data that their competitors ideally don’t have access to.

This is where publishers like Reddit come in.

For a long time, OpenAI and other AI companies were freely roaming through publishers’ data. That was until publishers like The New York Times and Reddit caught on.

Last April, Reddit said, “If you want access to an 18-year deep well of data, you’re going to have to pay up.”

The NYT, on the other hand, just said, “no.” (And they’re suing OpenAI for allegedly still doing it.)

Now close to a year later, Google, Apple, and OpenAI have all signed licensing agreements with huge publishers worth $100+ million.

The latest to join is Reddit who reportedly signed with Google, in a deal worth $60 million annually. This deal likely has an exclusivity clause, ensuring that only Google has access to this data, however that hasn’t been confirmed.

With an upcoming IPO, Reddit’s CEO Steve Huffman shared the company had earned over $200 million in licensing deals.

“Reddit’s vast and unmatched archive of real, timely, and relevant human conversation on literally any topic is an invaluable dataset for a variety of purposes, including search, AI training, and research,” wrote Huffman in their S-1 filing.

This would also be a huge win for Google who’s been trying to dethrone OpenAI for years.

Should AI licensing deals come with guardrails?

Some see licensing deals as a win-win: Publishers get paid for their data while AI companies get access to large, quality datasets.

However, it also comes with some setbacks.

Social media platforms like Reddit and X are community forums where people can write just about anything. Conspiracy theories, misinformation, and hateful rhetoric.

X user disapproves of Reddit's AI licensing deal with Google

Image Source

And although Reddit does have content moderators and policies, they only introduced a ban on hate speech 15 years after the site was founded.

Is that what AI models should be trained on?

AI companies can clean their data to filter out this type of content but there’s no clear standard that every model is built on. So, as a consumer, I won’t know what data models were trained on and how well they’ve been “cleaned.”

So, it begs the question: Should some websites be off the table when it comes to training AI models? And what guardrails are in place to ensure their models aren’t regurgitating the darkest content on the internet?

These answers are still up in the air.

Topics: Artificial Intelligence

Klarna’s AI Assistant Does the Job of 700 Customer Service Agents

Mar 05, 2024
New ElevenLabs Feature Empowers Voice Actors to Charge for Usage

Feb 27, 2024
Is AI in Need of a Rating System? OpenAI Partners with Common Sense Media

Feb 27, 2024
OpenAI is Launching Text-to-Video AI Model Sora

Feb 20, 2024
Gemini Ultra is Here and Bard Gets a Name Change

Feb 13, 2024
I Used AI To Plan My Wedding: Here's How It Went

Feb 13, 2024
Where AI Regulation Stands in the EU, According to a Tech Lawyer

Feb 06, 2024
A Lying AI Committed Insider Trading. Can Rogue LLMs be Fixed?

Feb 06, 2024
The NYT Is Building an AI Team: Unpacking The State of Publishing

Feb 01, 2024
How AI is Impacting the Job Market, According to a LinkedIn Report

Jan 30, 2024

Blogs

Blogs

Marketing

Sales

Service

Website

The Hustle

Next in AI

Instagram Marketing

Customer Retention

Email Marketing

SEO

Sales Prospecting

Newsletters

Newsletters

The Hustle

Videos

Videos

The Hustle

Marketing with HubSpot

My First Million

Marketing Against the Grain

HubSpot

Podcasts

Podcasts

My First Million

Goal Digger

The Hustle Daily Show

Another Bite

Business Made Simple

Marketing Against the Grain

Online Marketing Made Easy

The Product Boss

Nudge

Side Hustle Pro

Outbound Squad

Resources

Resources

Academy

Templates

Ebooks

Kits

Tools

HubSpot Products

The HubSpot Customer Platform

Free HubSpot CRM

Overview of all products

Marketing Hub

Sales Hub

Service Hub

CMS Hub

Operations Hub

Commerce Hub

About HubSpot

Contact Us

Customer Support

Log in

日本語

Deutsch

English

Español

Português

Français

Reddit Strikes $60 Million Licensing Deal to Reportedly Train Google's AI Models

Why is every AI company looking for licensing deals?

Should AI licensing deals come with guardrails?

Don't forget to share this post!

Klarna’s AI Assistant Does the Job of 700 Customer Service Agents

New ElevenLabs Feature Empowers Voice Actors to Charge for Usage

Is AI in Need of a Rating System? OpenAI Partners with Common Sense Media

OpenAI is Launching Text-to-Video AI Model Sora

Gemini Ultra is Here and Bard Gets a Name Change

I Used AI To Plan My Wedding: Here's How It Went

Where AI Regulation Stands in the EU, According to a Tech Lawyer

A Lying AI Committed Insider Trading. Can Rogue LLMs be Fixed?

The NYT Is Building an AI Team: Unpacking The State of Publishing

How AI is Impacting the Job Market, According to a LinkedIn Report