Reddit is about to charge for its API access by restricting third-party access to its data. Reddit’s CEO and co-founder, Steve Huffman, stated that the API will remain free to developers who want to build apps and bots that help people use Reddit, as well as to researchers who wish to study Reddit for strictly academic or non-commercial purposes.
However, companies that “crawl” Reddit for data and “don’t return any of that value” to users will have to pay up. As reported by The New York Times, Reddit is seeking ways to monetize its vast array of user-generated content, which has been increasingly used to train high-profile text-generating machine learning models such as OpenAI’s ChatGPT and GPT-4.
Reddit had over 430 million monthly active users across more than 1.2 million special interest communities as of 2019, 138,000 of which are active.
The Reddit corpus of data is continuously updated, making it particularly valuable, according to Huffman. He emphasized that “more than any other place on the internet, Reddit is a home for authentic conversation.” Reddit has yet to announce its API pricing details, but as the company prepares for a potential initial public offering later this year, investors will be looking for growth in revenue streams.
In related news, Reddit hopes to incorporate more AI into how the site operates, such as identifying the use of AI-generated text on Reddit and adding a label that notifies users that a comment might have come from a bot. Reddit also aims to improve its moderation tools and the third-party bots that help moderators monitor the forums.
The announcement of Reddit’s new API policy has raised eyebrows and sparked discussions about data ownership and monetization. Reddit co-founder and CEO Steve Huffman believe that Reddit data is especially valuable due to its continuously updated nature and that companies that “crawl” Reddit for data without returning any value to users should be required to pay for it. While Reddit’s API will remain free for developers and researchers who use the data for strictly academic or non-commercial purposes, other companies will have to pay for access.
The move is part of Reddit’s efforts to monetize its vast array of user-generated content. Reddit has over 430 million monthly active users across more than 1.2 million special interest communities, making it a rich source of data that can be used for various purposes, including training high-profile, text-generating machine learning models such as OpenAI’s ChatGPT and GPT-4.
Reddit’s decision to charge for access to its API follows Twitter’s recent move to restrict third-party access to its data. However, it’s worth noting that Reddit’s policy change is not a blanket one. The API will remain free for developers who build apps and bots that help people use Reddit, as well as for researchers who study Reddit for academic or non-commercial purposes.
Reddit has not yet disclosed the details of its API pricing, but the company is preparing for a potential initial public offering later this year, and investors will be looking for growth in revenue. Reddit’s estimated $350 million revenue from ads two years ago pales in comparison to Meta’s and even Twitter’s ad revenues. Meta made $113 billion in 2022, while Twitter raked in nearly $7 billion.
In related news, Reddit hopes to incorporate more AI into how the site operates, including identifying the use of AI-generated text on Reddit and adding a label that notifies users that a comment might’ve come from a bot. Reddit also aims to improve its moderation tools and third-party bots that help moderators monitor the forums. The changes are part of Reddit’s ongoing efforts to ensure that its platform remains a place for authentic conversation while preventing the spread of misinformation and hate speech.