
Meta Commences Labeling of AI-Generated Content

Meta, the parent company of Facebook and Instagram, will begin labeling AI-generated media in May, a significant shift in its content moderation strategy. The move comes amid growing concern over the proliferation of deepfakes, deceptive media created with artificial intelligence. Rather than removing such content outright, Meta plans to address it through labeling and added context, preserving freedom of expression while confronting the challenges posed by manipulated media.



The decision follows criticism from Meta's Oversight Board, an independent body that reviews the company's content moderation decisions. Citing rapid advances in AI technology and the growing sophistication of deepfakes, the board urged Meta to overhaul its approach to manipulated media.

The rise of AI-generated content has raised alarm, particularly in a year of pivotal elections, when the potential for misinformation and disinformation is heightened. Meta's "Made with AI" labels are a proactive step toward greater transparency, giving users context about the origin and nature of the content they encounter on its platforms.


Monika Bickert, Meta’s Vice President of Content Policy, stressed the importance of transparency in addressing these challenges, saying users need tools to distinguish authentic from manipulated media. In a blog post, Bickert noted that the labeling initiative goes beyond the Oversight Board's recommendations to cover a wide range of content affected by AI technologies.


The labeling effort is part of a broader push among major tech companies and AI players to combat manipulated content. Meta, along with industry counterparts such as Google and OpenAI, has committed to a common watermarking standard for identifying AI-generated images, improving consistency and interoperability across platforms.
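To make the mechanism concrete, here is a minimal sketch of how such a provenance marker might be checked. It assumes the generating tool embedded the IPTC DigitalSourceType value "trainedAlgorithmicMedia" (the signal defined in the C2PA/IPTC metadata standards) and simply scans the file's bytes for it; the function name and the naive byte scan are illustrative assumptions, not Meta's actual detection pipeline.

```python
# Sketch: detect an embedded AI-provenance marker in an image file.
# This is an illustrative assumption of how a metadata check could
# work, not Meta's real system.

import sys
from pathlib import Path

# IPTC NewsCodes term indicating media produced by a trained algorithm,
# the value C2PA/IPTC-compliant generators are expected to embed.
AI_SOURCE_TYPE = b"trainedAlgorithmicMedia"


def looks_ai_generated(image_path: str) -> bool:
    """Return True if the file's metadata carries the AI-media marker.

    A naive byte scan is used for simplicity; a production system would
    parse the XMP/C2PA manifest properly and verify its signature, since
    a raw scan can be fooled by stray bytes or stripped metadata.
    """
    data = Path(image_path).read_bytes()
    return AI_SOURCE_TYPE in data


if __name__ == "__main__":
    for path in sys.argv[1:]:
        label = "Made with AI" if looks_ai_generated(path) else "no AI marker"
        print(f"{path}: {label}")
```

As the comments note, a real pipeline would validate the full provenance manifest rather than trust a byte match, which is also why content whose metadata has been stripped or never written, as with some open-source generators, evades this kind of check.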


Despite these proactive measures, accurately identifying and labeling AI-generated content remains difficult. Nicolas Gaudemet, AI Director at Onepoint, noted that open-source tools in particular may not adopt the watermarking standards embraced by the major players in the AI industry.


Meta will roll out the labeling system in two phases. Labeling of AI-generated content begins in May 2024, and in July the company will stop removing manipulated media solely on the basis of its earlier policy, shifting to a more nuanced approach in which such content is taken down only if it violates other rules, such as those against hate speech or voter interference.


The spread of convincing AI deepfakes underscores the need for effective countermeasures to preserve the integrity of online information ecosystems. Recent incidents, including manipulated videos of prominent political figures such as U.S. President Joe Biden, demonstrate the power of altered media to shape public perception and erode trust.


In light of these developments, Meta's commitment to transparency and to combating manipulated content is a significant step toward meeting the evolving challenges posed by AI technology. Still, sustained vigilance and collaboration across the tech industry will be essential to mitigate the risks of AI-generated media and protect the integrity of online discourse in the digital age.
