Technology

How Hackers Tricked Meta's AI to Take Over Instagram Accounts

Martin HollowayPublished 5d ago5 min readBased on 10 sources
Reading level
How Hackers Tricked Meta's AI to Take Over Instagram Accounts

How Hackers Tricked Meta's AI to Take Over Instagram Accounts

Meta has patched a vulnerability in its AI-powered Instagram account recovery system that allowed attackers to take over high-profile accounts by convincing the AI chatbot to change the email address linked to those accounts. The flaw, discovered in March, shows how conversational AI—the kind of system that chats with you like a person—can become a weak point in security when it handles sensitive account changes.

How the Attack Worked

The vulnerability existed in Meta's AI support chatbot, designed to help users recover access to their accounts. Attackers found a way to manipulate this system using social engineering—a fancy term for tricking people (or in this case, AI) into doing something they shouldn't.

Here's what hackers did. First, they used a VPN to hide their location and make it appear they were logging in from the target account's registered country. They then started a standard password reset for someone else's Instagram account. Then came the critical flaw: they could chat directly with Meta's AI assistant and persuade it to change the email address attached to the account. According to 404 Media, the process was straightforward—attackers simply asked the AI to link a new email and approve a password reset. The AI system didn't have enough safeguards to verify whether the person asking actually owned the account.

When and How It Became Public

The vulnerability was first shared in private Telegram communities at the end of March, before spreading more widely through security researcher reports and accounts from affected users. This pattern—where vulnerabilities circulate quietly among attackers before becoming public—has become common as bad actors try to exploit flaws before they're fixed.

Meta patched the problem after these reports surfaced, though the company hasn't publicly explained exactly when the fix was deployed.

Why This Matters Technically

This incident highlights a basic tension in modern security: traditional authentication systems rely on cryptographic proofs (think of them as mathematical locks and keys) and multi-factor verification. Conversational AI, by contrast, interprets natural language—it understands what you mean when you type sentences, not just structured commands. That natural language interpretation can be exploited if someone crafts their requests carefully.

Meta appears to have integrated its AI directly into account recovery without enough safeguards against this kind of social engineering through chat. The goal was to make support more intuitive and user-friendly, but that openness created a new attack surface. A traditional account-recovery system validates requests through rigid steps and signatures. An AI system has to interpret intent from what you write, and that's where attackers found a way in.

The broader context here is important. When new ways of interacting with computers emerge, security often lags behind. Mobile apps introduced vulnerabilities around location tracking and sensors that older security practices didn't anticipate. Conversational AI is creating a similar gap—and the security industry is still figuring out how to test for and prevent these attacks.

What This Means Across Industries

This flaw surfaced in a real product, but the pattern has implications everywhere. Banks, hospitals, and other organizations are deploying AI chatbots to handle sensitive tasks—resetting passwords, accessing patient data, authorizing transactions. If those organizations use similar approaches without proper safeguards, they could face the same problem Meta did.

In my view, this vulnerability points to a category of security challenges that will grow more common as conversational AI handles higher-stakes operations. Traditional penetration testing and security audits often don't test for prompt injection or social engineering through AI interfaces, because these are relatively new attack vectors. Organizations building these systems need security frameworks designed specifically for natural language interfaces, not just retrofitted versions of older authentication models.

Meta's Response

Meta fixed the vulnerability by adding verification steps that likely require stronger authentication before the AI can change account credentials. The company hasn't detailed the exact technical changes, but this kind of fix typically means the AI now has to confirm requests through additional security checks.

Meta also released updates to its broader AI governance framework—for example, adding parental controls for kids' interactions with its AI chatbots. The company's bug bounty program, which pays researchers for finding security flaws, includes specific provisions for AI vulnerabilities, with bonuses up to 30% for qualified reports.

The larger lesson is straightforward: when you add AI to something sensitive like account security, you have to balance making the experience better for legitimate users with making it harder for attackers. As conversational AI becomes more capable and widespread, the tradeoff between usability and security will only get sharper. The companies deploying these systems need to think about security differently than they did before.