AwareForce

Vishing: 3 Examples of How Voice Cloning is Making it Easier Than Ever

Three shocking cases of AI voice cloning and vishing attacks: from celebrity deepfakes to a $35 million heist, exposing the risks of these cyber...

The line between reality and fabrication is blurring, and the age of AI and deepfakes is advancing at a spectacular pace. ChatGPT’s website gets 1.5 billion visitors a month, and ChatGPT 5, expected to be an order of magnitude more capable, is anticipated in about a year. AI has revolutionized the entertainment industry, but it has also opened up new avenues for criminals. One high-profile example is the use of AI voice cloning for vishing attacks.

An alarming scenario of a vishing attack

You receive a frantic phone call: Mom is on a vacation trip, and she’s in trouble. Her voice, laced with fear and desperation, tells you that she’s been detained in a foreign country. To secure her release, she needs money transferred to an overseas account immediately…

Overwhelmed by shock and concern, you act instinctively. Without questioning the call’s authenticity, you proceed with the wire transfer to get her out. It is a vishing scam, and scams like it cost even technically adept, well-informed victims significant sums of money.

The voice on the other end sounded just like hers. The caller ID might have displayed something like “Police Unit.” Everything seemed so real, so urgent.

This is the dystopian world we’re entering, where everyday scam calls are tailored and engineered like a heist out of Mission: Impossible. And it’s all so easy for perpetrators.

In this article, we outline three cases where voice cloning is used.

What is AI Voice Cloning?

AI voice cloning creates realistic replicas of a human voice by training a tool, which is cheap and easy to find, on a few seconds of a voice sample. The sample can be captured from posts on social media.

AI analyzes as little as three seconds of audio and replicates the unique vocal patterns, intonation, and speech cadence.

Once trained, it synthesizes new audio content that mimics the target individual’s voice, making it virtually indistinguishable from the real person. The similarity is remarkable.

And what exactly is vishing?

Vishing, a combination of “voice” and “phishing,” is a social engineering attack that utilizes voice calls to deceive individuals into revealing sensitive information or transferring funds. It exploits the power of human empathy by impersonating trusted individuals (family members, bank representatives, or law enforcement officials).

AI voice cloning has elevated vishing attacks to a new level. Scammers can easily replicate voices and bypass traditional authentication methods, convincing victims that the call comes from a trusted source.

Three Examples of AI Voice Cloning Being Used for Vishing Attacks

The audio quality is jaw-dropping, and the cases range from a viral video impersonating a celebrity to a $35 million heist. The implications of this technology are so far-reaching that employees should be reminded often about how it works.

Case 1: AI Joe Rogan Promotes Libido Booster for Men in Illegal Deepfake Video

In 2023, a deepfake video surfaced online featuring podcaster Joe Rogan endorsing a male libido booster supplement. The video, meticulously crafted using AI voice cloning, was so convincing that it fooled many viewers.

BREAKING: This AI deepfake video rendering of Joe Rogan is going viral on TikTok. This is the start of massive waves of new scams & misinformation!

If only there was a decentralized GPU compute/rendering network that can authenticate provenance for IP. $RNDR https://t.co/m5WQvJjEuF pic.twitter.com/VqtQgEFcyJ
— MachineAlpha ⭕️ (@Machine4lpha) February 13, 2023

In a podcast episode from October, Rogan addressed the issue of deepfakes and said that he was “disappointed” that his likeness had been used in a fraudulent video. He also warned his listeners to be wary of such videos and to do their research before believing anything they see online.

This incident highlights how AI voice cloning can be misused to impersonate celebrities, spread misinformation, and promote fraudulent products or services.

Case 2: Voice Cloning Heist

In a case exposed by Red Goat, a group of cybercriminals employed AI voice cloning to perpetrate a multi-million dollar heist.

Although victims’ names were undisclosed, it is known that the Ministry of Justice of the United Arab Emirates submitted a request for assistance from the Criminal Division of the U.S. Department of Justice.

According to court documents, the victim company’s branch manager received a phone call from someone claiming to be from the company headquarters. The caller’s voice was so similar to a company director’s that the branch manager believed the call was legitimate.

Using voice and email, the caller informed the branch manager that the company was about to make an acquisition and that a lawyer named “Martin Zelner” had been authorized to oversee the process.

The manager received multiple emails from Zelner regarding the acquisition, including a letter of authorization from the director to Zelner. As a result of these communications, when Zelner requested that the branch manager transfer $35 million to various accounts as part of the acquisition, the branch manager complied.

Case 3: Scammers Use AI to Mimic Voices of Loved Ones in Distress

In 2023, a particularly disturbing vishing tactic emerged: using AI to mimic the voices of distressed loved ones. Scammers contact victims, impersonating their children or other family members, and claim to be in dire need of financial assistance due to an arrest or medical emergency.

These scams are becoming increasingly common because it is so easy to train AI voice models.

This case underscores the emotional manipulation employed by vishing scammers, preying on victims’ vulnerabilities and exploiting their desire to help loved ones in distress.

What now?

Have you noticed an increase in spam and scam calls lately?

You have. According to a 2022 report by the app Truecaller, the number of spam and scam calls in the US has risen steadily over the past several years. Last year alone, 31% of all calls received by US residents were spam or scam calls, up from 18% in 2018.

(Graph: percentage of calls that were scam or spam, by year, showing the increase over the past 5 years)
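To put the two Truecaller figures quoted above in perspective, here is a small worked calculation. It is only a sketch of the arithmetic: the variable names are ours, and the only inputs are the 18% and 31% shares cited in this article.

```python
# Shares of US calls that were spam or scam, as quoted above from the Truecaller report.
share_2018 = 0.18       # 2018 figure
share_last_year = 0.31  # "last year" figure in the 2022 report

# Absolute change, in percentage points.
point_change = (share_last_year - share_2018) * 100

# Relative change: how much the share grew compared with where it started.
relative_change = (share_last_year - share_2018) / share_2018 * 100

print(f"Increase: {point_change:.0f} percentage points")
print(f"Relative growth: {relative_change:.0f}%")
```

In other words, the share rose by 13 percentage points, which is roughly 72% growth relative to the 2018 level.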

And surprise: 20- and 30-somethings are more likely to be victims

Another finding of the Truecaller report was that younger Americans are more susceptible to scam calls than their older counterparts: in 2021, 41% of Americans aged 18-24 received a scam call, compared to 20% of Americans aged 65 and over.

The Implications of AI Voice Cloning for Vishing Attacks

Factors like the rise of robocalls, the availability of cheap calling technology, and the increasing sophistication of scam techniques powered by AI have made it easier for criminals to adopt this type of attack.

And the implications of AI voice cloning extend far beyond financial losses. The attacks can cause significant emotional distress and shame, damage reputations, and undermine public trust in institutions and individuals.

Awareness and Prevention are Key

As a countermeasure, cybersecurity awareness professionals should continuously educate employees about vishing tactics and empower individuals with the knowledge and tools to protect themselves from these deceptive calls.

6 tips for employees to protect against voice cloning scams

For your employees, here are measures to keep themselves, their employer, and their family members safe:

If you get an emergency call from a relative asking for help and money, ask a question only the two of you would know the answer to.
Call the person back to verify the authenticity of the call.
Set social media accounts to private so the information you share publicly cannot be used in a scam.
Remember that the number on caller ID can easily be spoofed.
Don’t press “1” or any other number when prompted.
Never give out personal information over the phone, such as Social Security, bank account, or credit card numbers.
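
For awareness teams that like to turn guidance into a reusable checklist, the tips above can be sketched as a few lines of Python. This is an illustration only, not a vetted security control: the CallReport fields and the red_flags and should_act helpers are hypothetical names we made up to mirror the tips, and caller ID is deliberately left out because it can be spoofed.

```python
from dataclasses import dataclass


@dataclass
class CallReport:
    """What the person observed during a suspicious call (hypothetical fields)."""
    answered_private_question: bool    # caller answered a question only the real person would know
    verified_by_callback: bool         # you called the person back and confirmed the request
    prompted_to_press_key: bool        # e.g. "press 1 to continue"
    asked_for_sensitive_numbers: bool  # Social Security, bank account, or credit card numbers


def red_flags(report: CallReport) -> list[str]:
    """Return the warning signs from the tips above that apply to this call."""
    flags = []
    if not report.answered_private_question:
        flags.append("Caller did not answer a question only the real person would know")
    if not report.verified_by_callback:
        flags.append("Request was not verified by calling the person back")
    if report.prompted_to_press_key:
        flags.append("Call prompted you to press a key")
    if report.asked_for_sensitive_numbers:
        flags.append("Caller asked for personal information over the phone")
    return flags


def should_act(report: CallReport) -> bool:
    """Act on the request only when no red flags remain."""
    return not red_flags(report)


# Example: an urgent plea from "Mom" that has not been verified in any way.
urgent_call = CallReport(
    answered_private_question=False,
    verified_by_callback=False,
    prompted_to_press_key=False,
    asked_for_sensitive_numbers=False,
)
for flag in red_flags(urgent_call):
    print("Red flag:", flag)
print("Safe to act:", should_act(urgent_call))
```

The design choice here is that a convincing voice earns no credit at all: the request is acted on only after the private question and the callback both check out.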

How to keep cybersecurity top of mind in an organization

CISOs and cybersecurity teams are under tremendous pressure and must compete for qualified employees. While technical tools exist to test and track employees’ cybersecurity prowess, the key to engaging employees isn’t automation.

It’s actionable videos, quizzes, infographics, the latest cyber news, and answers to common questions written in a style they can understand and share with their families.

Aware Force delivers that service year-round. It’s easy to use and cost-effective. And everything we deliver is branded and customized, so all the content comes from your company’s IT team.

Aware Force generates unsolicited praise from employees and fierce loyalty from our customers. Check out our extensive cyber library and our awesome twice-monthly cybersecurity newsletter — all branded and tailored for you.


Richard Warner is a recognized expert on human cyber risk and the founder/CEO of Aware Force, where he and his team create cybersecurity content tailored to each client’s culture that is engaging, relatable, and effective.

Leveraging his decades of experience as a prominent journalist and communicator with outlets including FOX and the GPB Television Network, Richard helps organizations worldwide transform human weak links into their strongest digital defense.

He is based in Atlanta and pioneers effective strategies for security culture and employee engagement.