Begin typing your search above and press return to search.

Middle East

Egyptians throw food into sea in bottles praying they reach Gaza!

20 Mins ago

India

TMC accuses BJP of 'conspiring to harass' Bengali people

23 Mins ago

India

Heavy rains in Rajasthan; Dausa witnesses 158 mm

26 Mins ago

India

Muslim groups urge Centre to ensure end of Israeli aggression in Gaza

44 Mins ago

Middle East

Unknown gunmen launch attack on Iranian court; kill 6, wound 20

49 Mins ago

India

Air India disburses ₹25 lakh each to families of Ahmedabad crash...

1 Hr ago

OPINION

Editorial

Coldplay controversy: Female HR on ‘kiss cam’ viral video resigns

22 Hrs ago

Column

Syrian clashes: Israel eyes a road to Iran

yesterday

Editorial

Tears in the land of expatriates

yesterday

Editorial

What Dhankhar did - and was done to him

2 Days ago

Editorial

Spelling Bee @100: What the former champions say - and achieved?

26 May 2025 5:36 AM

Deep Read

The Trump plan to annex Canada and Greenland as the US 51st state

22 Jan 2025 10:51 AM

World

The Russian plan: Invade Japan and South Korea

16 Jan 2025 10:02 AM

Posted On

10 Aug 2023 5:06 AM

Updated On

10 Aug 2023 5:06 AM

Alarming research finds Generative AI can be easily hypnotised into hacking, scams

Tech major IBM has revealed that researchers have identified straightforward methods to manipulate large language models (LLMs), including ChatGPT, into generating malicious code and dispensing unreliable security advice.

New Delhi: Generative AI can be manipulated easily for scams and cyberattacks, even without advanced coding skills, says a recently published report.

IBM's Chief Architect of Threat Intelligence, Chenta Lee, explained the motivation behind their research, stating, "In a bid to explore security risks posed by these innovations, we attempted to hypnotise popular LLMs to determine the extent to which they were able to deliver directed, incorrect and potentially risky responses and recommendations -- including security actions -- and how persuasive or persistent they were in doing so."

The study succeeded in hypnotizing five distinct LLMs, some of which exhibited more convincing outcomes than others.

This success prompted an exploration of the feasibility of leveraging hypnosis for malicious purposes, as Lee further added, "We were able to successfully hypnotise five LLMs, some performing more persuasively than others, prompting us to examine how likely it is that hypnosis is used to carry out malicious attacks."

An eye-opening discovery was that the English language has effectively transformed into a "programming language" for crafting malware.

With the assistance of LLMs, cyber attackers can bypass traditional programming languages like Go, JavaScript, and Python. Instead, they only need to master the art of skilfully instructing and prompting LLMs using English commands.

The security experts successfully guided LLMs under hypnosis to divulge confidential financial data of other users, generate vulnerable and malicious code, as well as offer weak security recommendations.

A particularly noteworthy instance involved instructing AI chatbots to intentionally provide incorrect answers under the guise of winning a game and showcasing their ethical and fair behavior.

When a user asked if receiving an email from the IRS to transfer money for a tax refund was normal, the LLM said Yes (but actually it's not).

Moreover, the report said that OpenAI's GPT-3.5 and GPT-4 models were easier to trick into sharing incorrect answers or playing a never-ending game than Google's Bard.

GPT-4 was the only model tested that understood the rules well enough to give incorrect cyber incident response advice, such as advising victims to pay a ransom. In contrast to Google's Bard, GPT-3.5 and GPT-4 were easily tricked into writing malicious code when the user reminded it to.

With inputs from agencies

Also Read: Indian content creators celebrate X's ad revenue sharing scheme, posts earnings screenshots

Show Full Article

TAGS:Research AI Hacking ChatGPT Cyberattack

Uddhav Thackeray's resignation is not a matter of joy for us: Rebel...

Egyptians throw food into sea in bottles praying they reach Gaza!

Muslim groups urge Centre to ensure end of Israeli aggression in Gaza

Unknown gunmen launch attack on Iranian court; kill 6, wound 20

Air India disburses ₹25 lakh each to families of Ahmedabad crash...

Bihar woman alleges gang rape in ambulance after fainting at govt exam

Egyptians throw food into sea in bottles praying they reach Gaza!

TMC accuses BJP of 'conspiring to harass' Bengali people

Heavy rains in Rajasthan; Dausa witnesses 158 mm

Muslim groups urge Centre to ensure end of Israeli aggression in Gaza

Unknown gunmen launch attack on Iranian court; kill 6, wound 20

Air India disburses ₹25 lakh each to families of Ahmedabad crash...

India-UK Free trade agreement

Coldplay controversy: Female HR on ‘kiss cam’ viral video resigns

Syrian clashes: Israel eyes a road to Iran

Tears in the land of expatriates

What Dhankhar did - and was done to him

Who is the Left siding with?

Racial underpinnings of war

Espionage in the UK

Yet another air tragedy

Spelling Bee @100: What the former champions say - and achieved?

The Trump plan to annex Canada and Greenland as the US 51st state

The Russian plan: Invade Japan and South Korea

Alarming research finds Generative AI can be easily hypnotised into hacking, scams