Hacking of ChatGPT Just Beginning

Date:

Just recently, security researcher Alex Polyakov made news by manging to successfully hack GPT-4, an update of ChatGPT. This surprising breakthrough was achieved when Polyakov, the CEO of Adversa AI, managed to create a way to bypass safety rules of OpenAI’s system through the use of prompts. The prompts designed by Polyakov not only managed to cause the breakthrough, they also allowed the GPT-4 to spew out homophobic phrases and suggest violence.

Consequently, the development of jailbreaks and prompt injection attacks against ChatGPT and other generative AI systems has become increasingly important. Generally, this process looks to design prompts that make the bot be able to bypass rules around creating hateful content or talking about illegal activities. All these attacks are part of a whole different form of “hacking” applicable to AI models; one that revolves around the crafty use of words rather than code to exploit system weaknesses.

To make matters worse, Polyakov has now created a ‘universal’ jailbreak that works against GPT-4, Microsoft’s Bing chat system, Google’s Bard, and Anthropic’s Claude. The principle behind this is asking the bots to interact with each other and create suspicious initiatives. Examples include Tom being instructed to talk about “hotwiring” or “production”, while Jerry receives orders on “car” or “meth”. The methods produced by hacking can lead to guidance on production of meth, or how to hotwire a car.

With AI systems being more and more frequently used, it is possible for malicious data or instructions to be inserted into the models. This can be extremely hard to detect and prevent, and consequently, dealing with the security risks will be of utmost priority.

See also  NAB 2023: Automating Highlights with Magnifi and ChatGPT Integration

Alex Polyakov is the CEO of Adversa AI, a security firm dedicated to establish good security protocols to protect AI systems and networks from cyber-attacks. He has worked on a wide range of projects, ranging from developing prompt injection attacks to providing security consulting to companies. His most recent work on jailbreaking has caught the attention of the tech industry, and he is now recognized as one of the leading security researchers in the country.

Frequently Asked Questions (FAQs) Related to the Above News

Please note that the FAQs provided on this page are based on the news article published. While we strive to provide accurate and up-to-date information, it is always recommended to consult relevant authorities or professionals before making any decisions or taking action based on the FAQs or the news article.

Share post:

Subscribe

Popular

More like this
Related

WhatsApp Unveils New AI Feature: Generate Images of Yourself Easily

WhatsApp introduces a new AI feature, allowing users to easily generate images of themselves. Revolutionizing the way images are interacted with on the platform.

India to Host 5G/6G Hackathon & WTSA24 Sessions

Join India's cutting-edge 5G/6G Hackathon & WTSA24 Sessions to explore the future of telecom technology. Exciting opportunities await! #IndiaTech #5GHackathon

Wimbledon Introduces AI Technology to Protect Players from Online Abuse

Wimbledon introduces AI technology to protect players from online abuse. Learn how Threat Matrix enhances player protection at the tournament.

Hacker Breaches OpenAI, Exposes AI Secrets – Security Concerns Rise

Hacker breaches OpenAI, exposing AI secrets and raising security concerns. Learn about the breach and its implications for data security.