Microsoft Creates Tools to Stop Users From Tricking Chatbots

Jackie Davalos

Thu, Mar 28, 2024, 11:52 AM2 min read

In this article:

(Bloomberg) -- Microsoft Corp. is trying to make it harder for people to trick artificial intelligence chatbots into doing weird things.

Most Read from Bloomberg

New safety features are being built into Azure AI Studio which lets developers build customized AI assistants using their own data, the Redmond, Washington-based company said in a blog post on Thursday.

The tools include “prompt shields,” which are designed to detect and block deliberate attempts — also known as prompt injection attacks or jailbreaks — to make an AI model behave in an unintended way. Microsoft is also addressing “indirect prompt injections,” when hackers insert malicious instructions into the data a model is trained on and trick it into performing such unauthorized actions as stealing user information or hijacking a system.

Such attacks are “a unique challenge and threat,” said Sarah Bird, Microsoft’s chief product officer of responsible AI. The new defenses are designed to spot suspicious inputs and block them in real time, she said. Microsoft is also rolling out a feature that alerts users when a model makes things up or generates erroneous responses.

Microsoft is keen to boost trust in its generative AI tools, which are now being used by consumers and corporate customers alike. In February, the company investigated incidents involving its Copilot chatbot, which was generating responses that ranged from weird to harmful. After reviewing the incidents, Microsoft said users had deliberately tried to fool Copilot into generating the responses.

“Certainly we see it increasing as there’s more use of the tools but also as more people are aware of these different techniques,” Bird said. Tell-tale signs of such attacks include asking a chatbot a question multiple times or prompts that describe role-playing.

Microsoft is OpenAI’s largest investor and has made the partnership a key part of its AI strategy. Bird said Microsoft and OpenAI are dedicated to deploying AI safely and building protections into the large language models underlying generative AI.

“However, you can’t rely on the model alone,” she said. “These jailbreaks for example, are an inherent weakness of the model technology.”

(Updates with more context in seventh paragraph.)

Most Read from Bloomberg Businessweek

©2024 Bloomberg L.P.

Advertisement

Analysts revamp Microsoft stock price target after earnings
Here's what could happen next to Microsoft shares.
TheStreet•8h ago
A Once-in-a-Decade Investment Opportunity: 2 Artificial Intelligence (AI) Stocks to Buy Now and Hold Long Term
These stocks could help investors turn a profit as artificial intelligence takes root across business processes and consumer products.
Motley Fool•22h ago
Despite complaints, Apple hasn't yet removed an obviously fake app pretending to be RockAuto
Apple's App Store isn't always as trustworthy as the company claims. The latest example comes from RockAuto, an auto parts dealer popular with home mechanics and other DIYers, which is upset that a fake app masquerading as its official app has not been removed from the App Store, despite numerous complaints to Apple. RockAuto co-founder and president Jim Taylor was first alerted to the situation when customers began complaining about "annoying ads" in its app -- something he said "surprised us s
TechCrunch•15h ago
1 Monster Artificial Intelligence (AI) Growth Stock Up 45,900% in 20 Years to Buy Now, According to Wall Street
Nvidia stock produced life-changing returns over the past two decades, but Wall Street analysts still see upside for shareholders.
Motley Fool•22h ago
Intel Tumbles Most in Three Months After Tepid Forecast
(Bloomberg) -- Intel Corp., the biggest maker of personal computer processors, tumbled the most in three months on Friday after giving a lackluster forecast for the current period, indicating that it’s still struggling to return to the top tier of the chip industry. Most Read from BloombergPlunging Home Prices, Fleeing Companies: Austin’s Glow Is FadingJavier Milei Fuels Wild Rally That Makes Peso No. 1 in WorldThe Long, Slow Death of Urban NightlifeApple Intensifies Talks With OpenAI for iPhone
Bloomberg•11h ago
These are the countries where TikTok is already banned
TikTok is in the crosshairs of authorities in the U.S., where new law threatens a nationwide ban unless its China-based parent ByteDance divests. TikTok is already banned in a handful of countries and from government-issued devices in a number of others, due to official worries that the app poses privacy and cybersecurity concerns. TikTok has long maintained that it doesn’t share data with the Chinese government and its CEO has taken a defiant stance, vowing to fight back.
Associated Press Finance•21h ago
Why an iPhone Can Survive a Drop From a Plane, but Not From Your Kitchen Counter
An iPhone that flew out of an airplane at 16,000 feet survived without a scratch. To find out how that’s even possible, WSJ’s Joanna Stern dropped Apple and Samsung phones from a drone. How did that iPhone survive?!
The Wall Street Journal•21h ago
What the New Ray-Ban Meta Smart Glasses Mean for Travelers
These AI-powered smart glasses offer an early glimpse at what on-the-go virtual travel assistants could be.
Skift•13h ago
Forget Nvidia: 2 Artificial Intelligence (AI) Stocks to Buy Instead
Evolving business strategies in the tech sector might transform your investment options. Two adaptable tech giants could be more appealing than Nvidia right now.
Motley Fool•2d ago
U.S. chip bans not meant to hobble China's growth, Blinken says
U.S. export controls on sending advanced computing chips to China are not meant to hold back China's economy or technological development, Secretary of State Antony Blinken said during an interview with National Public Radio on Friday. Since 2022, U.S. officials have imposed sweeping controls on which computing chips can be exported to China, cutting off some sales from Nvidia, Advanced Micro Devices and Intel, among others.
Reuters•7h ago