When your AI chatbots mess up

BrandPost By Paul Gillin
Dec 08, 2023
4 mins
Generative AI

AI ‘hallucinations’ present significant business risks, but new types of guardrails can keep them from doing serious damage

Credit: iStock/Tippapatt

Barely a year after the release of ChatGPT and other generative AI tools, 75% of surveyed companies have already put them to work, according to a VentureBeat report. But as the number of new gen AI-powered chatbots grows, so does the risk of their occasional glitches: nonsensical or inaccurate outputs that are not easily screened out of the large language models (LLMs) that power the tools.

In AI parlance, they’re called hallucinations. They don’t present big problems if you’re noodling around with gen AI prompts at home, but in enterprise organizations that are deploying new chatbots to huge numbers of customers and employees, just one AI fabrication can land companies in court.

Last spring, a judge sanctioned a law firm for citing judicial opinions with fake quotes and citations in a legal brief that a chatbot had drafted. The firm admitted that it “failed to believe that a piece of technology could be making up cases out of whole cloth.”

Hallucinations occur when the data used to train LLMs is of poor quality or incomplete. The rate of occurrence runs between 3% and 8% for most generative AI platforms. “Chatbots are almost like a living organism in that they are continually iterating as they ingest new data,” says Steven Smith, chief security architect at Freshworks. “You get out what you put in.”

Chatbot missteps

A customer service chatbot that dispenses incorrect advice or information can undermine key objectives such as customer satisfaction; it can also cause confusion and real harm in complex, highly regulated sectors like healthcare and finance.

In IT organizations, gen AI glitches wreak havoc in other ways. Chatbots may assign service tickets incorrectly, describe a problem inaccurately, or disrupt workflows in ways that lead to significant systemic issues, such as data breaches or misallocation of vital resources, that then require human intervention.

For engineers, AI-generated code used in software development may contain security vulnerabilities or intellectual property ingested during training. AI systems can also overlook complex bugs or security issues that only a developer would catch and resolve.

“Software copilots are fantastic, but you want to read and understand what they give you,” Smith says. “Blindly putting code into production because you believe it’s from an expert is no safer than copying it from StackExchange—the question and answer site once favored by coders in search of a specific snippet—if you have no idea what that code is doing.”

Minimizing risk

Many companies are starting to invest in mitigating risk. Here are some of the most effective strategies, according to experts.

  • Deploy content filters. A variety of technical or policy-based guardrails can protect against inappropriate or harmful content. For example, content filters can decline to respond to questions about sensitive issues or topics. In customer-service scenarios, a chatbot should quickly hand off an inquiry to a human operator if it is confused or unable to track down the precise answer. (A minimal sketch of this pattern appears after this list.)
  • Continually upgrade data quality. When training LLMs, IT teams should validate the data to ensure it is high quality, relevant, and comprehensive. Training data should be reviewed regularly to protect against “model drift,” the degradation of performance that occurs as the underlying data changes over time.
  • Add security guardrails. Limiting a chatbot’s ability to connect to third-party apps and services reduces the opportunity to generate misleading, inaccurate, or potentially damaging data. Side benefits of sandboxing the chatbot in this way are better performance (fewer dependencies) and easier compliance in industries where that is essential.
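The first and third strategies can be combined in the layer that sits between users and the underlying model. The following Python sketch is illustrative only, not Freshworks’ implementation: the topic keywords, confidence threshold, and integration allowlist are hypothetical placeholders, and a production system would rely on purpose-built moderation, routing, and access-control services.

```python
# Minimal guardrail sketch (illustrative only): a wrapper around a chatbot
# that declines sensitive topics, hands low-confidence answers to a human,
# and blocks calls to integrations that are not on an approved allowlist.
# The topic rules, threshold, and integration names below are hypothetical.

from dataclasses import dataclass

SENSITIVE_TOPICS = {"medical diagnosis", "legal advice", "investment advice"}
ALLOWED_INTEGRATIONS = {"ticketing", "knowledge_base"}   # sandboxed set
CONFIDENCE_THRESHOLD = 0.75                              # hand off below this


@dataclass
class BotAnswer:
    text: str
    confidence: float  # model- or retrieval-derived score, 0.0 to 1.0


def classify_topic(question: str) -> str:
    """Crude keyword-based topic tagger; a real system would use a classifier."""
    q = question.lower()
    if "diagnos" in q or "symptom" in q:
        return "medical diagnosis"
    if "lawsuit" in q or "contract" in q:
        return "legal advice"
    return "general"


def call_integration(name: str, payload: dict) -> dict:
    """Only reach third-party services that are explicitly allowlisted."""
    if name not in ALLOWED_INTEGRATIONS:
        raise PermissionError(f"Integration '{name}' is not allowlisted")
    return {"integration": name, "status": "ok", "payload": payload}


def answer_with_guardrails(question: str, generate) -> str:
    """`generate` is any callable that returns a BotAnswer for a question."""
    # 1. Content filter: decline sensitive topics outright.
    if classify_topic(question) in SENSITIVE_TOPICS:
        return "I can't help with that topic. Connecting you with a specialist."

    # 2. Confidence check: hand off to a human instead of guessing.
    answer = generate(question)
    if answer.confidence < CONFIDENCE_THRESHOLD:
        ticket = call_integration("ticketing", {"question": question})
        return f"I'm not certain about this one; a human agent will follow up ({ticket['status']})."

    return answer.text


if __name__ == "__main__":
    # Stub model for demonstration; a real deployment would call an LLM here.
    def fake_model(q: str) -> BotAnswer:
        return BotAnswer(text="Restart the device, then retry the update.", confidence=0.62)

    print(answer_with_guardrails("How do I fix error 42?", fake_model))
    print(answer_with_guardrails("Can you diagnose my symptoms?", fake_model))
```

The key design choice in this kind of wrapper is that the chatbot never answers a sensitive or low-confidence question on its own; it either declines or routes the inquiry to a human through an allowlisted integration.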

Hallucinations may be a problem today, but research is underway to solve them. Researchers are exploring everything from building bigger models to having LLMs fact-check their own output in an effort to improve both accuracy and reliability.

Ultimately, the best way to mitigate the risks of chatbot errors, Smith says, is to use common sense. “AI can be fantastic, but it needs to operate under your rules of engagement,” he says. “You want to define the things it can do, but also the things it cannot do, and ensure that it operates within those specific parameters.”

For more insights about innovating with AI while minimizing the risks, visit The Works.