Hallucinations are LLM responses that appear plausible but are actually false or unsupported by the given input. They can occur when a prompt is ambiguous or open to more than one interpretation, or when the LLM has not been trained on enough relevant data.
Providing more context in your prompts narrows the space of possible completions and steers the model toward what you are looking for, so the LLM generates more relevant results.
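For example, the following minimal sketch contrasts an ambiguous prompt with a context-rich one; it assumes the OpenAI Python client, and the model name and prompt wording are illustrative choices, not prescriptions:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def ask(prompt: str) -> str:
    """Send a single user prompt and return the model's text reply."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model name; substitute your own
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

# Ambiguous prompt: the model must guess which interpretation of "fast" you mean.
vague = ask("How fast is Python?")

# Context-rich prompt: the added constraints narrow the space of plausible answers.
specific = ask(
    "I am comparing CPython 3.12 and PyPy 7.3 for a CPU-bound numerical "
    "simulation. Summarize the typical runtime difference and note when "
    "PyPy's JIT is unlikely to help. If you are unsure, say so."
)

print(vague)
print(specific)
```

The second prompt leaves far less room for the model to fill gaps with invented details, which is exactly the behavior that produces hallucinations.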
Zhang et al. show how to mitigate hallucination snowballing: as part of a prompt chain, the model is first asked to acknowledge a mistake before being asked to correct its output in a subsequent prompt.
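The sketch below illustrates one way such a prompt chain might look, reusing the ask() helper from the previous example. The wording of the verification prompt is an assumption for illustration, not Zhang et al.'s original phrasing:

```python
def answer_then_verify(question: str) -> str:
    """Two-step prompt chain: answer first, then acknowledge and fix any mistake."""
    # Step 1: get an initial answer, which may contain a hallucinated claim.
    initial = ask(question)

    # Step 2: before asking for a correction, ask the model to explicitly
    # acknowledge whether its earlier answer contains an error, so it does not
    # keep building on the mistake (the "snowball" effect).
    verification_prompt = (
        f"Question: {question}\n"
        f"Your earlier answer: {initial}\n\n"
        "First, state explicitly whether the earlier answer contains a factual "
        "mistake, and if so, acknowledge what the mistake is. Only then give a "
        "corrected answer."
    )
    return ask(verification_prompt)

print(answer_then_verify("Is 9677 a prime number? Answer yes or no, then explain."))
```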
Yann LeCun famously claimed that, because LLMs are auto-regressive models, hallucinations are an inevitable property of them. Others believe that hallucination rates will fall below human error rates as LLMs become larger and are trained on more data.