
Instruction Datasets for Fine-Tuning LLMs

What are Instruction Datasets for Fine-Tuning LLMs?

Instruction datasets are used to fine-tune LLMs. Fine-tuning typically uses supervised machine learning, where each training example pairs an input string with an expected output string. The input and output strings follow a template known as an instruction dataset format (e.g., the [INST] and <<SYS>> tags used by Llama 2 chat models). ChatML by OpenAI and Alpaca from Stanford are examples of instruction dataset formats. The following is the instruction dataset format used by Alpaca for fine-tuning examples that include context information (the input field below):

Below is an instruction that describes a task, paired with an input that provides further context. 
Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Input:
{input}

### Response:
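
As a minimal sketch of how the template above could be applied, the following Python fills it in from a single dataset record and pairs the resulting prompt with the expected output string. The record keys (`instruction`, `input`, `output`), the `format_example` helper, and the `prompt`/`completion` field names are assumptions made for illustration, and the whitespace simply mirrors the template as printed above; a real fine-tuning framework may expect different names or layout.

```python
# Alpaca-style template with context, copied from the text above.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context.\n"
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)


def format_example(record):
    """Turn one dataset record into an (input string, expected output) pair.

    `record` is a hypothetical dict with 'instruction', 'input' and 'output' keys.
    """
    prompt = ALPACA_TEMPLATE.format(
        instruction=record["instruction"],
        input=record["input"],
    )
    # The model is trained to produce `completion` when given `prompt`.
    return {"prompt": prompt, "completion": record["output"]}


if __name__ == "__main__":
    # Made-up record, used only to show the shape of the data.
    example = format_example(
        {
            "instruction": "Translate the text to French.",
            "input": "Good morning",
            "output": "Bonjour",
        }
    )
    print(example["prompt"] + example["completion"])
```

During supervised fine-tuning, the prompt is the input string and the completion is the expected output string described above.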
