Back to the Index

Model Registry

What is a model registry?

A model registry is a version control system for models that provides APIs to store and retrieve models and model-related artifacts. A model registry stores different versions of models, including metadata about their performance metrics, author, creation date, dependencies, usage, and lineage (training experiment code/hyperparameters and /training data).

When is a model registry needed?

Model registries are recommended for projects involving ML pipelines and/or those that require stringent governance, traceability, and management. The model registry stores the models and their metadata as the output of training pipelines, decoupling training pipelines from inference pipelines. Inference pipelines download a versioned model from the model registry during initialization, and keep it cached (this is safe, as models are immutable) for inference requests. In general, a model registry enables better collaboration, versioning, and organization of models in MLOps.

Examples of a model registry

Some model registries provide a unified API for storing and retrieving models and their metadata, such as Hopsworks, Sagemaker, and Weights & Biases, while others, such as MLFlow, separate the metadata store for model metadata from an artifact store, where the serialized models are stored.

Example code for registering a Scikit-Learn model to Hopsworks is shown below:

 from hsml.schema import Schema
from hsml.model_schema import ModelSchema

input_schema = Schema(X_train)  # take schema from train-set features DataFrame
output_schema = Schema(y_train) # take schema from train-set labels DataFrame

fraud_model = mr.sklearn.create_model("the_model",
                                      # 'accuracy' for the model is computed on the test set
                                      metrics={'accuracy': accuracy},
                                      # 'input_example' is used as a test row for a deployment
                                      input_example=X_train.sample().to_numpy(), 
                                      model_schema=ModelSchema(input_schema=input_schema, output_schema=output_schema))
fraud_model.save('the_model')

Interested for more?

🤖 Register for free on Hopsworks Serverless
🐍 Learn all about the Python-Centric Feature Store
🛠️ Explore all Hopsworks Integrations
🧩 Get started with codes and examples
⚖️ Compare other Feature Stores with Hopsworks

Does this content look outdated? If you are interested in helping us maintain this, feel free to contact us.

M

Auto-regressive Models

M

Backfill features

Backfill training data

Backpressure for feature stores

Batch Inference Pipeline

M

CI/CD for MLOps

Compound AI Systems

Context Window for LLMs

M

DAG Processing Model

Data Compatibility

Data Partitioning

Data Transformation

Data Type (for features)

Data Validation (for features)

Data-Centric ML

Dimensional Modeling and Feature Stores

M

Encoding (for Features)

M

Gradient Accumulation

Grouped Query Attention

M

Hallucinations in LLMs

Hyperparameter Tuning

M

Idempotent Machine Learning Pipelines

In Context Learning (ICL)

Inference Pipeline

Instruction Datasets for Fine-Tuning LLMs

M

LLM Code Interpreter

LLM Temperature

LLMs - Large Language Models

Lagged features

M

Natural Language Processing (NLP)

M

On-Demand Features

On-Demand Transformation

Online Inference Pipeline

Online-Offline Feature Skew

Online-Offline Feature Store Consistency

M

Parameter-Efficient Fine-Tuning (PEFT) of LLMs

Point-in-Time Correct Joins

Precomputed Features

Prompt Engineering

M

RLHF - Reinforcement Learning from Human Feedback

Real-Time Machine Learning

Recommender System

Representation Learning

Retrieval Augmented Generation (RAG) for LLMs

M

SQL UDF in Python

Similarity Search

Splitting Training Data

Streaming Feature Pipeline

Streaming Inference Pipeline

M

Theory-of-Mind Tasks

Time travel (for features)

Train (Training) Set

Training Pipeline

Training-Inference Skew

Two-Tower Embedding Model

Types of Machine Learning

M

M

Vector Database

Versioning (of ML Artifacts)