Use JOINs for feature reuse to save on infrastructure and reduce the number of feature pipelines needed to maintain models in production.
Programmers know data types, but what is a feature type to a programmer new to machine learning, given no mainstream programming language has native support for them?
Integrate with third-party security standards and take advantage of our project-based multi-tenancy model to host data in a single shared cluster.
Operational machine learning requires the offline and online testing of both features and models. In this article, we show you how to design, build, and run tests for features.
This blog is a guide to the popular file formats used in open source frameworks for machine learning in Python, including TensorFlow/Keras, PyTorch, Scikit-Learn, and PySpark.
Why HopsFS is a great choice as a distributed file system (DFS) at a time when DFS is becoming increasingly indispensable as a central store for training data, logs, model serving, and checkpoints.
We have many conversations with companies and organizations who are deciding between building their own feature store and buying one. We thought we would share our experience of building one.
© Hopsworks 2022. All rights reserved. Various trademarks held by their respective owners.