Hub Docs Home
We hope you enjoy the Docs for Hub, the open-source dataset format for AI by Activeloop!

Why Use Hub?

  • Data scientists spend the majority of their time building infrastructure, transferring data, and writing boilerplate code. Hub streamlines these tasks so that you can focus on building amazing ML models.
  • Hub enables you to stream unlimited amounts of data from the cloud to any machine without sacrificing performance compared to local storage.
  • Hub connects datasets to PyTorch and TensorFlow with minimal boilerplate code, and it contains powerful tools for version control, building ML pipelines, and running distributed workloads.
These features are possible because Hub stores data as compressed chunked arrays that can be stored anywhere and accessed through a simple and intuitive API.
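To make the idea of compressed chunked arrays concrete, here is a minimal sketch using only the Python standard library. It is not Hub's actual storage layout or API: the names `write_chunks`, `read_item`, and `CHUNK_SIZE`, the `chunk_N` key scheme, and the choice of zlib are all assumptions made for illustration. It shows values split into fixed-size chunks, each chunk compressed on its own, and a single value read back by decompressing only the chunk that contains it.

```python
import struct
import zlib

CHUNK_SIZE = 4  # values per chunk (hypothetical; chosen small for readability)


def write_chunks(values, chunk_size=CHUNK_SIZE):
    """Split floats into fixed-size chunks and compress each chunk separately.

    Returns a dict mapping a chunk key to compressed bytes. A plain dict
    stands in for any key-value store (local disk, object storage, ...).
    """
    store = {}
    for chunk_id, start in enumerate(range(0, len(values), chunk_size)):
        part = values[start:start + chunk_size]
        raw = struct.pack(f"<{len(part)}d", *part)  # little-endian float64
        store[f"chunk_{chunk_id}"] = zlib.compress(raw)
    return store


def read_item(store, index, chunk_size=CHUNK_SIZE):
    """Read one value by fetching and decompressing only its chunk."""
    chunk_id, offset = divmod(index, chunk_size)
    raw = zlib.decompress(store[f"chunk_{chunk_id}"])
    return struct.unpack_from("<d", raw, offset * 8)[0]  # 8 bytes per float64


data = [float(i) for i in range(10)]
store = write_chunks(data)
value = read_item(store, 7)  # touches only "chunk_1"
```

Because every chunk is an independent blob under its own key, a reader only needs to transfer the chunks it actually touches, which is what makes streaming from remote storage practical in a design like this.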

To start using Hub right away, check out our Getting Started Colab Notebook.

Apart from Hub Docs, you may also check out Hub's GitHub repository and give us a ⭐ if you like the project!
Join Hub's Slack Community if you need help or have suggestions for improving the documentation!

