Back to all posts
development
USE CASES
Company
Infrastructure
Workflows
Mar 13, 2025

How to optimize AI data pipelines with SuperAnnotate & UltiHash

Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently

AI teams are constantly pushing the boundaries—building better models, improving data quality, and accelerating iteration cycles. From curating massive datasets to optimizing training pipelines, every step in the AI lifecycle demands efficiency, scalability, and precision.

Yet, as datasets grow larger, storage and data retrieval can become significant bottlenecks. Slow access times, redundant data storage, and complex integrations increase costs and slow down AI workflows.

That’s where SuperAnnotate and UltiHash come in. This integration helps AI teams:

  • Reduce storage usage by up to 60% with UltiHash’s built-in deduplication.
  • Speed up data access by 250%, eliminating slow retrieval times.

By combining UltiHash’s scalable, high-performance object storage with SuperAnnotate’s industry-leading annotation platform, AI teams can now work faster, more efficiently, and at a lower cost—from dataset preparation to model training.

Who Benefits from This Integration?

AI teams processing massive visual datasets – Quickly annotate and manage large-scale data with lower storage costs. (e.g., self-driving car datasets, aerial imagery, medical imaging)

Teams training LLMs & CV models – Accelerate dataset curation with near-instant data retrieval. (e.g., NLP teams training large-scale foundation models, computer vision teams working with video frames)

By removing traditional storage bottlenecks, this integration empowers computer vision, NLP, and AI teams to process and analyze large datasets with unprecedented efficiency.

How to connect your UltiHash Cluster to SuperAnnotate

Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently—without unnecessary storage overhead or complex setup. This connection is powered by SuperAnnotate’s Custom Integration API, allowing secure access to data via temporary pre-signed URLs.

  1. Deploy the Pre-Signed URLs Generator
    Ensure the pre-signed URLs generator is running in the Kubernetes cluster hosting UltiHash. This service generates time-limited S3 pre-signed URLs on demand, allowing SuperAnnotate to securely access stored data.

  2. Retrieve Your Credentials
    Retrieve your automatically generated Access Key and Secret Key. These are stored in a Kubernetes secret and can be accessed using kubectl.

  3. Configure SuperAnnotate for UltiHash Storage
  • Request URL: Use the public HTTPS endpoint of the pre-signed URLs generator (e.g., https://<urls-generator-domain-name>/integrate).
  1. Enable CORS
    Ensure Cross-Origin Resource Sharing (CORS) is activated for uninterrupted access.

  2. Secure the Integration Token
    SuperAnnotate will use a secret token to authenticate requests to the pre-signed URLs generator. Keep this token secure to maintain controlled access.

  3. Verify & Start Annotating
    Once the integration is set up, your data is instantly available in SuperAnnotate. Start annotating and managing your datasets—no extra steps required.
For a full tutorial on how to integrate SuperAnnotate with UltiHash, see this documentation

About UltiHash

UltiHash is high-performance object storage built for modern AI and analytics workflows. With built-in byte-level deduplication, it reduces data redundancy and storage costs, all while keeping performance lightning-fast. Its Kubernetes-native architecture enables seamless deployment across cloud, on-premises, and hybrid environments. Finally, it features an S3-compatible API for easy integration with a huge range of tools across the data stack.

About SuperAnnotate

SuperAnnotate is the only fully customizable, one-stop platform for building exactly the annotation tools and workflows your AI projects demand—while unifying the management of all your teams, vendors, and data in one place. Forget juggling separate solutions and patchwork processes; SuperAnnotate streamlines high-quality data delivery across your entire AI portfolio.

The Future of AI Data Management Starts Here

We’ve tested it. It works. Now, it’s your turn. Try the integration yourself and see how it transforms your AI workflows.

Click here to book a demo of UltiHash, and make sure to visit SuperAnnotate to get started with their annotation platform.


Share this post:
Check this out:
How to optimize AI data pipelines with SuperAnnotate & UltiHash
Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently
Posted by
Tom Lüdersdorf
Founder & CEO
Build faster AI infrastructure with less storage resources
Get 10TB Free
Get started with SuperAnnotate and UltiHash
Get 10TiB Free

How to optimize AI data pipelines with SuperAnnotate & UltiHash

Tom Lüdersdorf
Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently

AI teams are constantly pushing the boundaries—building better models, improving data quality, and accelerating iteration cycles. From curating massive datasets to optimizing training pipelines, every step in the AI lifecycle demands efficiency, scalability, and precision.

Yet, as datasets grow larger, storage and data retrieval can become significant bottlenecks. Slow access times, redundant data storage, and complex integrations increase costs and slow down AI workflows.

That’s where SuperAnnotate and UltiHash come in. This integration helps AI teams:

  • Reduce storage usage by up to 60% with UltiHash’s built-in deduplication.
  • Speed up data access by 250%, eliminating slow retrieval times.

By combining UltiHash’s scalable, high-performance object storage with SuperAnnotate’s industry-leading annotation platform, AI teams can now work faster, more efficiently, and at a lower cost—from dataset preparation to model training.

Who Benefits from This Integration?

AI teams processing massive visual datasets – Quickly annotate and manage large-scale data with lower storage costs. (e.g., self-driving car datasets, aerial imagery, medical imaging)

Teams training LLMs & CV models – Accelerate dataset curation with near-instant data retrieval. (e.g., NLP teams training large-scale foundation models, computer vision teams working with video frames)

By removing traditional storage bottlenecks, this integration empowers computer vision, NLP, and AI teams to process and analyze large datasets with unprecedented efficiency.

How to connect your UltiHash Cluster to SuperAnnotate

Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently—without unnecessary storage overhead or complex setup. This connection is powered by SuperAnnotate’s Custom Integration API, allowing secure access to data via temporary pre-signed URLs.

  1. Deploy the Pre-Signed URLs Generator
    Ensure the pre-signed URLs generator is running in the Kubernetes cluster hosting UltiHash. This service generates time-limited S3 pre-signed URLs on demand, allowing SuperAnnotate to securely access stored data.

  2. Retrieve Your Credentials
    Retrieve your automatically generated Access Key and Secret Key. These are stored in a Kubernetes secret and can be accessed using kubectl.

  3. Configure SuperAnnotate for UltiHash Storage
  • Request URL: Use the public HTTPS endpoint of the pre-signed URLs generator (e.g., https://<urls-generator-domain-name>/integrate).
  1. Enable CORS
    Ensure Cross-Origin Resource Sharing (CORS) is activated for uninterrupted access.

  2. Secure the Integration Token
    SuperAnnotate will use a secret token to authenticate requests to the pre-signed URLs generator. Keep this token secure to maintain controlled access.

  3. Verify & Start Annotating
    Once the integration is set up, your data is instantly available in SuperAnnotate. Start annotating and managing your datasets—no extra steps required.
For a full tutorial on how to integrate SuperAnnotate with UltiHash, see this documentation

About UltiHash

UltiHash is high-performance object storage built for modern AI and analytics workflows. With built-in byte-level deduplication, it reduces data redundancy and storage costs, all while keeping performance lightning-fast. Its Kubernetes-native architecture enables seamless deployment across cloud, on-premises, and hybrid environments. Finally, it features an S3-compatible API for easy integration with a huge range of tools across the data stack.

About SuperAnnotate

SuperAnnotate is the only fully customizable, one-stop platform for building exactly the annotation tools and workflows your AI projects demand—while unifying the management of all your teams, vendors, and data in one place. Forget juggling separate solutions and patchwork processes; SuperAnnotate streamlines high-quality data delivery across your entire AI portfolio.

The Future of AI Data Management Starts Here

We’ve tested it. It works. Now, it’s your turn. Try the integration yourself and see how it transforms your AI workflows.

Click here to book a demo of UltiHash, and make sure to visit SuperAnnotate to get started with their annotation platform.