How to optimize AI data pipelines with SuperAnnotate & UltiHash
Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently
AI teams are constantly pushing the boundaries—building better models, improving data quality, and accelerating iteration cycles. From curating massive datasets to optimizing training pipelines, every step in the AI lifecycle demands efficiency, scalability, and precision.
Yet, as datasets grow larger, storage and data retrieval can become significant bottlenecks. Slow access times, redundant data storage, and complex integrations increase costs and slow down AI workflows.
Reduce storage usage by up to 60% with UltiHash’s built-in deduplication.
Speed up data access by 250%, eliminating slow retrieval times.
By combining UltiHash’s scalable, high-performance object storage with SuperAnnotate’s industry-leading annotation platform, AI teams can now work faster, more efficiently, and at a lower cost—from dataset preparation to model training.
Who Benefits from This Integration?
AI teams processing massive visual datasets – Quickly annotate and manage large-scale data with lower storage costs. (e.g., self-driving car datasets, aerial imagery, medical imaging)
Teams training LLMs & CV models – Accelerate dataset curation with near-instant data retrieval. (e.g., NLP teams training large-scale foundation models, computer vision teams working with video frames)
By removing traditional storage bottlenecks, this integration empowers computer vision, NLP, and AI teams to process and analyze large datasets with unprecedented efficiency.
How to connect your UltiHash Cluster to SuperAnnotate
Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently—without unnecessary storage overhead or complex setup. This connection is powered by SuperAnnotate’s Custom Integration API, allowing secure access to data via temporary pre-signed URLs.
Deploy the Pre-Signed URLs Generator Ensure the pre-signed URLs generator is running in the Kubernetes cluster hosting UltiHash. This service generates time-limited S3 pre-signed URLs on demand, allowing SuperAnnotate to securely access stored data.
Retrieve Your Credentials Retrieve your automatically generated Access Key and Secret Key. These are stored in a Kubernetes secret and can be accessed using kubectl.
Configure SuperAnnotate for UltiHash Storage
Request URL: Use the public HTTPS endpoint of the pre-signed URLs generator (e.g., https://<urls-generator-domain-name>/integrate).
Enable CORS Ensure Cross-Origin Resource Sharing (CORS) is activated for uninterrupted access.
Secure the Integration Token SuperAnnotate will use a secret token to authenticate requests to the pre-signed URLs generator. Keep this token secure to maintain controlled access.
Verify & Start Annotating Once the integration is set up, your data is instantly available in SuperAnnotate. Start annotating and managing your datasets—no extra steps required.
For a full tutorial on how to integrate SuperAnnotate with UltiHash, see this documentation.
About UltiHash
UltiHash is high-performance object storage built for modern AI and analytics workflows. With built-in byte-level deduplication, it reduces data redundancy and storage costs, all while keeping performance lightning-fast. Its Kubernetes-native architecture enables seamless deployment across cloud, on-premises, and hybrid environments. Finally, it features an S3-compatible API for easy integration with a huge range of tools across the data stack.
About SuperAnnotate
SuperAnnotate is the only fully customizable, one-stop platform for building exactly the annotation tools and workflows your AI projects demand—while unifying the management of all your teams, vendors, and data in one place. Forget juggling separate solutions and patchwork processes; SuperAnnotate streamlines high-quality data delivery across your entire AI portfolio.
The Future of AI Data Management Starts Here
We’ve tested it. It works. Now, it’s your turn. Try the integration yourself and see how it transforms your AI workflows.
How to optimize AI data pipelines with SuperAnnotate & UltiHash
Tom Lüdersdorf
Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently
AI teams are constantly pushing the boundaries—building better models, improving data quality, and accelerating iteration cycles. From curating massive datasets to optimizing training pipelines, every step in the AI lifecycle demands efficiency, scalability, and precision.
Yet, as datasets grow larger, storage and data retrieval can become significant bottlenecks. Slow access times, redundant data storage, and complex integrations increase costs and slow down AI workflows.
Reduce storage usage by up to 60% with UltiHash’s built-in deduplication.
Speed up data access by 250%, eliminating slow retrieval times.
By combining UltiHash’s scalable, high-performance object storage with SuperAnnotate’s industry-leading annotation platform, AI teams can now work faster, more efficiently, and at a lower cost—from dataset preparation to model training.
Who Benefits from This Integration?
AI teams processing massive visual datasets – Quickly annotate and manage large-scale data with lower storage costs. (e.g., self-driving car datasets, aerial imagery, medical imaging)
Teams training LLMs & CV models – Accelerate dataset curation with near-instant data retrieval. (e.g., NLP teams training large-scale foundation models, computer vision teams working with video frames)
By removing traditional storage bottlenecks, this integration empowers computer vision, NLP, and AI teams to process and analyze large datasets with unprecedented efficiency.
How to connect your UltiHash Cluster to SuperAnnotate
Integrating UltiHash with SuperAnnotate enables AI teams to access and annotate their datasets efficiently—without unnecessary storage overhead or complex setup. This connection is powered by SuperAnnotate’s Custom Integration API, allowing secure access to data via temporary pre-signed URLs.
Deploy the Pre-Signed URLs Generator Ensure the pre-signed URLs generator is running in the Kubernetes cluster hosting UltiHash. This service generates time-limited S3 pre-signed URLs on demand, allowing SuperAnnotate to securely access stored data.
Retrieve Your Credentials Retrieve your automatically generated Access Key and Secret Key. These are stored in a Kubernetes secret and can be accessed using kubectl.
Configure SuperAnnotate for UltiHash Storage
Request URL: Use the public HTTPS endpoint of the pre-signed URLs generator (e.g., https://<urls-generator-domain-name>/integrate).
Enable CORS Ensure Cross-Origin Resource Sharing (CORS) is activated for uninterrupted access.
Secure the Integration Token SuperAnnotate will use a secret token to authenticate requests to the pre-signed URLs generator. Keep this token secure to maintain controlled access.
Verify & Start Annotating Once the integration is set up, your data is instantly available in SuperAnnotate. Start annotating and managing your datasets—no extra steps required.
For a full tutorial on how to integrate SuperAnnotate with UltiHash, see this documentation.
About UltiHash
UltiHash is high-performance object storage built for modern AI and analytics workflows. With built-in byte-level deduplication, it reduces data redundancy and storage costs, all while keeping performance lightning-fast. Its Kubernetes-native architecture enables seamless deployment across cloud, on-premises, and hybrid environments. Finally, it features an S3-compatible API for easy integration with a huge range of tools across the data stack.
About SuperAnnotate
SuperAnnotate is the only fully customizable, one-stop platform for building exactly the annotation tools and workflows your AI projects demand—while unifying the management of all your teams, vendors, and data in one place. Forget juggling separate solutions and patchwork processes; SuperAnnotate streamlines high-quality data delivery across your entire AI portfolio.
The Future of AI Data Management Starts Here
We’ve tested it. It works. Now, it’s your turn. Try the integration yourself and see how it transforms your AI workflows.