
Tape is Back.
On-Premises S3 Glacier Without the Lock-In.
XtreemStore is the pure object-native deep archive platform that preserves modern S3 workflows while eliminating filesystem complexity and vendor dependency.
Designed as an object-native archive - not an S3-to-file translation layer.
A New Class of Archive - Where Data Remains Usable Without Recall
Built for object-first storage environments and modern lifecycle-driven workflows. Designed to scale from terabytes to exabytes - and beyond.
Why Now - The Return of Tape at Hyperscale
Tape is undergoing a structural resurgence. Modern rack-scale libraries - including high-density modular designs - are redefining what deep archive looks like. Cartridge capacity keeps growing with each LTO generation, while power consumption and cost per terabyte continue to drop.
At the same time:
- Major cloud providers still charge significant egress fees
- Long-term retention costs are unpredictable
- Organizations are repatriating cold data
- Glacier-style lifecycle policies are becoming standard
This is creating a new architectural model: fast storage stays online, cold storage moves to deep archive, and object workflows remain unchanged - without introducing new translation layers.
XtreemStore was built specifically for this model.
The shift to object-native deep archive is no longer optional - it is becoming the default architecture for long-term data retention.
A Modern Approach to Deep Archive
Enterprise storage environments are no longer filesystem-centric. They are object-first, policy-driven, multi-tiered, and multi-vendor.
Most archive gateways store objects in arrival order. XtreemStore writes objects intentionally - based on metadata-defined relationships.
True On-Premises S3 Glacier
XtreemStore ingests, organizes, and writes data directly as objects - preserving the native structure of modern S3 workflows and delivering true Glacier semantics on-premises.
Why Object-Native Deep Archive
- Preserves S3 Glacier semantics end-to-end
- No protocol translation overhead
- Direct integration with lifecycle policies
- Simplified data path to tape
- Native S3 workflows remain unchanged
XtreemStore delivers true on-premises S3 Glacier - not a translation layer on top of other storage.
Why Gateway-Based Archives Add Complexity
- Additional protocol translation layers
- Extra metadata processing overhead
- More operational failure points
- Performance tied to gateway scaling
XtreemStore eliminates the gateway layer entirely by writing objects directly to archive media - preserving native S3 workflows without translation overhead.
Built to Work With Any Infrastructure
Any Tape Library
Supports all major LTO libraries, multi-library deployments, and mixed-vendor environments. Tape pools can be dynamically assigned and expanded across libraries.
Any Disk Cache or Storage Platform
Works with NVMe cache, SAN storage, NAS storage, and distributed cache nodes. The cache functions as a high-speed ingest buffer, a usable near-line tier, and a staging environment tuned for tape streaming.
Any Glacier-Compatible Workflow
Works directly with Ceph lifecycle policies, VAST Glacier integration, backup platforms, research pipelines, and custom S3 workflows using standard S3 Glacier semantics.
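As a rough illustration of what a Glacier-style lifecycle rule looks like from the client side, the sketch below builds a rule in the shape used by the S3 PutBucketLifecycleConfiguration API. The rule ID, the prefix, and the 30-day threshold are hypothetical, and whether a given platform accepts exactly this storage class name is an assumption to check against its own documentation.

```python
import json


def glacier_transition_rule(rule_id, prefix, days):
    """Build one S3 lifecycle rule that moves objects under `prefix`
    to the GLACIER storage class after `days` days."""
    return {
        "ID": rule_id,
        "Status": "Enabled",
        "Filter": {"Prefix": prefix},
        "Transitions": [{"Days": days, "StorageClass": "GLACIER"}],
    }


# Hypothetical values, for illustration only.
lifecycle_config = {
    "Rules": [glacier_transition_rule("archive-cold-data", "projects/", 30)]
}

# With boto3, this dict would be passed as the LifecycleConfiguration
# argument of put_bucket_lifecycle_configuration.
print(json.dumps(lifecycle_config, indent=2))
```

The rule is plain data, so the same structure can be kept in version control and applied by whichever S3 client the environment already uses.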
Scale Without Limits - To Exabytes
Linear scalability in both capacity and throughput - from terabytes to exabytes+ with no software-imposed limits.
Used in research archives, AI pipelines, backup retention systems, and long-term compliance storage environments.
Scale Up - Increase System Throughput
Add more tape drives, faster disk cache, or higher bandwidth networking.
Result: Higher sustained ingest and retrieval speeds.
Scale Out - Expand to Exabytes
Add additional nodes, libraries, and workers.
Result: Linear scalability. No architecture redesign required.
Deployed in environments managing hundreds of petabytes and scaling toward exabyte-class archives. Typical deployments are multi-library environments spread across geographically distributed sites, holding billions of objects and scaling linearly across drives, nodes, and libraries.
Policy-Driven Lifecycle Management
Intelligent Data Flow Pipeline
Policies define grouping rules, container size, movement timing, and storage placement. Tape operations follow configurable lifecycle rules.
Multi-Copy Protection and Geo-Replication
Up to Four Independent Tape Copies
Distribution across libraries or locations with independent durability zones for maximum redundancy.
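One way to picture copy distribution is as a placement function that never puts two copies of the same container in the same library. The sketch below is only that: the library names are hypothetical, the one-copy-per-library rule is an assumption, and the rotation used to spread load is a simple stand-in rather than XtreemStore's actual placement logic.

```python
MAX_COPIES = 4  # upper bound taken from the "up to four copies" statement


def place_copies(container_id, libraries, copies):
    """Pick `copies` distinct libraries for one container.

    Assumes one copy per library. The rotation by container ID is just a
    deterministic way to spread load across libraries.
    """
    if not 1 <= copies <= MAX_COPIES:
        raise ValueError(f"copies must be between 1 and {MAX_COPIES}")
    if copies > len(libraries):
        raise ValueError("need at least one library per independent copy")
    start = sum(container_id.encode("utf-8")) % len(libraries)
    return [libraries[(start + i) % len(libraries)] for i in range(copies)]


# Hypothetical library names spread over two sites.
libraries = ["site-a/lib-1", "site-a/lib-2", "site-b/lib-1"]
print(place_copies("container-0001", libraries, copies=2))
```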
Multi-Site Replication
Asynchronous replication to secondary XtreemStore clusters supporting distributed archive environments and disaster recovery workflows.
Metadata-Aware Data Organization
Objects can be grouped using project identifiers, dataset structure, metadata tags, or retention classification. This enables logical data segmentation, efficient tape utilization, and simplified compliance workflows.
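The grouping idea can be sketched in a few lines: bucket objects by a metadata tag, then split each group into containers up to a target size. The object records, the `project` tag, and the tiny size limit below are all hypothetical, and the greedy split is an assumption for illustration, not a description of XtreemStore's internal algorithm.

```python
from collections import defaultdict


def plan_containers(objects, group_key, max_container_bytes):
    """Group objects by a metadata tag, then split each group into
    containers no larger than max_container_bytes (an oversized single
    object still gets a container of its own)."""
    groups = defaultdict(list)
    for obj in objects:
        groups[obj["metadata"].get(group_key, "ungrouped")].append(obj)

    containers = []
    for tag, members in sorted(groups.items()):
        current, size = [], 0
        for obj in members:
            # Close the current container before it would overflow.
            if current and size + obj["size"] > max_container_bytes:
                containers.append((tag, current))
                current, size = [], 0
            current.append(obj["key"])
            size += obj["size"]
        if current:
            containers.append((tag, current))
    return containers


# Hypothetical objects; sizes are in bytes and deliberately tiny.
objects = [
    {"key": "run1/a.bam", "size": 600, "metadata": {"project": "genomics"}},
    {"key": "scan/1.dcm", "size": 200, "metadata": {"project": "imaging"}},
    {"key": "run1/b.bam", "size": 500, "metadata": {"project": "genomics"}},
    {"key": "scan/2.dcm", "size": 300, "metadata": {"project": "imaging"}},
]
for tag, keys in plan_containers(objects, "project", max_container_bytes=1000):
    print(tag, keys)
```

Even though the objects arrive interleaved, each container holds objects from one project only, which is the property that keeps related data together on tape.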
High-Throughput Parallel Archiving
Objects are grouped into large, optimized TAR containers so that drives can stream continuously. Each LTO drive sustains throughput in the hundreds of MB/s range, so aggregate throughput scales linearly as drives and nodes are added.
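A back-of-the-envelope calculation shows what linear scaling across drives means in practice. The 300 MB/s per-drive figure is an assumed value within the "hundreds of MB/s" range, and the model is idealized: it ignores mount, seek, and cache-staging time.

```python
def aggregate_throughput_mb_s(drives, per_drive_mb_s):
    """Idealized aggregate throughput: drives stream in parallel."""
    return drives * per_drive_mb_s


def days_to_archive(total_tb, drives, per_drive_mb_s):
    """Days needed to write total_tb at the idealized aggregate rate."""
    total_mb = total_tb * 1_000_000  # decimal units: 1 TB = 1,000,000 MB
    seconds = total_mb / aggregate_throughput_mb_s(drives, per_drive_mb_s)
    return seconds / 86_400


# Assumed example: 1 PB (1000 TB) at 300 MB/s per drive.
for drives in (8, 16):
    print(drives, "drives:", round(days_to_archive(1000, drives, 300), 1), "days")
# 8 drives: 4.8 days
# 16 drives: 2.4 days
```

Doubling the drive count halves the time in this model; real deployments will land somewhat above these numbers depending on how well the cache keeps the drives streaming.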
Hardware Freedom - No Vendor Lock-In
Choose any tape vendor, any library model, any disk infrastructure. Change hardware at any time without rewriting your archive. XtreemStore integrates - it does not replace your existing storage platforms.
Open Formats for Long-Term Durability
XtreemStore writes standard TAR containers to LTFS-formatted tape. Your data remains readable without XtreemStore or any proprietary dependencies - protecting long-term data ownership.
Supports LTO-6 through current LTO generations, enabling long-term infrastructure compatibility and upgrade flexibility.
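To illustrate the open-format point in this section, the sketch below packs a few objects into a TAR archive and reads them back using nothing but Python's standard library. The object keys and the in-memory buffer are hypothetical; how XtreemStore names members or stores object metadata inside its containers is not described here, so treat this as a generic TAR example rather than its on-tape layout.

```python
import io
import tarfile


def build_container(objects):
    """Pack {key: bytes} into an uncompressed TAR archive held in memory."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for key, data in objects.items():
            info = tarfile.TarInfo(name=key)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def read_container(raw):
    """Read every regular member back using only the standard library."""
    with tarfile.open(fileobj=io.BytesIO(raw), mode="r") as tar:
        return {
            member.name: tar.extractfile(member).read()
            for member in tar.getmembers()
            if member.isfile()
        }


container = build_container({"projects/a.txt": b"alpha", "projects/b.txt": b"beta"})
restored = read_container(container)
print(sorted(restored))
```

On an LTFS-mounted tape the container would appear as an ordinary file, so the same `tarfile` calls (or the `tar` command) would apply to it.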
Designed for Layered Storage Architectures
A layered architecture optimizes cost for cold data, preserves performance where it is needed, and lets each tier scale independently.
Built for Real Enterprise Workloads
Life Sciences
Genomics, proteomics, imaging
Medical Archives
PACS, DICOM with compliance requirements
Media & Entertainment
Large content libraries with metadata-driven access
Research & Government
Long-term retention and regulatory storage
Advanced Capabilities
Bucket-Based Archive Management
Native S3 buckets with bucket-level policy control
Tape Spanning
Seamless multi-tape support for large objects
Distributed Architecture
Modular services for scalable deployments
Air-Gapped Security
Offline tape isolation and ransomware protection
Operational Monitoring & Visibility
Real-time system visibility, job tracking, and tape activity monitoring provide operational transparency and simplified management.
XtreemStore Architecture at a Glance
- Pure object-native deep archive
- True on-premises S3 Glacier semantics
- No S3-to-file translation layer
- Supports any tape library vendor
- Up to 4 independent tape copies
- Multi-site replication support
- Open TAR / LTFS data format
- Linear scalability to exabytes+
Comparison Snapshot - Modern Archive Architecture
| Capability | XtreemStore | Bundled Appliances | Filesystem Gateways |
|---|---|---|---|
| Pure Object-Native Archive | ✔ | Limited | No |
| Works With Any Tape Library | ✔ | No | Often limited |
| Works With Any Cache Storage | ✔ | Restricted | Varies |
| Up to 4 Tape Copies + Geo-Replication | ✔ | Limited | Usually not native |
| Open TAR / LTFS Formats | ✔ | Often proprietary | No |
| No Filesystem Namespace Limits | ✔ | N/A | Creates limits |
| Scales Linearly to Exabytes | ✔ | High cost scaling | Challenges at scale |
| Requires S3-to-File Translation Layer | No | Often | Yes |
| Hardware / Vendor Lock-In | None | High | Medium-High |
Frequently Asked Questions
Is my data readable without XtreemStore?
Yes. Data is written in open TAR and LTFS formats, ensuring long-term readability independent of proprietary software.
Can I mix tape vendors and libraries?
Yes. XtreemStore supports multi-vendor and multi-library environments with dynamic pool management.
How does XtreemStore compare to cloud Glacier?
It provides equivalent S3 Glacier semantics and lifecycle policies - without egress fees, unpredictable retrieval costs, or loss of infrastructure control.
How large can XtreemStore scale?
From terabytes to exabytes and beyond. Capacity growth is driven by your infrastructure - not software limits. It handles billions of objects with linear scaling.
Does XtreemStore require proprietary connectors?
No. XtreemStore integrates using standard S3 Glacier-compatible APIs and lifecycle policies. No proprietary translation layers or custom connectors required.
Can I integrate with existing Ceph or VAST environments?
Yes. XtreemStore integrates directly into existing lifecycle workflows without requiring architecture redesign. Your applications continue using standard S3 Glacier API calls.
Does XtreemStore require a filesystem layer?
No. XtreemStore operates as a pure object-native archive platform and does not depend on filesystem namespaces or inode structures.
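For readers unfamiliar with the "standard S3 Glacier API calls" mentioned in the answers above, the sketch below builds the arguments for an S3 RestoreObject request, the call AWS clients use to retrieve an archived object. The bucket, key, and retention days are hypothetical, and whether XtreemStore requires or honors each of these parameters is an assumption not confirmed by this page.

```python
VALID_TIERS = {"Expedited", "Standard", "Bulk"}  # tiers defined by the S3 API


def build_restore_request(bucket, key, days, tier="Bulk"):
    """Build keyword arguments for an S3 RestoreObject call."""
    if tier not in VALID_TIERS:
        raise ValueError(f"tier must be one of {sorted(VALID_TIERS)}")
    return {
        "Bucket": bucket,
        "Key": key,
        "RestoreRequest": {
            "Days": days,
            "GlacierJobParameters": {"Tier": tier},
        },
    }


# Hypothetical bucket and key.
request = build_restore_request("research-archive", "run1/a.bam", days=7)

# With boto3, this would be sent as: s3_client.restore_object(**request)
print(request)
```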
Ready to See XtreemStore in Action?
See how your environment can scale to exabytes while preserving modern S3 workflows and maintaining complete control.
Learn how metadata-driven grouping makes deep archive data usable - even from tape.
This technical guide explains how metadata-driven grouping enables deterministic archive layout, faster retrieval, and scalable deep archive design.