XtreemStore

Tape is Back.
On-Premises S3 Glacier Without the Lock-In.

XtreemStore is the pure object-native deep archive platform that preserves modern S3 workflows while eliminating filesystem complexity and vendor dependency.

Designed as an object-native archive - not an S3-to-file translation layer.

A New Class of Archive - Where Data Remains Usable Without Recall

Built for object-first storage environments and modern lifecycle-driven workflows. Designed to scale from terabytes to exabytes - and beyond.

Why Now - The Return of Tape at Hyperscale

Tape is undergoing a structural resurgence. Modern rack-scale libraries - including high-density modular designs - are redefining what deep archive looks like. Capacity is growing exponentially, while power consumption and cost-per-terabyte continue to drop.

At the same time: major cloud providers still charge significant egress fees, long-term retention costs are unpredictable, organizations are repatriating cold data, and Glacier-style lifecycle policies are becoming standard.

This is creating a new architectural model: Fast storage stays online. Cold storage moves to deep archive. Object workflows remain unchanged - without introducing new translation layers.

XtreemStore was built specifically for this model.

The shift to object-native deep archive is no longer optional - it is becoming the default architecture for long-term data retention.

A Modern Approach to Deep Archive

Enterprise storage environments are no longer filesystem-centric. They are object-first, policy-driven, multi-tiered, and multi-vendor.

Most archive gateways store objects in arrival order. XtreemStore writes objects intentionally - based on metadata-defined relationships.

True On-Premises S3 Glacier

XtreemStore ingests, organizes, and writes data directly as objects - preserving the native structure of modern S3 workflows and delivering true Glacier semantics on-premises.

Why Object-Native Deep Archive

  • Preserves S3 Glacier semantics end-to-end
  • No protocol translation overhead
  • Direct integration with lifecycle policies
  • Simplified data path to tape
  • Native S3 workflows remain unchanged

XtreemStore delivers true on-premises S3 Glacier - not a translation layer on top of other storage.

Why Gateway-Based Archives Add Complexity

  • x Additional protocol translation layers
  • x Extra metadata processing overhead
  • x More operational failure points
  • x Performance tied to gateway scaling

XtreemStore eliminates the gateway layer entirely by writing objects directly to archive media - preserving native S3 workflows without translation overhead.

Built to Work With Any Infrastructure

Any Tape Library

Supports all major LTO libraries, multi-library deployments, mixed-vendor environments. Tape pools can be dynamically assigned and expanded across libraries.

Any Disk Cache or Storage Platform

NVMe cache, SAN storage, NAS storage, distributed cache nodes. Functions as high-speed ingest buffer, usable near-line tier, and staging environment tuned for tape streaming.

Any Glacier-Compatible Workflow

Works directly with Ceph lifecycle policies, VAST Glacier integration, backup platforms, research pipelines, and custom S3 workflows using standard S3 Glacier semantics.

Scale Without Limits - To Exabytes

Linear scalability in both capacity and throughput - from terabytes to exabytes+ with no software-imposed limits.

Used in research archives, AI pipelines, backup retention systems, and long-term compliance storage environments.

Scale Up - Increase System Throughput

Add more tape drives, faster disk cache, or higher bandwidth networking.

Result: Higher sustained ingest and retrieval speeds.

Scale Out - Expand to Exabytes

Add additional nodes, libraries, and workers.

Result: Linear scalability. No architecture redesign required.

Deployed in environments managing hundreds of petabytes and scaling toward exabyte-class archives. Supports billions of objects across distributed archive infrastructures.

Typical deployments include multi-library environments supporting hundreds of petabytes of archived data across geographically distributed sites.

Designed to support billions of objects across distributed archive environments with linear scalability across drives, nodes, and libraries.

Policy-Driven Lifecycle Management

Intelligent Data Flow Pipeline

Intelligent Data Flow PipelinePure Object Pipeline - No Filesystem Namespace RequiredS3 IngestS3Standard S3Glacier APIDisk CacheHigh-speed cache+ near-line storageNVMe / SAN / NASContainerization(TAR / LTFS)Metadata-awaregroupingParallel TapeStreamingSequential writeto any LTO tapeReplicationUp to 4 copies+ multi-sitePolicy-driven:Grouping rulesContainer sizeRetentionPlacementReplicationXtreemStore - Object-native S3 Glacier Deep ArchiveScales Linearly to Exabytes+ - Capacity and Throughput

Policies define grouping rules, container size, movement timing, and storage placement. Tape operations follow configurable lifecycle rules.

Multi-Copy Protection and Geo-Replication

Up to Four Independent Tape Copies

Distribution across libraries or locations with independent durability zones for maximum redundancy.

Multi-Site Replication

Asynchronous replication to secondary XtreemStore clusters supporting distributed archive environments and disaster recovery workflows.

Metadata-Aware Data Organization

Objects can be grouped using project identifiers, dataset structure, metadata tags, or retention classification. This enables logical data segmentation, efficient tape utilization, and simplified compliance workflows.

High-Throughput Parallel Archiving

Objects are grouped into large optimized TAR containers for sustained streaming performance. Each LTO drive delivers sustained streaming performance in the hundreds of MB/s range, enabling aggregate throughput to scale linearly across multiple drives and nodes.

Hardware Freedom - No Vendor Lock-In

Choose any tape vendor, any library model, any disk infrastructure. Change hardware at any time without rewriting your archive. XtreemStore integrates - it does not replace your existing storage platforms.

Open Formats for Long-Term Durability

XtreemStore writes standard TAR containers on LTFS tape format. Your data remains readable without XtreemStore or any proprietary dependencies - protecting long-term data ownership.

Supports LTO-6 through current LTO generations, enabling long-term infrastructure compatibility and upgrade flexibility.

Designed for Layered Storage Architectures

XtreemStore Reference Architecture - Layered Storage: Performance tier to XtreemStore Object Archive to Tape Archive with benefits callouts

This architecture optimizes cost for cold data, preserves performance where needed, and enables independent scaling of each tier.

Built for Real Enterprise Workloads

Life Sciences

Genomics, proteomics, imaging

Medical Archives

PACS, DICOM with compliance requirements

Media & Entertainment

Large content libraries with metadata-driven access

Research & Government

Long-term retention and regulatory storage

Advanced Capabilities

Bucket-Based Archive Management

Native S3 buckets with bucket-level policy control

Tape Spanning

Seamless multi-tape support for large objects

Distributed Architecture

Modular services for scalable deployments

Air-Gapped Security

Offline tape isolation and ransomware protection

Operational Monitoring & Visibility

Real-time system visibility, job tracking, and tape activity monitoring provide operational transparency and simplified management

XtreemStore Architecture at a Glance

  • Pure object-native deep archive
  • True on-premises S3 Glacier semantics
  • No S3-to-file translation layer
  • Supports any tape library vendor
  • Up to 4 independent tape copies
  • Multi-site replication support
  • Open TAR / LTFS data format
  • Linear scalability to exabytes+

Comparison Snapshot - Modern Archive Architecture

CapabilityXtreemStoreBundled AppliancesFilesystem Gateways
Pure Object-Native ArchiveLimitedNo
Works With Any Tape LibraryNoOften limited
Works With Any Cache StorageRestrictedVaries
Up to 4 Tape Copies + Geo-ReplicationLimitedUsually not native
Open TAR / LTFS FormatsOften proprietaryNo
No Filesystem Namespace LimitsN/ACreates limits
Scales Linearly to ExabytesHigh cost scalingChallenges at scale
Requires S3-to-File Translation LayerNOOftenYes
Hardware / Vendor Lock-InNONEHighMedium-High

A True On-Prem Glacier Platform - Built for the Future

A New Class of Archive - Where Data Remains Usable Without Recall

XtreemStore delivers object-native architecture, hardware independence, open data formats, exabyte scalability, and predictable long-term cost - without filesystem complexity, vendor lock-in, or cloud dependency. Traditional archives store data. Intelligent archives preserve relationships - enabling predictable retrieval and lifecycle-driven placement.

Archive architecture remains flexible and avoids hardware lock-in, supporting billions of objects across mixed-generation LTO environments.

Frequently Asked Questions

Is my data readable without XtreemStore?

Yes. Data is written in open TAR and LTFS formats, ensuring long-term readability independent of proprietary software.

Can I mix tape vendors and libraries?

Yes. XtreemStore supports multi-vendor and multi-library environments with dynamic pool management.

How does XtreemStore compare to cloud Glacier?

It provides equivalent S3 Glacier semantics and lifecycle policies - without egress fees, unpredictable retrieval costs, or loss of infrastructure control.

How large can XtreemStore scale?

From terabytes to exabytes and beyond. Capacity growth is driven by your infrastructure - not software limits. It handles billions of objects with linear scaling.

Does XtreemStore require proprietary connectors?

No. XtreemStore integrates using standard S3 Glacier-compatible APIs and lifecycle policies. No proprietary translation layers or custom connectors required.

Can I integrate with existing Ceph or VAST environments?

Yes. XtreemStore integrates directly into existing lifecycle workflows without requiring architecture redesign. Your applications continue using standard S3 Glacier API calls.

Does XtreemStore require a filesystem layer?

No. XtreemStore operates as a pure object-native archive platform and does not depend on filesystem namespaces or inode structures.

Ready to See XtreemStore in Action?

See how your environment can scale to exabytes while preserving modern S3 workflows and maintaining complete control.

Learn how metadata-driven grouping makes deep archive data usable - even from tape.

This technical guide explains how metadata-driven grouping enables deterministic archive layout, faster retrieval, and scalable deep archive design.