Cookies and Privacy

We use technology on our website to collect information that helps us enhance your experience and understand what information is most useful to visitors.
By clicking "I ACCEPT," you agree to the terms of our privacy policy.

Cirata applies the highest standards to its use of data and its compliance with data-protection regulations across our marketing and website. Our Data Protection Officer can be contacted at DPO@cirata.com. You can change your cookie settings at any time via the link in our footer.

Cookie Setting

Data Migrator extension

Data Migrator extension

Fully automated, zero-downtime or risk, continuous petabyte scale data migration between data environments.

Built by Cirata | Tech Preview

Built for the multi-storage enterprise

Every cloud provider built a migration tool that works inside their own walls. None of these solutions move data between vendors, and none of them keep the source live while doing it. Data Migrator is the vendor-neutral mobility layer that sits in the middle, one extension, one operator surface, every storage system.


Source from
HDFS · Amazon S3 · Azure Data Lake Storage Gen2 · Google Cloud Storage · Ceph · any S3-compatible storage


Target to
Any of the above. Either direction. Cross-cloud, cross-vendor, or on-prem-to-cloud, with the source still in production use.

Built for the multistorage enterprise

What it does

  • Migrate live data, applications keep writing to the source for the entire duration of the move, no maintenance window required.
  • Move data between any pair of HDFS, Amazon S3, Azure Data Lake Storage Gen2, Google Cloud Storage, Ceph, and S3-compatible storage.
  • Detect source changes continuously via HDFS inotify, S3 event streams (SQS or Kafka), or periodic re-scan.
  • Apply declarative exclusions and path mappings once, reuse them across every migration.
  • Prove every byte transferred with a built-in verification framework that reports per-path pass/fail with reason codes.
What it does screenshot 1
What it does screenshot 2

Use cases

Data modernization

Move petabytes off CDP or vanilla HDFS onto cloud object storage without the months-long read-only window that has stalled the project for years.

Multi-cloud repatriation

Move data between clouds, or back on-prem, without bespoke cross-vendor plumbing. The same tool moves AWS to Azure, Azure to GCP, or any cloud back to on-prem object storage.

Lakehouse hydration

Get enterprise file and object data into a cloud lakehouse landing zone, live, verified, and ready for catalog registration, without taking source applications offline.

Regulated DR for unstructured data

Answer DORA, Basel III, NYDFS, and APRA CPS 230 with rehearsable failover ceremonies, per-file verification reports, and full audit trails covering the unstructured estate, not just transactional data.


How it works

Connect

Register source and target filesystems with their native credential models — Kerberos, IAM role, service principal, access key.

Define

Pick paths, attach reusable exclusions and path mappings, choose continuous or one-time mode.

Migrate and verify

The migration runs live; the verification framework proves every transferred file matches the source, path by path.


Technical details

Supported storage HDFS, Amazon S3, Azure Data Lake Storage Gen2, Google Cloud Storage, Ceph, any S3-compatible storage, source or target
Migration modes Live continuous (source stays open for application writes) or one-time
Change detection HDFS inotify, S3 event streams (SQS or Kafka), periodic re-scan fallback
Authentication S3 access-key, IAM role, AWS profile, STS / IRSA; ADLS Shared Key or Service Principal (OAuth2); per-filesystem Kerberos UGI for HDFS
Path translation Declarative source-prefix → target-prefix mappings, applied across storage vendors and protocols
Verification Per-migration, on-demand or scheduled; pass/fail per file with reason codes; full lifecycle through UI, REST, CLI, and MCP
Deployment Distributed + hybrid environments, horizontally scalable via Symphony WorkPartitioner
Time-to-value Minutes to first migration; no agents on source or target, no cluster of your own to operate

Other Cirata Symphony extensions

Cirata Symphony Pulse extension
Control, understand, coordinate, and automate your data estate using simple prompts.
Observability extension
See your entire Cirata Symphony estate as one OpenTelemetry stream, every extension, every signal, into any backend you already run.
Intelligence extension
Connect your data estate to any AI model, without writing a line of integration code.
Orchestration extension
A vendor-neutral control plane for workflow orchestration on Cirata Symphony. Orchestration connects to the workflow engines you already run, behind one unified Orchestrator interface.
Ice Flow extension
Manage data in Iceberg-native, open standard formats, between any pair of catalogs, across vendors, clouds, and on-premises, without compromise or lock-in.
Canon extension
Replicate Kafka across any vendor mix, with offsets every instance agrees on, schemas that travel with records, and failover you can actually rehearse.

Want to see the Cirata Symphony Data Migrator extension in action?

Let us show you how the Cirata Symphony Data Migrator extension is a revolutionary new data extension to fit your organizational needs.
Schedule a direct 1:1 demo with our CTO, Paul Scott-Murphy.