Cirata - Data Migrator extension | Cirata Symphony Extensions

Built for the multi-storage enterprise

Every cloud provider built a migration tool that works inside their own walls. None of these solutions move data between vendors, and none of them keep the source live while doing it. Data Migrator is the vendor-neutral mobility layer that sits in the middle, one extension, one operator surface, every storage system.

Source from
HDFS · Amazon S3 · Azure Data Lake Storage Gen2 · Google Cloud Storage · Ceph · any S3-compatible storage

Target to
Any of the above. Either direction. Cross-cloud, cross-vendor, or on-prem-to-cloud, with the source still in production use.

What it does

Migrate live data, applications keep writing to the source for the entire duration of the move, no maintenance window required.
Move data between any pair of HDFS, Amazon S3, Azure Data Lake Storage Gen2, Google Cloud Storage, Ceph, and S3-compatible storage.
Detect source changes continuously via HDFS inotify, S3 event streams (SQS or Kafka), or periodic re-scan.
Apply declarative exclusions and path mappings once, reuse them across every migration.
Prove every byte transferred with a built-in verification framework that reports per-path pass/fail with reason codes.

Use cases

Data modernization

Move petabytes off CDP or vanilla HDFS onto cloud object storage without the months-long read-only window that has stalled the project for years.

Multi-cloud repatriation

Move data between clouds, or back on-prem, without bespoke cross-vendor plumbing. The same tool moves AWS to Azure, Azure to GCP, or any cloud back to on-prem object storage.

Lakehouse hydration

Get enterprise file and object data into a cloud lakehouse landing zone, live, verified, and ready for catalog registration, without taking source applications offline.

Regulated DR for unstructured data

Answer DORA, Basel III, NYDFS, and APRA CPS 230 with rehearsable failover ceremonies, per-file verification reports, and full audit trails covering the unstructured estate, not just transactional data.

How it works

Connect

Register source and target filesystems with their native credential models — Kerberos, IAM role, service principal, access key.

Define

Pick paths, attach reusable exclusions and path mappings, choose continuous or one-time mode.

Migrate and verify

The migration runs live; the verification framework proves every transferred file matches the source, path by path.

Technical details

Supported storage HDFS, Amazon S3, Azure Data Lake Storage Gen2, Google Cloud Storage, Ceph, any S3-compatible storage, source or target

Migration modes Live continuous (source stays open for application writes) or one-time

Change detection HDFS inotify, S3 event streams (SQS or Kafka), periodic re-scan fallback

Authentication S3 access-key, IAM role, AWS profile, STS / IRSA; ADLS Shared Key or Service Principal (OAuth2); per-filesystem Kerberos UGI for HDFS

Path translation Declarative source-prefix → target-prefix mappings, applied across storage vendors and protocols

Verification Per-migration, on-demand or scheduled; pass/fail per file with reason codes; full lifecycle through UI, REST, CLI, and MCP

Deployment Distributed + hybrid environments, horizontally scalable via Symphony WorkPartitioner

Time-to-value Minutes to first migration; no agents on source or target, no cluster of your own to operate

Other Cirata Symphony extensions

Cirata Symphony Pulse extension

Control, understand, coordinate, and automate your data estate using simple prompts.

view details »

Observability extension

See your entire Cirata Symphony estate as one OpenTelemetry stream, every extension, every signal, into any backend you already run.

view details »

Intelligence extension

Connect your data estate to any AI model, without writing a line of integration code.

view details »

Orchestration extension

A vendor-neutral control plane for workflow orchestration on Cirata Symphony. Orchestration connects to the workflow engines you already run, behind one unified Orchestrator interface.

view details »

Ice Flow extension

Manage data in Iceberg-native, open standard formats, between any pair of catalogs, across vendors, clouds, and on-premises, without compromise or lock-in.

view details »

Canon extension

Replicate Kafka across any vendor mix, with offsets every instance agrees on, schemas that travel with records, and failover you can actually rehearse.

view details »

Data Migrator extension

Data Migrator extension

Built for the multi-storage enterprise

What it does

Use cases

Data modernization

Multi-cloud repatriation

Lakehouse hydration

Regulated DR for unstructured data

How it works

Technical details

Other Cirata Symphony extensions

Want to see the Cirata Symphony Data Migrator extension in action?