Overview
If your data already lives in Parquet files, you can load it into Xata. This is a common path when your source database doesn’t expose a live PostgreSQL connection but can export snapshots to object storage instead. A typical flow is to export your database to Parquet files in an AWS S3 bucket, and then have Xata load those files into your branch.Export to Parquet in AWS S3
Both AWS Aurora and AWS RDS can export snapshots to Amazon S3 in Apache Parquet format. AWS writes the export as a set of Parquet files, one set per table, into the S3 bucket you choose.- AWS Aurora: Use Export to Amazon S3 on a snapshot or cluster. See the AWS Aurora export documentation.
- AWS RDS: Use Export to Amazon S3 on a snapshot. See the AWS RDS export documentation.
Import Parquet files into Xata
Importing Parquet files into Xata is currently handled on a case-by-case basis. If you’d like to import Parquet files — whether exported from AWS Aurora, AWS RDS, or another source — reach out to us and we’ll help you load them into your branch. When you contact us, it helps to share:- The location of your Parquet files (for example, the S3 bucket and prefix)
- The source database and how the export was produced (AWS Aurora, AWS RDS, or other)
- The approximate size and number of tables
-
A schema-only
pg_dumpof your source database:
Verification
After your data is loaded, verify it:-
Connect to Xata Branch:
-
Check Data Integrity: