Loading Data with Redshift's COPY Command

Redshift's COPY command is a powerful tool for loading data into a Redshift cluster. It allows you to quickly and easily load data from a variety of sources, including files, databases, and other data stores. It also supports a wide range of data formats, making it a great choice for loading data into Redshift.

What is the COPY Command?

The COPY command is a SQL command used to load data into a Redshift cluster. It supports a wide range of data formats, including CSV, JSON, Avro, Parquet, and ORC. It also supports loading data from a variety of sources, including files, databases, and other data stores. It is a fast and efficient way to load data into Redshift.

How to Use the COPY Command

Using the COPY command is relatively straightforward. The basic syntax is as follows:


COPY [table_name]
FROM [source]
CREDENTIALS [credentials]
FORMAT [format]
[options]

The table_name is the name of the table you want to load data into. The source is the location of the data you want to load. The credentials are the credentials you need to access the data. The format is the format of the data you are loading. And the options are any additional options you want to specify.

For example, if you wanted to load a CSV file from an S3 bucket, you could use the following command:


COPY my_table
FROM 's3://my-bucket/my-file.csv'
CREDENTIALS 'aws_access_key_id=XXXXXXXXXXXXXXXXXXXX;aws_secret_access_key=XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX'
FORMAT CSV

Additional Info

The COPY command is a powerful tool for loading data into Redshift. It supports a wide range of data formats and sources, making it a great choice for loading data into Redshift. For more information, you can check out the official documentation here.

Want to build your own LLM Apps with AirOps👇👇