23 KiB
Amazon Relational Database Service
- TL;DR
- Storage
- Parameter Groups
- Option Groups
- Backup
- Restore
- Encryption
- Operations
- Troubleshooting
- Further readings
TL;DR
CLI usage
# Show RDS instances.
aws rds describe-db-instances
aws rds describe-db-instances --output 'json' --query "DBInstances[?(DBInstanceIdentifier=='master-prod')]"
# Show Parameter Groups.
aws rds describe-db-parameters --db-parameter-group-name 'default.postgres15'
# Create parameter Groups.
aws rds create-db-parameter-group --db-parameter-group-name 'pg15-source-transport-group' \
--db-parameter-group-family 'postgres15' --description 'Parameter group with transport parameters enabled'
# Modify Parameter Groups.
aws rds modify-db-parameter-group --db-parameter-group-name 'pg15-source-transport-group' \
--parameters \
'ParameterName=pg_transport.num_workers,ParameterValue=4,ApplyMethod=pending-reboot' \
'ParameterName=pg_transport.timing,ParameterValue=1,ApplyMethod=pending-reboot' \
'ParameterName=pg_transport.work_mem,ParameterValue=131072,ApplyMethod=pending-reboot' \
'ParameterName=shared_preload_libraries,ParameterValue="pg_stat_statements,pg_transport",ApplyMethod=pending-reboot' \
'ParameterName=max_worker_processes,ParameterValue=24,ApplyMethod=pending-reboot'
# Restore instances from snapshots.
aws rds restore-db-instance-from-db-snapshot \
--db-instance-identifier 'myNewBbInstance' \
--db-snapshot-identifier 'myOldDbSnapshot'
# Start export tasks.
aws rds start-export-task \
--export-task-identifier 'db-finalSnapshot-2024' \
--source-arn 'arn:aws:rds:eu-west-1:012345678901:snapshot:db-prod-final-2024' \
--s3-bucket-name 'backups' \
--iam-role-arn 'arn:aws:iam::012345678901:role/CustomRdsS3Exporter' \
--kms-key-id 'arn:aws:kms:eu-west-1:012345678901:key/abcdef01-2345-6789-abcd-ef0123456789'
# Get export tasks' status.
aws rds describe-export-tasks
aws rds describe-export-tasks --export-task-identifier 'my-snapshot-export'
# Cancel tasks.
aws rds cancel-export-task --export-task-identifier 'my_export'
Read replicas can be promoted to standalone DB instances.
See Working with DB instance read replicas.
Disk free metrics are available in CloudWatch.
One can choose any of the following retention periods for instances' Performance Insights data:
- 7 days (default, free tier).
- n months, where n is a number from 1 to 24.
In CLI and IaC, this number must be n*31.
Storage
Refer Amazon RDS DB instance storage.
When selecting General Purpose SSD or Provisioned IOPS SSD, RDS automatically stripes storage across multiple volumes to enhance performance depending on the engine selected and the amount of storage requested:
| DB engine | Storage size | Number of volumes provisioned |
|---|---|---|
| Db2 | Less than 400 GiB | 1 |
| Db2 | 400 to 65,536 GiB | 4 |
| MariaDB MySQL PostgreSQL |
Less than 400 GiB | 1 |
| MariaDB MySQL PostgreSQL |
400 to 65,536 GiB | 4 |
| Oracle | Less than 200 GiB | 1 |
| Oracle | 200 to 65,536 GiB | 4 |
| SQL Server | Any | 1 |
When modifying a General Purpose SSD or Provisioned IOPS SSD volume, it goes through a sequence of states.
While the volume is in the optimizing state, volume performance is between the source and target configuration
specifications.
Transitional volume performance will be no less than the lower of the two specifications.
When increasing allocated storage, increases must be by at least of 10%. Trying to increase the value by less than 10%
will result in an error.
The allocated storage cannot be increased when restoring RDS for SQL Server DB instances.
The allocated storage size of any DB instance cannot be lowered after creation.
Decrease the storage size of DB instances by creating a new instance with lower provisioned storage size, then migrate
the data into the new instance.
Use one of the following methods:
- Use the database engine's native dump and restore method. This will require long downtime.
- Consider using transportable DBs when dealing with
PostgreSQL DBs should the requirements match.
This will require some downtime. - Perform an homogeneous data migration using AWS's
DMS
This should require minimal downtime.
Parameter Groups
Refer Working with parameter groups.
Used to specify how a DB is configured.
Learn about available parameters by describing the existing default ones:
aws rds describe-db-parameters --db-parameter-group-name 'default.postgres15'
aws rds describe-db-parameters --db-parameter-group-name 'default.postgres15' \
--query "Parameters[?ParameterName=='shared_preload_libraries']" --output 'table'
aws rds describe-db-parameters --db-parameter-group-name 'default.postgres15' \
--query "Parameters[?ParameterName=='shared_preload_libraries'].ApplyMethod" --output 'text'
Option Groups
Used to enable and configure additional features and functionalities in a DB.
Backup
RDS backup storage for each Region is calculated from both the automated backups and manual DB snapshots for that
Region.
Moving snapshots to other Regions increases the backup storage in the destination Regions.
Backups are stored in S3.
Should one choose to retain automated backups when deleting DB instances, those backups are saved for the full retention
period; otherwise, all automated backups are deleted with the instance.
After automated backups are deleted, they cannot be recovered.
Should one choose to have RDS create a final DB snapshot before deleting a DB instance, one can use that or previously created manual snapshots to recover it.
Automatic backups
Automatic backups are storage volume snapshots of entire DB instances.
Automatic backups are enabled by default.
Setting the backup retention period to 0 disables them, setting it to a nonzero value (re)enables them.
Enabling automatic backups takes the affected instances offline to have a backup created immediately.
It will cause outages.
Automatic backups occur daily during the instances' backup window, configured in 30 minute periods. Should backups require more time than allotted to the backup window, they will continue after the window ends and until they finish.
Backups are retained for up to 35 days (backup retention period).
One can recover DB instances to any point in time from the backup retention period.
The backup window can't overlap with the weekly maintenance window for DB instance or Multi-AZ DB cluster.
During automatic backup windows storage I/O might be suspended briefly while the backup process initializes.
Initialization typically takes up to a few seconds. One might also experience elevated latencies for a few minutes
during backups for Multi-AZ deployments.
For MariaDB, MySQL, Oracle and PostgreSQL Multi-AZ deployments, I/O activity isn't suspended on the primary instance as
the backup is taken from the standby.
Automated backups might occasionally be skipped if instances or clusters are running heavy workloads at the time backups
are supposed to start.
DB instances must be in the available state for automated backups to occur.
Automated backups don't occur while DB instances are in other states (i.e., storage_full).
Automated backups aren't created while a DB instance or cluster is stopped.
RDS doesn't include time spent in the stopped state when the backup retention window is calculated. This means backups
can be retained longer than the backup retention period if a DB instance has been stopped.
Automated backups don't occur while a DB snapshot copy is running in the same AWS Region for the same database.
Manual backups
Back up DB instances manually by creating DB snapshots.
The first snapshot contains the data for the full database. Subsequent snapshots of the same database are incremental.
One can copy both automatic and manual DB snapshots, but only share manual DB snapshots.
Manual snapshots never expire and are retained indefinitely.
One can store up to 100 manual snapshots per Region.
Export snapshots to S3
One can export DB snapshot data to S3 buckets.
RDS spins up an instance from the snapshot, extracts data from it and stores the data in Apache Parquet format.
By default all data in the snapshots is exported, but one can specify specific sets of databases, schemas, or tables
to export.
-
The export process runs in the background and does not affect the performance of active DB instances.
-
Multiple export tasks for the same DB snapshot cannot run simultaneously. This applies to both full and partial exports.
-
Exporting snapshots from DB instances that use magnetic storage isn't supported.
-
The following characters aren't supported in table column names:
, ; { } ( ) \n \t = (space) /Tables containing those characters in column names are skipped during export.
-
PostgreSQL temporary and unlogged tables are skipped during export.
-
Large objects in the data, like BLOBs or CLOBs, close to or greater than 500 MB will make the export fail.
-
Large rows close to or greater than 2 GB will make their table being skipped during export.
-
Data exported from snapshots to S3 cannot be restored to new DB instances.
-
The snapshot export tasks require a role with write-access permission to the destination S3 bucket:
{ "Version": "2012-10-17", "Statement": [{ "Effect": "Allow", "Action": "sts:AssumeRole", "Principal": { "Service": "export.rds.amazonaws.com" } }] }{ "Version": "2012-10-17", "Statement": [{ "Effect": "Allow", "Action": [ "s3:PutObject*", "s3:ListBucket", "s3:GetObject*", "s3:DeleteObject*", "s3:GetBucketLocation" ], "Resource": [ "arn:aws:s3:::bucket", "arn:aws:s3:::bucket/*" ] }] }
After the export, one can analyze the data directly through Athena or Redshift Spectrum.
In the Console
The Export to Amazon S3 console option appears only for snapshots that can be exported to Amazon S3.
Snapshots might not be available for export because of the following reasons:
- The DB engine isn't supported for S3 export.
- The DB instance version isn't supported for S3 export.
- S3 export isn't supported in the AWS Region where the snapshot was created.
Using the CLI
# Start new tasks.
$ aws rds start-export-task \
--export-task-identifier 'db-finalSnapshot-2024' \
--source-arn 'arn:aws:rds:eu-west-1:012345678901:snapshot:db-prod-final-2024' \
--s3-bucket-name 'backups' --s3-prefix 'rds' \
--iam-role-arn 'arn:aws:iam::012345678901:role/CustomRdsS3Exporter' \
--kms-key-id 'arn:aws:kms:eu-west-1:012345678901:key/abcdef01-2345-6789-abcd-ef0123456789'
{
"ExportTaskIdentifier": "db-finalSnapshot-2024",
"IamRoleArn": "arn:aws:iam::012345678901:role/CustomRdsS3Exporter",
"KmsKeyId": "arn:aws:kms:eu-west-1:012345678901:key/abcdef01-2345-6789-abcd-ef0123456789",
"PercentProgress": 0,
"S3Bucket": "backups",
"S3Prefix": "rds",
"SnapshotTime": "2024-06-17T09:04:41.387000+00:00",
"SourceArn": "arn:aws:rds:eu-west-1:012345678901:snapshot:db-prod-final-2024",
"Status": "STARTING",
"TotalExtractedDataInGB": 0
}
# Get tasks' status.
$ aws rds describe-export-tasks
$ aws rds describe-export-tasks --export-task-identifier 'db-finalSnapshot-2024'
$ aws rds describe-export-tasks --query 'ExportTasks[].WarningMessage' --output 'yaml'
# Cancel tasks.
$ aws rds cancel-export-task --export-task-identifier 'my_export'
{
"Status": "CANCELING",
"S3Prefix": "",
"ExportTime": "2019-08-12T01:23:53.109Z",
"S3Bucket": "DOC-EXAMPLE-BUCKET",
"PercentProgress": 0,
"KmsKeyId": "arn:aws:kms:AWS_Region:123456789012:key/K7MDENG/bPxRfiCYEXAMPLEKEY",
"ExportTaskIdentifier": "my_export",
"IamRoleArn": "arn:aws:iam::123456789012:role/export-to-s3",
"TotalExtractedDataInGB": 0,
"TaskStartTime": "2019-11-13T19:46:00.173Z",
"SourceArn": "arn:aws:rds:AWS_Region:123456789012:snapshot:export-example-1"
}
Restore
DB instances can be restored from DB snapshots.
Restoring instances from snapshots requires the new instances to have equal or more allocated storage than what the
original instance had allocated at the time the snapshot was taken.
aws rds restore-db-instance-from-db-snapshot \
--db-instance-identifier 'myNewDbInstance' \
--db-snapshot-identifier 'myDbSnapshot'
Encryption
RDS automatically integrates with AWS KMS for key management.
By default, RDS uses the RDS AWS managed key (aws/rds) for encryption.
This key can't be managed, rotated, nor deleted by users.
RDS will automatically put databases into a terminal state when access to the KMS key is required but the key has been
disabled or deleted, or its permissions have been somehow revoked.
This change could be immediate or deferred depending on the use case that required access to the KMS key.
In this terminal state, DB instances are no longer available and their databases' current state can't be recovered. To
restore DB instances, one must first re-enable access to the KMS key for RDS, and then restore the instances from their
latest available backup.
Operations
PostgreSQL
Reduce allocated storage by migrating using transportable databases
Refer Migrating databases using RDS PostgreSQL Transportable Databases, Transporting PostgreSQL databases between DB instances and Transport PostgreSQL databases between two Amazon RDS DB instances using pg_transport.
When the transport begins, all current sessions on the source database are ended and the DB is put in ReadOnly mode.
Only the specific source database that is being transported is affected. Others are not affected.The in-transit database will be inaccessible on the target DB instance for the duration of the transport.
During transport, the target DB instance cannot be restored to a point in time, as the transport is not transactional and does not use the PostgreSQL write-ahead log to record changes.
A test transfer of a ~350 GB database between two db.t4g.medium instances using gp3 storage took FIXME minutes.
Requirements
-
A source DB to copy from.
-
A target instance to copy the DB to.
Since the transport will create the DB on the target, the target instance must not contain the database that needs to be transported.
Should the target contain the DB already, it will need to be dropped beforehand. -
The transported DB (but not other DBs on the same source instance) to:
- Be put in Read Only mode.
- Have all installed extensions removed.
To avoid locking the operator's machine for the time needed by the transport, it is suggested the use of an EC2 instance in the middle to operate on both DBs.
Try and keep the DBs identifiers under 22 characters.
PostgreSQL will try and truncate the identifier after 63 characters, and AWS will add something like.{{12-char-id}}.{{region}}.rds.amazonaws.comto it.
Procedure
-
Enable the required configuration parameters and
pg_transportextension on the source and target RDS instances.
Create a new RDS Parameter Group or modify the existing one used by the source.Required parameters:
shared_preload_librariesmust includepg_transport.pg_transport.num_workersmust be tuned.
Its value determines the number oftransport.send_fileworkers that will be created in the source. Defaults to 3.max_worker_processesmust be at least (3 *pg_transport.num_workers) + 9.
Required on the destination to handle various background worker processes involved in the transport.pg_transport.work_memcan be tuned.
Specifies the maximum memory to allocate to each worker. Defaults to 131072 (128 MB) or 262144 (256 MB) depending on the PostgreSQL version.pg_transport.timingcan be set to1.
Specifies whether to report timing information during the transport. Defaults to 1 (true), meaning that timing information is reported.
-
Reboot the instances equipped with the Parameter Group to apply changes.
-
Create a new target instance with the required allocated storage.
-
Make sure the middleman can connect to both DBs.
-
Make sure the target DB instance can connect to the source.
-
Prepare both source and target DBs:
-
Connect to the DB:
psql -h 'source-instance.5f7mp3pt3n6e.eu-west-1.rds.amazonaws.com' -p '5432' -U 'admin' --password psql -h 'target-instance.5f7mp3pt3n6e.eu-west-1.rds.amazonaws.com' -p '5432' -U 'admin' --password 'postgres' -
Remove all extensions but
pg_transportfrom the public schema of the DB instance.
Only thepg_transportextension is allowed during the actual transport operation.SELECT "extname" FROM "pg_extension"; DROP EXTENSION "plpgsql", "postgis", "…" CASCADE; -
Install the
pg_transportextension if missing:CREATE EXTENSION "pg_transport";
-
-
[optional] Test the transport by running the
transport.import_from_serverfunction on the target DB instance:-- Keep arguments in *single* quotes here SELECT transport.import_from_server( 'source-instance.5f7mp3pt3n6e.eu-west-1.rds.amazonaws.com', 5432, 'admin', 'source-user-password', 'mySourceDb', 'target-user-password', true ); -
Run the transport by running the
transport.import_from_serverfunction on the target DB instance:SELECT transport.import_from_server( …, …, …, …, …, …, false); -
Validate the data in the target.
-
Add all the needed roles and permissions to the target.
-
Restore uninstalled extensions in the public schema of both DB instances.
pg_transportcan be uninstalled. -
Revert the value of the max_worker_processes parameter.
If the target DB instance has automatic backups enabled, a backup is automatically taken after transport completes.
Point-in-time restores will be available for times after the backup finishes.
Should the transport fail, the pg_transport extension will attempt to undo all changes to the source and target DB
instances. This includes removing the destination's partially transported database.
Depending on the type of failure, the source database might continue to reject write-enabled queries. Should this
happen, allow write-enabled queries manually:
ALTER DATABASE db-name SET default_transaction_read_only = false;
Troubleshooting
ERROR: extension must be loaded via shared_preload_libraries
Refer How can I resolve the "ERROR: <module/extension> must be loaded via shared_preload_libraries" error?
Further readings
- Working with DB instance read replicas
- Working with parameter groups
- How can I resolve the "ERROR: <module/extension> must be loaded via shared_preload_libraries" error?
Sources
- Pricing and data retention for Performance Insights
- Introduction to backups
- Restoring from a DB snapshot
- AWS KMS key management
- Amazon RDS DB instance storage
- How can I decrease the total provisioned storage size of my Amazon RDS DB instance?
- What is AWS Database Migration Service?
- Migrating databases to their Amazon RDS equivalents with AWS DMS
- Transporting PostgreSQL databases between DB instances
- Migrating databases using RDS PostgreSQL Transportable Databases