More about AWS Database
- AWS PostgreSQL: Managed or Self-Managed?
- MySQL Database Migration: Amazon EC2-Hosted vs. Amazon RDS
- AWS MySQL: MySQL as a Service vs. Self Managed in the Cloud
- Database Case Studies with Cloud Volumes ONTAP
- Amazon DocumentDB: Basics and Best Practices
- AWS NoSQL: Choosing the Best Option for You
- AWS Database as a Service: 8 Ways to Manage Databases in AWS
- AWS Database Migration Service: Copy-Paste Your Database to Amazon
- AWS Oracle RDS: Running Your First Oracle Database on Amazon
- SQL Server in AWS: Two Deployment Options
- DynamoDB Pricing: How to Optimize Usage and Reduce Costs
- AWS Database Services: Finding the Right Database for You
- Overcome Amazon RDS Instance Size Limits with Data Tiering to Amazon S3
Of Amazon’s 212 cloud computing services, fifteen are purpose-built database engines. In this post, we focus on eight AWS database services: Amazon RDS, Amazon Aurora, Amazon DynamoDB, Amazon DocumentDB, Amazon ElastiCache, Amazon Neptune, Amazon Timestream, and Amazon Quantum Ledger Database (QLDB).
We offer a review of the functions, feature, capabilities and use cases of each of these services, to help you choose the database that is a perfect match for your use case and project. In addition, we’ll show how NetApp Cloud Volumes ONTAP can help solve AWS database migration and management challenges.
This is part of our series of comprehensive guides on cloud storage technology.
In this article, you will learn:
- Types of databases on AWS
- AWS database services
- AWS migration services
- AWS database management with Cloud Volumes ONTAP
Types of Database Services Offered on AWS
AWS offers a wide range of database services for you to choose from. The service fall into two groups: relational and non-relational (NoSQL).
AWS Relational Database Services
Relational databases store data in tabular form with columns and rows, and can be queried using the SQL query language. In these databases, columns represent attributes and rows represent records. Each field in the table is a data value.
Use cases for relational databases on Amazon include:
- Enterprise resource planning (ERP) apps
- Customer relationship management (CRM) apps
- Finance data
- Data warehousing
The primary Amazon services providing relational databases are:
- Amazon Aurora
- Amazon RDS
- Amazon Redshift
AWS NoSQL Database Services
Relational databases are not suitable for many use cases, especially those requiring very high performance or dynamic scalability. NoSQL, or non-relational databases, break the paradigm of storing data in tables with columns and rows, allowing them to distribute and process data more efficiently. NoSQL is commonly used to handle big data - large volumes of unstructured or semi-structured data.
The table below shows the main NoSQL databases services offered by AWS.
Type of Database
Key-value databases store data as a collection of key-value pairs with the key as an ID. These databases can store various types of data, including simple and compound objects.
● Real-time bidding
● eCommerce shopping carts
● Product catalogs
● Customer preferences
● Amazon DynamoDB
Document databases store data in JSON or JSON-like documents. You can query data using the same document-model format used in programming applications.
● Content management systems
● Customer profiles and personalization
● Mobile apps
● Amazon DocumentDB
In-memory databases store data in-memory for low-latency access. You can use these stores as a database, cache, message broker, or queue.
● Session stores
● Geospatial services
● Pub/sub messaging
● Real-time streaming
● Amazon ElastiCache for Memcached
● Amazon ElastiCache for Redis
Graph databases are a type of NoSQL (non-relational) database. This database type represents relationships directly. You can query data with specific graph languages.
● Fraud detection
● Social networking
● Recommendation engines
● Knowledge graphs
● Data lineage
● Amazon Neptune
Time-series databases store data in time-order and as append-only. You can query data over various time intervals.
● Application monitoring
● Industrial telemetry
● IoT applications
● Amazon Timestream
Ledger databases store data in an immutable, transparent, and cryptographically verifiable log. This log is owned by a trusted central authority to ensure provenance.
● Insurance claims
● HR and payroll
● Retail inventories
●Amazon Quantum Ledger Database
Note: While we listed in-memory databases under the non-relational category, there are in-memory databases that are relational, SQL databases. A notable example is SAP HANA. You can run SAP HANA on AWS using the managed SAP HANA instance.
AWS Databases Services
Once you understand what options are available to you for databases in AWS, you can begin narrowing them down. The following services are some of the most commonly used. Keep in mind when reviewing these descriptions that frequently, AWS customers implement multiple database types to meet their needs. You too should consider multiple options if one doesn’t meet all of your needs.
Amazon RDS is a managed, relational database service that includes six different database options. These include AWS Oracle, PostgreSQL, AWS MySQL, MariaDB, SQL Server, and Amazon Aurora. You can manage these database engines from a centralized management console, a command-line interface, or via API calls. When using this service, many administrative tasks are automated, including database setup, hardware provisioning, backup, and updating.
Use cases of Amazon RDS include:
- Web and mobile applications—provides the scalability, availability, and throughput needed for enterprise-grade applications.
- eCommerce applications—provides flexibility, security, and PCI compliance needed for eCommerce.
- Mobile and online games—provides high-throughput and availability to ensure that games remain online and responsive to players.
Amazon Aurora is a fully managed relational database engine designed specifically for AWS. It is MySQL and PostgreSQL compatible with minor changes to your source database. Aurora includes features for self-healing, fault tolerance, point-in-time recovery, and continuous backup.
Use cases for Amazon Aurora include:
- Enterprise applications—including customer relationship management and enterprise resource planning solutions.
- Software as a Service (SaaS) offerings—including those requiring significant storage and compute scalability.
- Web and mobile gaming applications—including those requiring massive storage, high throughput, and high-availability.
Amazon DynamoDB is a fully managed, document and key-value database. It includes features for multi-master, multi-region used along with built-in security, automated backup and restoration, and in-memory caching. DynamoDB can provide support for serverless web apps, microservices, and mobile backends.
Use cases of Amazon DynamoDB include:
- Ad tech—including clickstreams, user events, and user profiles.
- Gaming—including leaderboards, player data stores, and game states.
- Retail—including online shopping carts, inventory tracking, and customer profiles.
- Banking and finance—including event-driven transaction processing, fraud detection, and change data capture.
- Media and entertainment—including digital rights management, user data stores, and metadata stores.
- Software as a service (SaaS)—including content metadata stores, metadata caches, and relationship graph data stores.
Amazon DocumentDB is a fully managed document database service. It is scalable, highly-available, and compatible with MongoDB. With it, you can store, index, and query JSON files. With DocumentDB, you can scale your compute and storage resources separately for maximum flexibility.
Use cases of Amazon DocumentDB include:
- Content and catalog management—including online publications, point-of-sale terminals, and digital archives.
- Profile management—including user preferences, authentication profiles, and online transactions.
- Mobile and web applications—including applications that demand high-performance and low-latency with millions of requests per second.
Amazon ElastiCache is a fully managed, in-memory data store service. It is compatible with both Redis and Memcached. ElastiCache automates setup, hardware provisioning, configuration, monitoring, updates, and backup and recovery processes. With ElastiCache you can scale both write and memory processes through sharding and data replication.
Use cases of Amazon ElastiCache include:
- Session stores—for web applications and sites.
- Gaming—including leaderboards and chats.
- Geospatial services—including real-time mapping and location.
- Real-time analytics—including Internet of things (IoT) sensor processing and AI applications.
Amazon Neptune is a fully managed graph database service. It enables you to create and run applications using highly-connected data sets. It supports the storage of massive relationship data sets with low-latency access. Neptune supports a variety of graph models and languages, including RDF, SPARQL, and Gremlin. It includes features for point-in-time recovery, read replicas, and continuous backup.
Use cases for Amazon Neptune include:
- Social networking—including user profiles and content prioritization.
- Recommendation engines—including storage of customer contacts, purchase histories, and customer preferences.
- Fraud detection—including fraud related to overlapping email addresses, IP addresses, or credit card numbers.
- Knowledge graphs—including product catalogs or wikis.
- Life sciences—including disease models, gene patterning, or research catalogs.
- Network and IT operations—including creating network visibility, monitoring, or forensic analysis.
Amazon Timestream is a fully managed, time-series database service. It enables you to store, process, and analyze up to 1,000X better query performance at 90% lower cost, compared to relational databases offered on AWS. Timestream provides automatic hardware provisioning, updates, setup and configuration, and data tiering.
Use cases for Amazon Timestream include:
- DevOps—supports performance monitoring and management, network optimization, and server monitoring.
- IoT applications—supports IoT analytics for the implementation of smart devices, such as thermostats or motion sensors.
- Application monitoring—supports clickstream monitoring and analysis.
- Industrial telemetry—including monitoring of industrial equipment, fleet management, trade monitoring, or route optimization.
Amazon Quantum Ledger Database (QLDB)
Amazon (QLDB) is a fully managed, serverless ledger database service. You can use it to track application data changes with a verifiable history. With QLDB, you can avoid the need to build custom ledger applications and associated verification tools. You can query data in QLDB using a SQL-like API.
Use cases for Amazon QLDB include:
- Finance—including credit and debit transactions.
- Insurance—including claim transactions and auditing.
- HR and payroll—including employee benefits, performance histories, or certifications.
- Retail and supply chain management—including batch tracking, product recall processes, and shipping details.
AWS Database Migration Service
Once you’ve decided which services are best for your needs, you need to decide what data you want to migrate and how. Depending on the databases you are currently using, this can be as simple as creating a backup and importing that backup to the new database service. Or, it could require reformatting or re-scheming data.
To make migration easier, Amazon offers the AWS Database Migration Service. This service is free for the first six months of many AWS database services, including Aurora, DynamoDB, and Redshift. With this service, you can generally keep your databases operational during migration. This means minimal downtime and enables you to reduce revenue loss and disruptions to productivity.
Database Migration Service supports most commercial and open-source databases for easy transfer. This includes both homogeneous migrations, for example, SQL Server to SQL Server, or mixed migrations, for example, Oracle to Amazon Aurora. You can also use it to continuously replicate and consolidate databases to a data warehouse via Amazon S3 and RedShift.
AWS Database Migration Service Benefits
Using AWS Database Migration Service can provide several distinct benefits. The most notable benefits include:
Simple to use
Using this service doesn’t require the installation of applications or drivers. It also generally doesn’t require you to make changes to your existing databases. The service is easily launched and managed from the AWS Management Console and once started is fully managed.
To get started from the console, you only need to define your migration parameters in a new migration task. This involves setting up connections between databases and choosing replication instances. Then, you can use these parameters as a template to perform migration tests or multiple live migrations using the same settings.
Any changes you make during the migration process are automatically, and continuously replicated to your target service. This enables you to continue using existing databases up until you are ready to go live with your replacement services. You also have the option of leaving the sync operational indefinitely. This is particularly useful for mission-critical applications that you wish to create a failover for.
AWS Database Migration Service is free for the first six months (depending on the service you’re migrating to). You are only responsible for the compute resources and log storage used during migration. All inbound data transfers are free. Even compute and log storage costs are very low, however. For example, a terabyte-sized migration can generally be done for around USD $3.
For a custom analysis of what migration with this tool might cost, you can check out Amazon’s TCO calculator.
Supports widely used databases
AWS Database Migration Service can migrate data between most commonly used databases. You can use it to easily move databases between platforms or to migrate data between services. For example, you can move data from EC2 to Amazon RDS or from RDS to EC2. You can also use it to transfer data between database types, including NoSQL, SQL, and text-based stores.
When you use the Database Migration Service, your migration is performed in a highly resilient and self–healing way. The service achieves this by continually monitoring your source and target databases, network connectivity, and replication instances. This ensures everything runs smoothly.
However, if issues do arise, the service automatically detects the problem and takes action to correct it or notify you. Typically, this means automatically restarting the migration process from where the issue occurred.
AWS Database Management with Cloud Volumes ONTAP
NetApp Cloud Volumes ONTAP, the leading enterprise-grade storage management solution, delivers secure, proven storage management services on AWS, Azure and Google Cloud. Cloud Volumes ONTAP supports up to a capacity of 368TB, and supports various use cases such as file services, databases, DevOps or any other enterprise workload, with a strong set of features including high availability, data protection, storage efficiencies, Kubernetes integration, and more.
In particular, Cloud Volumes ONTAP helps in addressing database workloads challenges in the cloud, and filling the gap between your cloud-based database capabilities and the public cloud resources it runs on.
In this article, we offered an overview of eight AWS databases, including features, capabilities, and use cases. We also provided a brief guide about AWS migration services, and how you can leverage Cloud Volumes ONTAP to address the challenges of managing databases in the AWS cloud.
There’s a lot more to learn about AWS databases. To continue your research, take a look at the rest of our blogs on this topic:
AWS MySQL: Two Ways to Enjoy MySQL as a Service
You can set up MySQL on AWS on your own, or you can use MySQL databases as a service. The latter is offered as a service called AWS RDS, which enables you to run a managed database instance. Another option is to use Amazon Aurora, an elastic database service which is fully compatible with MySQL.
If you’re looking for information about RDS and Aurora for MySQL, check out our article: AWS MySQL: Two Ways to Enjoy MySQL as a Service.
AWS Oracle: How to Lift and Shift Your Oracle DB to Amazon
Many Oracle databases are still located on-premises, despite the difficulties of on-prem data management. If you are looking to migrate your Oracle database to AWS, there are two main AWS database services that can run Oracle databases: The Relational Database Service (RDS) and Amazon EC2.
Our article: AWS Oracle: How to Lift and Shift Your Oracle DB to Amazon, can help walk you through the process of migrating on-prem Oracle workloads to the cloud.
AWS Oracle RDS: Running Your First Oracle Database on Amazon
Amazon Relational Database Service (RDS) is a managed service for relational databases such as Oracle. You can use the AWS platform to backup, monitor, secure, configure, and scale your Oracle workloads.
This post reviews the features and licenses offered for relational Oracle databases on AWS RDS, and walks you through the process of creating an RDS DB instance.
DynamoDB Pricing: How to Optimize Usage and Reduce Costs
DynamoDB is an AWS database for NoSQL high-performance applications. DynamoDB is a great service for serverless web apps, mobile backends, microservices, and operations that need low-latency data access. Learn how DynamoDB Pricing works, including basic DynamoDB storage and architecture concepts. Discover tips for optimizing your DynamoDB costs.
AWS Database as a Service: 8 Ways to Manage Databases in AWS
AWS Database as a Service (DBaaS) offerings enable cloud users to leverage cloud-based databases without doing administrative infrastructure work. Each AWS DBaaS offers different types of uses, functionalities, and management levels. Learn about the eight types of managed databases offered on AWS, what is different about the AWS database offering, and read real life case studies.
SQL Server in AWS: Two Deployment Options
There are two primary ways to run SQL server in the Amazon cloud, each with its pros and cons. Managed database: deploying SQL server using Amazon RDS and self-managed: deploying SQL server on EC2 with EBS. Learn the details and pros and cons of each of these deployment options.
Amazon DocumentDB: Basics and Best Practices
DocumentDB is a managed service by Amazon Web Services (AWS) that provides access to the popular NoSQL database engine, MongoDB. Learn about the key features of DocumentDB, its architecture, and best practices for successfully deploying MongoDB on AWS.
AWS PostgreSQL: Managed or Self-Managed?
PostgreSQL is a high-performance, enterprise-class open-source relational database that supports both SQL (relational) and JSON (non-relational) queries. Learn about running AWS PostgreSQL using RDS, Aurora, or self-managing PostgreSQL in an EC2 compute instance.
AWS NoSQL: Choosing the Best Option for You
NoSQL databases enable you to store data with flexible schema and a variety of data models. These databases are relatively easy for developers to use, and have the high performance and functionality needed for modern applications. NoSQL AWS databases can hold large volumes of data while still providing low latency. Learn about 6 types of NoSQL Databases Offered on AWS, and leading AWS NoSQL database services, including DynamoDB, DocumentDB, ElastiCache and Neptune.
AWS Database Migration Service: Copy-Paste Your Database to Amazon
AWS Database Migration Service (DMS) is a service for migrating any data type from any data source to the AWS cloud. DMS supports migration to RedShift, Amazon RDS, Aurora, DocumentDB, and DynamoDB. You do not need to install agents or alter your source database to use the Database Migration Service. Instead, you can control the migration process from the AWS Management Console. Once started, migration is a managed service that is resilient and self-healing.
See Our Additional Guides on Key Cloud Storage Topics
We have authored in-depth guides on several other topics that can also be useful as you explore the world of cloud storage.
Learn the basics of storing data in Amazon Simple Storage Service (S3), Amazon’s first cloud service and still one of its most popular.
- S3 Storage: The Complete Guide
- S3 Pricing Made Simple: The Complete Guide
- AWS Certification Cheat Sheet for Amazon S3
Learn about storage solutions in the Microsoft Azure cloud, including object storage, block storage, and file storage solutions.
- Azure Data Catalog: Understanding Concepts and Use Cases
- Azure Table Storage: Cloud NoSQL for Dummies
- Azure Backup Service: 8 Ways to Backup with Azure
File shares support some of the most important workloads that enterprise businesses rely on, and the resources of the public cloud have created interesting new possibilities. Every major public cloud provider now offers its own cloud file sharing service, each with its own target workloads and considerations. But not every enterprise will find what they’re looking for in a fully managed, all-cloud service.
See top articles in our cloud file sharing guide:
- File Share Service Challenges in the Cloud
- Cloud File Sharing Services: Open-Source Solutions
- Cloud Availability Nightmares and How to Avoid Them in Cloud File Sharing
Multicloud strategies are becoming more popular as organizations seek to optimize their cloud services and deployments. These strategies can help you prevent vendor lock-in, increase your flexibility, and help you optimize costs.
This guide explains what multicloud storage is, how it works, what it’s used for, the core requirements for this storage, and how Cloud Volumes ONTAP supports it.
See top articles in our multicloud storage guide:
- One Cloud Out of Many: Why Enterprises Are Turning to Multicloud and Hybrid Cloud Architectures
- Multicloud Architecture: Partitioned, Cloud Burst and DR
- Multicloud Deployment: Creating a Plan With Cloud Volumes ONTAP
Snapshots are a common method for natively backing up cloud data and services. This method enables you to save point in time backups which can be restored when needed.
This guide explains what types of storage snapshots are available, what AWS snapshots are, and how to use AWS snapshots.
See top articles in our AWS snapshots guide:
- Azure and AWS Snapshots Deep Dive: Cloud Volumes Snapshots
- Snapshots Deep Dive: AWS Snapshots and Azure Snapshots
- Understanding AWS Snapshot Pricing: Data Transfer and Storage Costs
Azure provides a wide variety of services to its users to help you manage your cloud data and services reliably. Azure Backup is one such service that can help provide data loss protection and peace of mind.
This guide explains what Azure Backup is and how to use it to backup your Azure data.
See top articles in our Azure Backup guide:
- Storage Options for Lower Azure Storage Costs and Azure Backup Costs
- The 5 Enterprise-Grade Azure Features You Need to Know About: Azure Backup, Security, and More
- Using Azure Backup Server to Backup Workloads and Files to Azure
Azure File Storage
Storing file data in Azure is simple through Azure File Storage service. This service enables you to store files across cloud and on-premises resources, enabling you to flexibly and securely share data and workflows.
This guide explains what Azure File Storage is, common use cases for Files, management concepts and components of the service, how data is accessed and the architecture of the service, and some best practices for securing your data.
See top articles in our Azure file storage guide:
Azure Files is one of several storage services available to users in Azure. It is a service designed to replicate file shares like those commonly used on premises. With this service, you can smoothly transition your files to the cloud and allow file sharing across your teams.
This guide explains what Azure Files is, how it complements other storage services, pricing and use cases for Files, and pros and cons you should be aware of.
See top articles in our Azure Files guide:
- Azure NetApp Files Register
- SMB File Sharing
- NFS and SMB - A Simple File Service Environment in Azure
Nearly every production cloud deployment has one or more databases. These tools provide support for applications, enable workloads, and organize your data meaningfully. Having databases available that support all your needs is essential and Azure offers a range to choose from.
This guide explains what Azure database workloads are supported, how databases work in Azure, and what services are available.
See top articles in our Azure database guide:
- Azure Oracle: Your First Oracle Database on Azure
- Azure Database Migration Service: The Ultimate Guide
- Azure SQL Database: 18 Options for SQL Server on the Cloud
Google Cloud offers a variety of storage options for you to choose from. These services form the base of many other services in the cloud and understanding what your options are can help you manage your cloud more efficiently.
This guide explains what Google Cloud Storage options exist and their common uses.
See top articles in our Google Cloud storage guide:
- Cloud File Sharing Services: Google Cloud Filestore
- Google Cloud Storage Encryption: Key Management in Google Cloud
- Google Cloud Storage Pricing: Get the Best Bang for Your Buckets
Google Cloud’s specialty is flexibility and integration of services and this extends to its database services. In Google Cloud you have a wide variety of database deployments, models, and support to choose from.
This guide explains your options for deploying databases in the cloud, what Google Cloud database services are available, and how to choose the right service for you.
See top articles in our Google Cloud database guide:
- Google Cloud SQL: MySQL, Postgres and MS SQL on Google Cloud
- SQL Server on Google Cloud: Managed Service Vs. Managed Storage
- Google Cloud SQL Pricing, Quotas, and Limits: A Cheatsheet for Cost Optimization
Software developers and DevOps engineers are packaging applications into lightweight units called containers. Kubernetes helps manage and scale containers across clusters of physical machines.
In this environment, Kubernetes storage becomes a significant challenge. By default, containers are ephemeral, meaning that any transient data on the container is lost when it shuts down. However, Kubernetes provides several options for persistent storage.
See top articles in our Kubernetes guide: