AWS Data Services: A Mental Map
AWS has a lot of data services, and the names alone donβt tell you what each one is for. This post is a mental map β how the database, analytics, and data-movement services fit together, and how to pick between them. The mental map βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β DATABASES (operational, low latency) β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€ β RDS relational SQL β β DynamoDB NoSQL key-value, serverless β β ElastiCache in-memory cache (Redis/Memcached) β β DocumentDB MongoDB-compatible document DB β β Neptune graph DB (nodes + edges) β β Timestream time-series DB β β Managed Blockchain Hyperledger/Ethereum β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€ β ANALYTICS (batch, big data, querying) β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€ β Redshift data warehouse (columnar SQL at scale) β β EMR managed Hadoop/Spark cluster β β Athena SQL queries directly on S3 β β QuickSight BI dashboards/visualization β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€ β DATA MOVEMENT β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€ β Glue managed ETL (extract/transform/load) β β DMS Database Migration Service β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ Databases These are operational, low-latency stores β the databases an application reads and writes in real time....
AWS RDS: Relational Database Service
RDS is a managed SQL database. AWS runs the database server β patching, backups, failover β and you just use it. It is where the structured data of an application lives. What RDS is A managed relational database: you pick an engine, and AWS does the operations. Supported engines: MySQL, PostgreSQL, MariaDB, Oracle, SQL Server, and Aurora β AWSβs own MySQL/Postgres-compatible engine. You connect with standard SQL drivers (JDBC/ODBC) over a hostname and port....
AWS S3: Object Storage
S3 is object storage β durable, effectively infinite, and accessed over an HTTP API. It is the default place to put blobs in AWS: images, backups, logs, static assets, anything. The core pieces Piece What it is Bucket The top-level container for objects. Its name is globally unique, and it lives in one region. Object A blob plus metadata, addressed by a key. The thing you actually store. Key The objectβs βpathβ β e....
AWS ELB & ASG: Load Balancing and Auto Scaling
A single EC2 instance is a single point of failure. To make an app highly available and elastic β able to handle variable load and survive failures automatically β AWS gives you two services that work hand in hand: ELB and ASG. The two pieces Service What it does Question it answers ELB Distributes incoming traffic across multiple EC2s βHow do users reach my fleet?β ASG Adds, removes, and replaces EC2s based on demand or health βHow big should my fleet be?...
AWS EC2: Instance Storage
Once an EC2 instance is running, it needs somewhere to put data. AWS gives you three kinds of disk to attach to a VM, and each one behaves very differently. The options Type Storage Lifecycle Scope EBS Block, network-attached Persistent One AZ Instance Store Block, physically on the host Ephemeral β lost on stop/terminate One host EFS File (NFS), network share Persistent Multi-AZ Mental model EC2 ββattachesββ> EBS volume (its own dedicated disk, persistent) EC2 ββhasββββββ> Instance Store (host's physical NVMe, ephemeral) EC2 ββmountsβββ> EFS (network file share, many EC2s share it) In plain terms:...