AWSSAA-C0375 concepts
SAA-C03 Cheat Sheet
Quick reference for the AWS Certified Solutions Architect – Associate exam.
Quick Navigation
VPC & Networking
- VPC CIDR Range
- VPC supports IPv4 CIDR blocks from /16 (65,536 IPs) to /28 (16 IPs). Subnets must be within the VPC CIDR. AWS reserves 5 IPs per subnet (first 4 + last 1).
- Public vs Private Subnet
- Public subnet: route table has 0.0.0.0/0 → Internet Gateway. Private subnet: route table has 0.0.0.0/0 → NAT Gateway (for outbound internet). NAT Gateway is managed, HA within an AZ; deploy one per AZ for full HA.
- VPC Peering vs Transit Gateway
- VPC Peering: 1-to-1, non-transitive, low latency, no bandwidth limit. Transit Gateway: hub-and-spoke, connects many VPCs/on-prem, supports transitive routing, multicast; use when managing 3+ VPCs.
- VPC Endpoints — Gateway vs Interface
- Gateway Endpoint: free, modifies route table, supports only S3 and DynamoDB, within same Region. Interface Endpoint (AWS PrivateLink): ENI in your subnet, costs per hour + per GB, supports 100+ services, works across accounts within the same Region.
- AWS PrivateLink
- Expose your service to other VPCs without VPC peering or exposing to the internet. Requires a Network Load Balancer on the provider side and an Interface VPC Endpoint on the consumer side.
- NAT Gateway vs NAT Instance
- NAT Gateway: managed, scales to 100 Gbps, no patching, HA per AZ, no security group, billed per hour + per GB. NAT Instance: self-managed EC2, can be used as bastion, lower cost for small traffic, must disable source/dest check.
- Direct Connect vs Site-to-Site VPN
- Direct Connect: dedicated private connection, 1–100 Gbps, consistent latency, not encrypted by default (use MACsec or VPN over DX). Site-to-Site VPN: IPsec over public internet, quick setup, up to 1.25 Gbps per tunnel. Use VPN as backup for DX.
EC2 & Auto Scaling
- Instance Type Families
- General (M, T): balanced CPU/RAM. Compute Optimized (C): high CPU, batch/ML. Memory Optimized (R, X, z): large in-memory workloads. Storage Optimized (I, D, H): high IOPS NVMe or dense HDD. Accelerated (P, G, Inf): GPU/ML. T-series supports CPU credits (burstable).
- Placement Groups
- Cluster: same AZ, same rack — lowest latency, 10 Gbps enhanced networking, high failure risk. Spread: different hardware across AZs — max 7 instances per AZ per group, best HA. Partition: isolated partitions per AZ — Hadoop/Kafka/Cassandra, up to 7 partitions per AZ.
- Launch Template vs Launch Configuration
- Launch Template: supports versioning, mixed instance types, Spot+On-Demand, T2/T3 unlimited, newer features. Launch Configuration: legacy, no versioning, no mixed types. Always prefer Launch Templates for new ASGs.
- ASG Scaling Policies
- Target Tracking: maintain a metric at a target (e.g., CPU at 50%) — simplest, recommended. Step Scaling: scale by defined increments based on CloudWatch alarm breach magnitude. Scheduled: scale at a known time. Predictive: ML-based, pre-scales based on historical load patterns.
- EC2 Purchasing Options
- On-Demand: pay per second, no commitment. Reserved (1–3 yr): up to 72% discount, Standard (no change) or Convertible (can change type). Savings Plans: flexible, commit to $/hr. Spot: up to 90% off, interruptible with 2-min warning. Dedicated Host: physical server, compliance/licensing.
- Instance Store vs EBS
- Instance Store: physically attached NVMe, extremely fast, ephemeral (lost on stop/terminate), no additional cost. EBS: network-attached, persistent, detachable, snapshotted to S3. Use instance store for temporary buffers, scratch data, or cache.
Load Balancing
- ALB vs NLB vs GLB
- ALB (Application): Layer 7 HTTP/HTTPS/gRPC, content-based routing (path/host/header/query), WebSocket, WAF integration. NLB (Network): Layer 4 TCP/UDP/TLS, millions of RPS, static IP/Elastic IP, lowest latency, preserve source IP. GLB (Gateway): Layer 3 GENEVE, inspects traffic via third-party appliances (firewalls, IDS/IPS).
- Health Checks
- All LBs perform health checks on registered targets. ALB checks HTTP response codes (default 200). NLB checks TCP connection or HTTP/HTTPS. Unhealthy targets are taken out of rotation. Configure interval, threshold, and timeout appropriately.
- Sticky Sessions (Session Affinity)
- ALB: application-based (app sets cookie) or duration-based (LB generates cookie). NLB: source-IP stickiness available but disabled by default; TCP flow hashing is not the same as session stickiness. Stickiness can cause uneven load distribution; use sparingly or externalize session state to ElastiCache/DynamoDB.
- Cross-Zone Load Balancing
- Distributes traffic evenly across all registered targets in all enabled AZs. ALB: default on; can be overridden at target group level since 2023, no charge. NLB/GLB: disabled by default, charged for inter-AZ data transfer when enabled. Prevents AZ imbalance when instance counts differ per AZ.
- Connection Draining / Deregistration Delay
- Allows in-flight requests to complete before deregistering a target. Default: 300 seconds (range: 0–3600). Set to 0 for Lambda targets or short-lived connections. Set lower for stateless apps, higher for long-running connections.
- ALB Routing Rules
- Forward to target group, redirect (HTTP→HTTPS), fixed response (503 maintenance), authenticate (Cognito/OIDC). Conditions: host header, path, HTTP method, query string, source IP, HTTP headers. Rules evaluated in priority order; default rule is the catch-all.
S3 Deep Dive
- S3 Storage Classes
- Standard: frequent access, ms latency, 3 AZs. Standard-IA: infrequent, ms latency, retrieval fee, min 30-day charge. One Zone-IA: single AZ, 20% cheaper, for re-creatable data. Intelligent-Tiering: auto-moves between tiers, monitoring fee. Glacier Instant: ms retrieval, min 90 days. Glacier Flexible: 1–5 min to 12 hr retrieval, min 90 days. Glacier Deep Archive: 12–48 hr, min 180 days, cheapest.
- S3 Replication (CRR vs SRR)
- Cross-Region Replication (CRR): compliance, lower latency for global users, cross-account. Same-Region Replication (SRR): log aggregation, live replication between prod/test. Requirements: versioning must be enabled on source and dest. Replicated asynchronously. Delete markers not replicated by default.
- S3 Encryption
- SSE-S3: AWS-managed AES-256 keys, header x-amz-server-side-encryption: AES256, no extra cost. SSE-KMS: customer controls key policy, audit via CloudTrail, header: aws:kms, KMS API call costs apply. SSE-C: customer provides key in request header, AWS does not store key. Client-side: encrypt before upload.
- Versioning & Lifecycle
- Versioning: once enabled, cannot be fully disabled (only suspended). Protects against accidental deletes (delete marker). Lifecycle rules: transition objects between storage classes based on age (e.g., Standard → IA after 30 days → Glacier after 90 days). Can expire versions and delete incomplete multipart uploads.
- Presigned URLs
- Grants temporary access to a private S3 object without changing bucket policy. Signed with IAM credentials of the requester. Default expiry: 3,600 seconds (max 7 days with IAM user credentials; with IAM role temporary credentials, URL expires when the session token expires). Use for: download links for authenticated users, upload without exposing credentials.
- S3 Transfer Acceleration
- Speeds up uploads by routing through CloudFront edge locations via optimized AWS backbone. Best for large files (>1 GB) from distant geographic locations. Enabled per bucket; uses accelerate endpoint: bucketname.s3-accelerate.amazonaws.com. Additional per-GB charge only when faster than standard.
- S3 Performance
- 3,500 PUT/COPY/POST/DELETE and 5,500 GET/HEAD requests per second per prefix. Use multiple prefixes to parallelize. Multipart upload recommended for files >100 MB, required >5 GB. S3 Byte-Range Fetches: parallelize GETs. S3 Batch Operations: bulk operations on billions of objects.
Database Services
- RDS Multi-AZ vs Read Replicas
- Multi-AZ: synchronous standby replica in another AZ, automatic failover (60–120 sec), same Region, used for HA (not performance). Read Replicas: asynchronous replication, up to 15 replicas, cross-Region supported, can be promoted to standalone DB. Read Replicas reduce read load; Multi-AZ reduces downtime.
- Aurora Architecture
- Storage auto-scales 10 GB to 128 TB. 6 copies of data across 3 AZs (4/6 for writes, 3/6 for reads). Up to 15 low-latency read replicas. Aurora Global Database: 1 primary Region + up to 5 secondary Regions, sub-second replication, <1 sec RPO. Failover in <1 min within Region. Aurora Serverless v2: scales in fine-grained increments, ideal for variable workloads.
- DynamoDB Capacity Modes
- Provisioned: specify RCUs/WCUs, supports auto-scaling, predictable cost, can use Reserved Capacity. On-Demand: pay per request, no capacity planning, 2× cost of provisioned at peak. 1 RCU = 1 strongly consistent read of 4 KB/s or 2 eventually consistent reads. 1 WCU = 1 write of 1 KB/s.
- DynamoDB DAX & Global Tables
- DAX: in-memory cache cluster, microsecond latency for reads, write-through, API-compatible but requires using DAX client/endpoint. Global Tables: multi-Region, multi-active replication, last-writer-wins conflict resolution; requires on-demand or auto-scaling. DynamoDB Streams: ordered item-level changes, 24-hr retention, triggers Lambda.
- ElastiCache: Valkey/Redis OSS vs Memcached
- Valkey/Redis OSS: supports persistence, replication, read replicas, cluster mode (sharding), Sorted Sets/Lists/Geospatial, pub/sub, Lua scripting. HA with Multi-AZ + automatic failover. Memcached: pure cache, multi-threaded, no replication/persistence, simpler. Choose Valkey/Redis OSS for session store, leaderboards, queues; Memcached for pure object caching with horizontal scaling.
- RDS Proxy
- Fully managed connection pooler for RDS and Aurora. Reduces database connections (critical for Lambda which creates new connections per invocation). Improves resilience — failover time reduced by up to 66%. Enforces IAM authentication. Serverless, auto-scales.
Serverless
- Lambda Limits & Concurrency
- Max execution: 15 minutes. Max memory: 10,240 MB (10 GB). Ephemeral /tmp storage: up to 10,240 MB. Deployment package: 50 MB zipped, 250 MB unzipped (use Lambda layers). Default concurrency: 1,000 per Region (soft limit). Reserved concurrency: guarantees capacity and acts as throttle ceiling. Provisioned concurrency: pre-initializes instances to eliminate cold starts.
- API Gateway Types
- REST API: full features (usage plans, API keys, request transformation, caching). HTTP API: lower latency, cheaper (70% less), JWT auth, Lambda proxy; fewer features. WebSocket API: bidirectional, real-time apps (chat, games). Edge-Optimized: CloudFront global distribution. Regional: same-Region clients, can combine with own CloudFront.
- Step Functions
- Orchestrates Lambda and 200+ AWS services using state machines. Standard Workflow: up to 1 year, exactly-once, auditable history, higher cost. Express Workflow: up to 5 minutes, at-least-once (async) or at-most-once (sync), high event rate, lower cost. Use for: ETL pipelines, order processing, human approvals.
- SQS vs SNS vs EventBridge
- SQS: pull-based queue, decouples producers/consumers, message retention up to 14 days, fan-out requires SNS. SNS: push-based pub/sub, up to 12.5M subscribers, fan-out to SQS/Lambda/HTTP/email. EventBridge: event bus, rich filtering on event content, 200+ AWS sources, SaaS integration, scheduled events via cron/rate rules. EventBridge is SNS evolution for complex routing.
- Lambda Invocation Types
- Synchronous (RequestResponse): caller waits for result — API Gateway, Cognito, Alexa. Asynchronous (Event): Lambda retries up to 2× on error, DLQ supported — S3, SNS, EventBridge. Polling (Event Source Mapping): Lambda polls SQS, Kinesis, DynamoDB Streams, MSK — batch processing, bisect on error.
- Lambda@Edge vs CloudFront Functions
- Lambda@Edge: runs at 13 CloudFront regional edge locations, Node.js/Python, up to 5 sec (viewer) / 30 sec (origin), 128 MB (viewer requests) / 10,240 MB (origin requests), access to request/response body. CloudFront Functions: runs at 450+ POPs, JavaScript only, sub-millisecond, 2 MB memory, no body access, cheapest. Use CloudFront Functions for URL rewrites/redirects; Lambda@Edge for complex auth or A/B testing.
- ECS vs EKS
- ECS = AWS-native container orchestration (simpler). EKS = managed Kubernetes (portable, complex). Both support Fargate (serverless, no EC2 management) or EC2 launch type (more control).
Security & IAM
- IAM Policy Evaluation Logic
- Evaluation order: 1) Explicit Deny (always wins). 2) Organizations SCP — if SCP denies, stop. 3) Resource-based policy — grants access without identity-based policy needed (same account). 4) Identity-based policy (permissions boundary limits max). 5) Session policies. 6) Default: implicit deny. Cross-account: BOTH resource-based and identity-based policies must allow.
- Resource Policies vs Identity Policies
- Identity Policy: attached to IAM user/role/group, defines what the principal CAN do. Resource Policy: attached to resource (S3 bucket, SQS, KMS), defines who can access it. Resource policies allow cross-account access without assuming a role. S3 bucket policy is a resource policy; IAM role policy is identity policy.
- KMS Key Types
- AWS Managed Keys: free, auto-rotated every year, cannot change policy, key ID: aws/service. Customer Managed Keys (customer managed KMS key): $1/month, custom rotation (optional annual), full key policy control. Customer-Provided Keys (SSE-C / import): you manage key material outside KMS, no KMS charges but higher operational overhead. Multi-Region Keys: same key material in multiple Regions.
- Secrets Manager vs SSM Parameter Store
- Secrets Manager: designed for secrets, automatic rotation via Lambda (RDS/Redshift natively), cross-account sharing, $0.40/secret/month + $0.05 per 10K API calls. SSM Parameter Store: Standard free (4 KB, no rotation), Advanced ($0.05/param/month, 8 KB, policies). Use Secrets Manager for DB credentials with rotation; SSM for configuration/non-secret parameters.
- Security Groups vs NACLs
- Security Groups: stateful (return traffic auto-allowed), instance/ENI level, allow rules only, evaluated as a set (all rules). NACLs: stateless (must explicitly allow return traffic), subnet level, allow and deny rules, rules evaluated in number order (lowest first), default NACL allows all. NACLs are the first line of defense at the subnet boundary.
- IAM Roles & STS
- Roles provide temporary credentials via STS AssumeRole. Key use cases: EC2 instance profile (app accesses AWS services), cross-account access, federated identity (SAML/OIDC/Cognito), service-to-service. STS tokens include: AccessKeyId, SecretAccessKey, SessionToken, Expiration. Trust policy defines who can assume the role.
- AWS WAF & Shield
- WAF: web application firewall, Layer 7, attach to ALB/CloudFront/API Gateway/AppSync. Rules: IP sets, geo-match, rate-based, SQL injection, XSS, custom regex. Managed rule groups available. Shield Standard: free DDoS protection at Layer 3/4 for all AWS customers. Shield Advanced: $3,000/month, Layer 7 DDoS, 24/7 DRT, cost protection.
- IAM Identity Center
- Centralized SSO for multi-account access. Assign permission sets to users/groups across AWS Organizations accounts. Supports SAML 2.0 federation with external IdPs like Microsoft Entra ID.
High Availability & Disaster Recovery
- DR Strategies (RPO/RTO)
- Backup & Restore: hours RPO/RTO, lowest cost, data backed up to S3. Pilot Light: core services replicated (DB sync), minutes–hours RTO, minimal running resources. Warm Standby: scaled-down fully functional environment, minutes RTO. Active-Active (Multi-Site): near-zero RPO/RTO, full capacity in multiple Regions, highest cost.
- Route 53 Routing Policies
- Simple: single resource, no health checks. Weighted: distribute traffic by percentage (A/B testing, gradual migration). Latency: route to lowest-latency Region. Failover: active-passive, health check required. Geolocation: route by user's geographic location. Geoproximity: route by resource location + bias. Multi-Value: up to 8 healthy records returned (not a substitute for LB).
- Route 53 Health Checks
- Monitor endpoints (HTTP/HTTPS/TCP), other health checks (calculated), or CloudWatch alarms. Health checks from 15 global AWS locations. Interval: 30 sec (standard) or 10 sec (fast, higher cost). Integration: trigger SNS notifications, control failover routing. Private endpoints: must use CloudWatch alarm-based health checks.
- Multi-AZ Architecture Pattern
- Deploy resources across minimum 2 AZs (3 recommended). Use ALB/NLB to distribute traffic. ASG spans multiple AZs. RDS Multi-AZ for automatic DB failover. ElastiCache Multi-AZ with replication groups. EFS: automatically replicated across AZs. Architect for at least one AZ failure without service interruption.
- AWS Backup
- Centralized backup service across EC2, EBS, RDS, Aurora, DynamoDB, EFS, FSx, Storage Gateway, S3. Backup plans: schedule, lifecycle, retention. Cross-Region and cross-account backup for compliance. Backup Vault Lock: WORM protection, immutable backups even from root. Integrates with Organizations for org-wide policy.
Storage & Migration
- EBS Volume Types
- gp3 (SSD): baseline 3,000 IOPS / 125 MiB/s, up to 16,000 IOPS / 1,000 MiB/s independently, cost-effective default. gp2 (SSD): IOPS tied to size (3 IOPS/GB, burst to 3,000), being replaced by gp3. io2 Block Express: up to 256,000 IOPS, 4,000 MiB/s, 99.999% durability, multi-attach, for mission-critical DBs. st1 (HDD): throughput-optimized, max 500 MiB/s, big data/Kafka. sc1 (HDD): cold, max 250 MiB/s, lowest cost, infrequent access.
- EFS vs FSx
- EFS: NFS v4, Linux-only, scales automatically, multi-AZ, 3 throughput modes (Bursting/Elastic/Provisioned), storage classes (Standard/IA), per-GB pricing. FSx for Windows: SMB protocol, Active Directory integration, Windows workloads. FSx for Lustre: high-performance parallel FS, ML/HPC, integrates with S3. FSx for ONTAP: multi-protocol, snapshots, SnapMirror. FSx for OpenZFS: NFS/SMB, low latency.
- AWS Storage Gateway
- S3 File Gateway: NFS/SMB to S3, local cache for frequently accessed data. FSx File Gateway: SMB to FSx for Windows, local cache (unavailable to new customers). Volume Gateway (Stored): entire dataset on-prem, async backup to S3 as EBS snapshots. Volume Gateway (Cached): primary data in S3, low-latency cache on-prem. Tape Gateway: virtual tape library to S3/Glacier.
- AWS Snow Family
- Snowcone: 8 TB HDD / 14 TB SSD, smallest, ruggedized, DataSync agent included. Snowball Edge Storage: 80 TB usable, up to 10 Gbps NIC, bulk data transfer. Snowball Edge Compute: 42 TB + EC2 + Lambda, edge compute in disconnected environments. Snowmobile: 100 PB, shipping container, for exabyte-scale migration (>10 PB, prefer Snowmobile).
- AWS DMS & MGN
- DMS (Database Migration Service): migrate databases to AWS with minimal downtime. Supports homogeneous (Oracle→Oracle) and heterogeneous (Oracle→Aurora using Schema Conversion Tool). Continuous replication available. MGN (Application Migration Service): lift-and-shift physical/virtual/cloud servers to AWS EC2. Replicates disks continuously, minimal cutover window. Replaces SMS (Server Migration Service).
- AWS DataSync
- Online data transfer service: NFS/SMB on-prem or inter-cloud → S3/EFS/FSx. Encrypted in transit, data integrity verification. Up to 10× faster than open-source tools. Scheduled or one-time tasks. Agent deployed on-prem or in cloud. Use for migration, replication, and data processing workflows.
Monitoring & Cost Optimization
- CloudWatch Components
- Metrics: time-series data, default namespace per service, 1-min (detailed monitoring, extra cost) or 5-min. Alarms: trigger SNS/Auto Scaling/EC2 actions on metric threshold. Logs: collect from EC2 (agent), Lambda, API Gateway, VPC Flow Logs; Log Insights for querying. Dashboards: cross-Region, cross-account. Contributor Insights: identify top N contributors to log patterns.
- CloudTrail
- Audit log of all API calls (who did what, when, from where). Management events: enabled by default, control-plane ops (CreateInstance, DeleteBucket). Data events: object-level S3 operations, Lambda invocations — must enable explicitly, higher cost. Insights: detect unusual API activity. Trails stored in S3, optionally sent to CloudWatch Logs. Immutable with S3 Object Lock.
- AWS Config
- Tracks resource configuration changes over time. Managed rules: pre-built compliance checks (e.g., s3-bucket-public-read-prohibited, encrypted-volumes). Custom rules via Lambda. Conformance Packs: bundle of rules for frameworks (PCI DSS, CIS). Remediation: auto-remediate via SSM Automation. Does not prevent changes — records and evaluates.
- Savings Plans vs Reserved Instances
- Reserved Instances (RI): commit to specific instance family/Region/OS/tenancy for 1 or 3 years. Standard RI: up to 72% off, no modification. Convertible RI: up to 66% off, can change attributes. EC2 Savings Plans: 1 or 3 yr commitment on $/hr, flexible across size/OS/tenancy within a family+Region. Compute Savings Plans: flexible across family/Region/OS — best flexibility, up to 66% off.
- Cost Explorer & Compute Optimizer
- Cost Explorer: visualize spend by service/tag/account, 13 months historical, 12-month forecast, RI/Savings Plans recommendations. Compute Optimizer: analyzes CloudWatch metrics for EC2, Lambda, EBS, ECS on Fargate, Auto Scaling Groups — recommends right-size. Requires opt-in. Enhanced infrastructure metrics: 3 months data ($0.0003/resource/hr).
- Trusted Advisor
- Automated best-practice checks across 5 pillars: Cost Optimization, Performance, Security, Fault Tolerance, Service Limits. Free tier: 7 core checks (security groups, S3 public buckets, MFA on root, EBS snapshots, IAM use). Business/Enterprise support: all checks + APIs + EventBridge integration + priority recommendations.
Caching & Content Delivery
- CloudFront Key Concepts
- CDN with 450+ POPs (edge locations) + 13 regional edge caches. Supported origins: S3, ALB, EC2, any HTTP endpoint. Cache behavior: path patterns routed to different origins. TTL controlled by Cache-Control/Expires headers or CloudFront policy. Geo-restriction: allowlist/blocklist by country. HTTPS: ACM certificate (must be in us-east-1 for CloudFront).
- CloudFront Origin Access Control (OAC)
- Successor to Origin Access Identity (OAI). Restricts S3 bucket access to CloudFront only. Supports SSE-KMS encrypted S3 buckets and all S3 operations. Apply bucket policy to allow CloudFront service principal. Recommended approach: do NOT make S3 bucket public when using CloudFront.
- ElastiCache Caching Strategies
- Lazy Loading (Cache-Aside): read from cache; on miss, read DB and populate cache. Cache can become stale; add TTL to mitigate. Write-Through: write to DB and cache simultaneously. Cache always up-to-date but write penalty + cache churn (data written but never read). Combine: write-through for critical data + TTL for stale data removal.
- Global Accelerator
- Uses AWS global network (Anycast static IPs → edge locations → AWS backbone) for non-HTTP use cases or when static IPs are required. Provides 2 static Anycast IPs globally. Supports TCP/UDP, weighted endpoint routing, health checks with instant failover (<30 sec). Unlike CloudFront (caches content), Global Accelerator proxies to your actual endpoint — good for APIs, gaming, IoT, VoIP.
- CloudFront vs Global Accelerator
- CloudFront: HTTP/HTTPS caching CDN, reduces origin load, best for static assets and dynamic web content. Global Accelerator: no caching, TCP/UDP, static IPs, 60% faster TCP by routing over AWS backbone, best for non-cacheable content requiring consistent low latency or when static IP is a firewall requirement.
Decoupling & Event-Driven Architecture
- SQS Standard vs FIFO
- Standard: unlimited throughput, at-least-once delivery, best-effort ordering. FIFO: exactly-once processing, ordered within message group, 300 TPS (3,000 with batching), deduplication (5-min window). FIFO queue name must end in .fifo. Use FIFO for financial transactions, order processing where order and dedup matter.
- SQS Visibility Timeout & DLQ
- Visibility Timeout: after consumer reads a message, it's hidden from other consumers for this duration (default 30 sec, max 12 hr). If not deleted within timeout, message reappears. Set to ~6× Lambda function timeout. Dead Letter Queue (DLQ): after maxReceiveCount failures, message moved to DLQ for debugging. DLQ alarms recommended. DLQ redrive: reprocess messages after fixing the bug.
- SNS Fan-Out Pattern
- SNS topic publishes to multiple SQS queues, Lambda functions, HTTP endpoints, email, SMS. Fan-out: decouple event producers from consumers, fully parallel processing. SNS FIFO: ordered delivery to SQS FIFO queues, up to 300 publishes/sec. Message Filtering: per-subscription filter policy (JSON), reduces unnecessary processing. SNS + SQS: asynchronous fan-out to buffered consumers.
- Amazon EventBridge
- Serverless event bus. Default bus (AWS services) + custom buses + partner buses (SaaS: Zendesk, Datadog, etc.). Rules: pattern matching on event content → route to target (Lambda, SQS, Step Functions, 20+ targets). Schema Registry: auto-discovers and documents event schemas. Pipes: point-to-point event enrichment (source → optional filter/enrich → target). EventBridge evolved from CloudWatch Events.
- Kinesis Data Streams vs Amazon Data Firehose
- Kinesis Data Streams: real-time, 1 ms latency, retain 24 hr–365 days, custom consumers (Lambda/KCL), manual scaling of shards (1 MB/s in, 2 MB/s out per shard). Amazon Data Firehose (formerly Kinesis Data Firehose): near-real-time (60 sec or 1 MB buffer), fully managed delivery to S3/Redshift/OpenSearch/Splunk, auto-scaling, built-in transformation via Lambda.
- Kinesis Data Streams Scaling
- Each shard: 1 MB/s ingest (1,000 records/sec), 2 MB/s read. Shard splitting: increase capacity. Shard merging: reduce cost. Enhanced Fan-Out: dedicated 2 MB/s per consumer per shard via HTTP/2 push, reduces shared read contention. Partition key determines shard; hot partitions can cause throttling — use random key prefix.