Skills Development AWS Well-Architected Review Workflow

AWS Well-Architected Review Workflow

v20260610
aws-well-architected-review
This automated workflow performs a comprehensive review against the AWS Well-Architected Framework (WAF). It scans IaC definitions (Terraform, CloudFormation, CDK) and deployed resources to identify compliance gaps, security risks, and optimization opportunities across six core pillars. Findings are structured and automatically generated as actionable GitHub issues with remediation code examples.
Get Skill
248 downloads
Overview

AWS Well-Architected Review

This workflow performs a structured AWS Well-Architected Framework (WAF) review against your workload's IaC files and deployed infrastructure. It identifies risks across all 6 WAF pillars and creates GitHub issues to track remediation.

Prerequisites

  • AWS CLI configured and authenticated
  • IaC files present in the repository (Terraform, CloudFormation, CDK, or SAM)
  • GitHub MCP server configured and authenticated

Workflow Steps

Step 1: Load Well-Architected Framework Reference

Fetch current AWS WAF best practices:

  • https://docs.aws.amazon.com/wellarchitected/latest/framework/welcome.html
  • Pillar-specific lenses relevant to the workload type (Serverless, SaaS, etc.)

Step 2: Discover IaC & Architecture

Scan the repository for IaC files:

  • Terraform: **/*.tf
  • CloudFormation/SAM: **/*.yaml, **/*.json (CFn templates)
  • CDK: lib/**/*.ts, bin/**/*.ts, cdk.json

Identify key AWS services in use (compute, data, networking, security, observability) and generate a Mermaid architecture diagram.

Step 3: Pillar-by-Pillar Review

Pillar 1: Operational Excellence

  • All infrastructure defined as IaC (no manual console changes)
  • Consistent tagging strategy applied across all resources
  • CloudWatch alarms defined for key metrics
  • Automated deployment pipeline present (no manual deployments)
  • CloudTrail enabled for audit logging
  • Runbooks or operational documentation present

Pillar 2: Security

  • IAM roles use least-privilege policies (no * actions without justification)
  • No hardcoded credentials in IaC or code
  • Secrets managed via Secrets Manager or SSM Parameter Store
  • S3 buckets have public access blocked and server-side encryption enabled
  • Sensitive resources placed in private subnets
  • Security groups restrict inbound to minimum required ports/CIDRs
  • KMS encryption enabled for sensitive data stores (RDS, EBS, S3, SQS, DynamoDB)
  • SSL/TLS enforced on all endpoints (enforceSSL: true)
  • GuardDuty enabled (aws guardduty list-detectors)
  • AWS WAF configured on public-facing APIs and CloudFront distributions
  • MFA delete enabled on critical S3 buckets

Pillar 3: Reliability

  • Multi-AZ deployments for production databases (RDS Multi-AZ, DynamoDB Global Tables)
  • Auto Scaling configured with appropriate policies for EC2/ECS
  • S3 versioning and lifecycle policies configured
  • RDS automated backups enabled with appropriate retention period
  • DynamoDB Point-in-Time Recovery (PITR) enabled
  • Dead Letter Queues (DLQ) configured for Lambda, SQS, SNS
  • Route 53 health checks configured for DNS failover
  • Lambda reserved concurrency set to prevent noisy-neighbor throttling

Pillar 4: Performance Efficiency

  • Right-sized instance types (Lambda memory, EC2 type, RDS class)
  • Graviton/ARM instances used where available (Lambda arm64, EC2 Graviton)
  • Caching implemented (ElastiCache, DAX, CloudFront, API Gateway caching)
  • CloudFront used for global static content delivery
  • Aurora Serverless or DynamoDB On-Demand for variable load patterns
  • Lambda Provisioned Concurrency for latency-critical synchronous paths

Pillar 5: Cost Optimization

  • EC2 Reserved Instances or Savings Plans for steady-state workloads
  • S3 lifecycle policies moving data to cheaper storage tiers
  • Lambda arm64 architecture adopted (20% cost reduction)
  • VPC Endpoints for S3/DynamoDB to avoid NAT Gateway charges
  • gp2 EBS volumes migrated to gp3 (same performance, 20% cheaper)
  • Development/test environments have auto-shutdown schedules
  • AWS Budgets and Cost Anomaly Detection configured
  • Unattached EBS volumes and idle EC2 instances identified

Pillar 6: Sustainability

  • Graviton/ARM instances selected where available
  • Serverless/managed services preferred over always-on EC2
  • S3 lifecycle policies reduce unnecessary long-term data storage
  • Auto Scaling configured to avoid over-provisioning
  • Region selection considers AWS renewable energy commitments

Step 4: Risk Classification

For each finding, classify:

  • High Risk: Security vulnerability, single point of failure, no backup/recovery
  • Medium Risk: Suboptimal reliability, cost inefficiency, performance concern
  • Low Risk: Best practice deviation, minor optimization opportunity

Step 5: User Confirmation

๐Ÿ—๏ธ AWS Well-Architected Review Summary

๐Ÿ“Š Review Results:
โ€ข IaC Files Analyzed: X
โ€ข AWS Services Identified: Y
โ€ข Total Findings: Z
  โ€ข High Risk: A (immediate action required)
  โ€ข Medium Risk: B (should address soon)
  โ€ข Low Risk: C (nice to have)

๐Ÿ”ด Top High Risk Findings:
1. [Pillar]: [Finding] โ€” [Why it matters]
2. [Pillar]: [Finding] โ€” [Why it matters]

๐Ÿ’ก This will create Z individual GitHub issues + 1 EPIC issue.

โ“ Proceed with creating GitHub issues? (y/n)

Step 6: Create Individual Finding Issues

Label with "well-architected" and the pillar name (e.g., "security", "reliability").

Title: [WAF-<PILLAR>] [Brief Finding] โ€” [Risk Level]

Body:

## ๐Ÿ—๏ธ Well-Architected Finding: [Brief Title]

**Pillar**: [Name] | **Risk Level**: [High/Medium/Low] | **Effort**: [Low/Medium/High]

### ๐Ÿ“‹ Description
[Clear explanation of the finding and why it matters]

### ๐Ÿ”ง Remediation

**IaC Fix** (preferred):
```hcl
# Terraform example
resource "aws_s3_bucket_server_side_encryption_configuration" "example" {
  bucket = aws_s3_bucket.example.id
  rule {
    apply_server_side_encryption_by_default {
      sse_algorithm = "aws:kms"
    }
  }
}

AWS CLI fallback:

aws s3api put-bucket-encryption --bucket <name> \
  --server-side-encryption-configuration '{"Rules":[{"ApplyServerSideEncryptionByDefault":{"SSEAlgorithm":"aws:kms"}}]}'

๐Ÿ“š AWS Reference

  • [WAF Best Practice Link]
  • [AWS Documentation Link]

โœ… Validation

  • Change implemented in IaC and deployed
  • AWS Config rule passes (if applicable)
  • Security Hub finding resolved (if applicable)

Well-Architected Question: [WAF question this maps to]


### Step 7: Create EPIC Tracking Issue
Label with "well-architected" and "epic".

**Title**: `[EPIC] AWS Well-Architected Review โ€” X findings across 6 pillars`

**Body**: Executive summary with pillar breakdown table (finding counts by pillar and risk level), Mermaid architecture diagram, prioritized checklist linking all individual issues (High โ†’ Medium โ†’ Low), and success criteria:
- All High-risk findings resolved
- Medium findings have accepted mitigation plans
- No regression in existing CloudWatch alarms or Config rules

## Error Handling
- **No IaC Files Found**: Limit review to live resource discovery via AWS CLI and note the gap
- **Insufficient AWS Permissions**: List required read-only permissions for the review
- **GitHub Creation Failure**: Output all findings as formatted markdown to console

## Success Criteria
- โœ… All 6 WAF pillars reviewed against IaC and live infrastructure
- โœ… All findings classified by risk level and pillar
- โœ… Actionable remediation steps with IaC examples for each finding
- โœ… GitHub issues created for team tracking
- โœ… Architecture diagram generated for EPIC context
- โœ… AWS documentation references included
Info
Category Development
Name aws-well-architected-review
Version v20260610
Size 7.44KB
Updated At 2026-06-11
Language