Skip to content
Pablo Rodriguez

Global Infrastructure

Global Scale

The AWS Global Cloud Infrastructure is “a secure, extensive, and reliable cloud platform, offering more than 200 fully featured services from data centers globally.”

  • 102 Availability Zones in 32 geographic regions around the world
  • Enables highly available, scalable, and flexible architectures near customers
  • Lower latency and increased performance

AWS Regions

Physical geographical locations with two or more Availability Zones

Availability Zones

One or more data centers designed for fault isolation

AWS Local Zones

Extensions of Regions placing services closer to population centers

AWS Data Centers

Where data resides and processing occurs

Points of Presence (PoPs)

Edge locations for reduced latency to customers

  • Definition: A Region is a geographical area consisting of two or more Availability Zones
  • Connectivity: Connected to multiple ISPs and private global network backbone
  • Isolation: Regions are isolated from one another for fault tolerance and stability
  • Regions introduced before March 20, 2019 are enabled by default
  • Regions introduced after March 20, 2019 (Asia Pacific Hong Kong, Middle East Bahrain) are disabled by default
  • You must enable these newer Regions before use
  • Some Regions have restricted access (e.g., AWS GovCloud for US government agencies)
  • You control data replication across Regions
  • AWS provides information about country and state where each Region resides
  • Select Regions based on compliance and network latency requirements
  • AWS products and services availability varies by Region
  • Each Availability Zone comprises one or more data centers
  • Some Availability Zones have as many as six data centers
  • No data center can be part of two Availability Zones
  • Each Availability Zone designed as independent failure zone
  • Physically separated in typical metropolitan Region
  • Located in lower-risk floodplains
  • Discrete, uninterruptible power supply and on-site backup generation
  • Fed by different grids from independent utilities
  • Redundantly connected to multiple tier-1 transit providers
  • Availability Zone is most granular level of specification for services like Amazon EC2
  • You are responsible for selecting Availability Zones where systems reside
  • AWS recommends replicating across Availability Zones for resiliency
  • Systems can span multiple Availability Zones
  • Design systems to survive temporary or prolonged Availability Zone failure

Local Zones “make it possible for you to run latency-sensitive portions of applications closer to end users and resources in a specific geography.”

  • Extension of a Region - not standalone infrastructure
  • Place AWS compute, storage, database, and select services closer to population centers
  • Deliver single-digit millisecond latency for specific use cases
  • Managed and supported by AWS
  • Media and entertainment content creation
  • Real-time gaming
  • Reservoir simulations
  • Electronic design automation
  • Machine learning (ML)
  • High-bandwidth, secure connection between local workloads and Region workloads
  • Seamless connection to full range of in-Region services
  • Same APIs and toolsets as regular AWS services
  • Provides elasticity, scalability, and security benefits of cloud
  • Foundation for AWS infrastructure
  • Location where actual data resides and data processing occurs
  • Typically has tens of thousands of servers
  • All data centers are online and serving customers
  • State-of-the-art, highly available data centers
  • Automated processes move customer data traffic away from affected areas during failures
  • Core applications deployed in N+1 configuration
  • Sufficient capacity for traffic load balancing to remaining sites during data center failure
  • Custom network equipment sourced from multiple Original Device Manufacturers (ODMs)
  • Customized network protocol stack
  • ODMs design and manufacture products based on AWS specifications

CloudFront uses global network including more than 410 PoPs:

  • 400 edge locations
  • 13 regional mid-tier caches
  • AWS data centers and servers located close to customers
  • Designed to deliver services with lowest latency possible
  • Ensure popular content can be served quickly to customers
  • Support services like Amazon Route 53, AWS Global Accelerator, and CloudFront
  • AWS data centers between origin server and edge location
  • Have longer cache duration than edge locations
  • Bring more content closer to customers even if not popular enough to stay at edge location
  • Increase efficiency and are transparent to end user
  • Used by default with CloudFront

Edge locations located in:

  • North America, Europe, Asia, Australia
  • South America, Middle East, Africa, China
  1. Regional edge caches absorb content not frequently accessed enough for edge locations
  2. Provide alternative to fetching content from origin server
  3. Content flows from origin → regional edge cache → edge location → end user

The AWS Global Infrastructure provides a comprehensive foundation for building resilient, high-performance applications that can serve customers worldwide with optimal latency and reliability.