Architecting for Scale, 2nd Edition
Read it now on the O’Reilly learning platform with a 10-day free trial.
O’Reilly members get unlimited access to books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.
Book description
Every day, companies struggle to scale critical applications. As traffic volume and data demands increase, these applications become more complicated and brittle, exposing risks and compromising availability. With the popularity of software as a service, scaling has never been more important.
Updated with an expanded focus on modern architecture paradigms such as microservices and cloud computing, this practical guide provides techniques for building systems that can handle huge quantities of traffic, data, and demand—without affecting the quality your customers expect. Architects, managers, and directors in engineering and operations organizations will learn how to build applications at scale that run more smoothly and reliably to meet the needs of customers.
- Learn how scaling affects the availability of your services, why that matters, and how to improve it
- Dive into a modern service-based application architecture that ensures high availability and reduces the effects of service failures
- Explore the Single Team Owned Service Architecture paradigm (STOSA)—a model for scaling your development organization in tandem with your application
- Understand, measure, and mitigate risk in your systems
- Use the cloud to build highly scalable applications
Show and hide more
Publisher resources
Table of contents Product information
Table of contents
- Forewords
- Foreword for Second Edition
- Foreword for First Edition
- Who Should Read This Book
- Why I Wrote This Book
- A Word on Scale Today
- What’s New in the Second Edition
- Using the Cloud
- Services Versus Microservices
- Modern Digital Customer Experiences
- Navigating This Book
- Tenet 1. Availability: Maintaining Availability in Modern Applications
- Tenet 2. Modern Application Architecture: Using Services
- Tenet 3. Organization: Scaling Your Organization for Modern Applications
- Tenet 4. Risk: Risk Management for Modern Applications
- Tenet 5. Cloud: Utilizing the Cloud
- Availability Versus Reliability
- What Causes Poor Availability?
- Measuring Availability
- The Nines
- Planned Outages Are Still Outages
- Availability by the Numbers
- Measure and Track Your Current Availability
- Automate Your Manual Processes
- Improve Your Systems
- Keep on Top of Availability in Your Changing and Growing Application
- Focus #1: Build with Failure in Mind
- Focus #2: Always Think About Scaling
- Focus #3: Mitigate Risk
- Focus #4: Monitor Availability
- Focus #5: Respond to Availability Issues in a Predictable and Defined Way
- Two Mistakes High
- Scenario #1: Losing a Node
- Scenario #2: Problems During Upgrades
- Scenario #3: Data Center Resiliency
- Scenario #4: Hidden Shared Failure Types
- Scenario #5: Failure Loops
- The Monolith Application Versus the Service-Based Application
- The Ownership Benefit
- The Scaling Benefit
- What Should Be a Service?
- Guideline #1: Specific Business Requirements
- Guideline #2: Distinct and Separable Team Ownership
- Guideline #3: Naturally Separable Data
- Guideline #4: Shared Capabilities/Data
- Mixed Reasons
- Stateless Services—Services Without Data
- Stateful Services—Services with Data
- Data Partitioning
- Timely Handling of Growing Pains
- Cascading Service Failures
- Responding to a Service Failure
- Predictable Response
- Understandable Response
- Reasonable Response
- Graceful Degradation
- Graceful Backoff
- Fail as Early as Possible
- Customer-Caused Problems
- Single Team Owned Service Architecture
- Advantages of a STOSA Application and Organization
- What Does It Mean to “Own” a Service?
- Using Core Teams and Services
- Summary
- Application Complexity
- What Are Service Tiers?
- Assigning Service Tier Labels to Services
- Expectations
- Responsiveness
- Dependencies
- What Are SLAs?
- External Versus Internal SLAs
- Why Are Internal SLAs Important?
- Limit SLAs
- Top Percentile SLAs
- SLA Conditionals
- Identify Risk
- Remove Worst Offenders
- Mitigate
- Review Regularly
- Managing Risk Summary
- The Top 10 List: Low Likelihood, Low Severity Risk
- The Order Database: Low Likelihood, High Severity Risk
- Custom Fonts: High Likelihood, Low Severity Risk
- T-Shirt Photos: High Likelihood, High Severity Risk
- Scope of the Risk Matrix
- Creating the Risk Matrix
- Using the Risk Matrix for Planning
- Maintaining the Risk Matrix
- Staging Versus Production Environments
- Staging/Test Environments
- Production Environments
- Technique #1: Introduce Redundancy
- Idempotent Interfaces
- Redundancy Improvements That Increase Complexity
- Six Levels of Cloud Maturity
- Level 1: Experimenting with the Cloud
- Level 2: Securing the Cloud
- Level 3: Using Servers and Applications in the Cloud
- Level 4: Enabling Value-Added Managed Services
- Level 5: Enabling Cloud-Unique Services
- Level 6: Cloud All In
- Organization Versus Application Maturity Level
- Trap #1: Not Trusting Cloud Security
- Trap #2: Performing Cloud Migration via Lift-and-Shift
- Trap #3: The Lure of Serverless—Depending Too Much on the Hype
- Defining What We Mean by Multiple Clouds
- Which Model? Which Cloud?
- What Has Changed in the Cloud?
- Change #1: Acceptance of Microservice-Based Architectures
- Change #2: Smaller, More Specialized Cloud Services
- Change #3: Greater Focus on the Application
- Change #4: The Micro Startup
- Change #5: Security and Compliance Has Matured
- Comparing Managed Hosting and Different Types of SaaS
- Managed Hosting
- Multi-Tenant SaaS
- Single-Tenant SaaS
- AWS Architecture
- AWS Region
- AWS Availability Zone
- Data Center
- AWS—Mapping Availability Zones in Multiple Accounts
- Structure of Cloud-Based Services
- Raw Resource
- Server-Based Managed Resource
- Serverless Managed Resource
- Usage-Based Resources Allocation
- Allocated-Capacity Resource Allocation
- Changing Allocations
- Automated Allocation of Resource Capacity
- Issues with Automatic Allocation
- Dynamic Allocation, Dynamic Cost
- Example Application #1: Event Processing
- Example Application #2: Mobile Backend
- Example Application #3: Internet of Things Data Intake
- Advantages and Disadvantages of FaaS
- Serverless Hype and the Future of FaaS
- Edge Computing Today
- Why We Care
- What Should Be in the Edge Versus the Cloud?
- How Do We Decide? The Driverless Car
- Criteria for Using Edge Versus Cloud
- #1: Be Smart About What Goes on the Edge
- #2: Don’t Ignore DevOps Principles in the Edge
- #3: Nail a Highly Distributed Deployment Strategy
- #4: Reduce Versioning as Much as Possible
- #5: Reduce Per Node Provisioning and Configuration Options
- #6: Scaling Is an Edge Issue, Not Just a Cloud Issue
- #7: Nail Monitoring and Analytics
- #8: The Edge Is Not Magic
- Cloud Matters Everywhere, But at Different Levels
- Replacement Mentality Impacts How You Adopt Cloud
- Which Cloud Is Most Important?
- Important Technologies Differ
- Data Sovereignty Is Universal
- My Take
- Tenet #1—Availability
- Tenet #2—Architecture
- Tenet #3—Organization
- Tenet #4—Risk
- Tenet #5—Cloud
- Architecting for Scale
Show and hide more
Product information
- Title: Architecting for Scale, 2nd Edition
- Author(s): Lee Atchison
- Release date: March 2020
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492057178
You might also like
Check it out now on O’Reilly
Dive in for free with a 10-day trial of the O’Reilly learning platform—then explore all the other resources our members count on to build skills and solve problems every day.