Assess your understanding of Elastic Load Balancing and Auto Scaling principles, including traffic distribution, scalability, health checks, and high availability. This quiz covers key concepts and best practices for building robust, automatically scalable systems to handle variable workloads.
Which primary function does an Elastic Load Balancer perform for web applications handling multiple user requests?
Explanation: An Elastic Load Balancer's main job is to distribute incoming network traffic across several servers, ensuring no single server becomes overwhelmed. This helps improve application availability and reliability. Storing backups, encrypting at rest, and analyzing security threats are important tasks but are handled by different system components. Choosing the incorrect options would conflate traffic management with unrelated security or storage features.
What is the main benefit of using an Auto Scaling Group when demand on your web service fluctuates unexpectedly?
Explanation: Auto Scaling Groups help maintain performance and cost efficiency by increasing or decreasing the number of servers according to traffic. Permanently raising server resources would not optimize for cost during low demand. Blocking traffic and scheduling reboots are administrative tasks unrelated to dynamic scaling. The correct option describes the key advantage of auto scaling.
How does a load balancer handle a server that fails its health checks during operation?
Explanation: When a server fails health checks, the load balancer directs traffic away from that server until it passes the checks again. It does not shut down servers, as that could disrupt recovery; redirecting all traffic to a failed server would worsen outages. Increasing health check frequency may not help if the server is already unhealthy. The correct option supports reliability and uptime.
Which feature allows a load balancer to consistently send a returning user's requests to the same server for session continuity?
Explanation: Sticky sessions, also called session affinity, ensure a user's requests are sent to the same server, which is helpful for applications that store session data locally. Distributed backups relate to data storage, not request routing. Network firewalls help with security, not session management. Round robin forwarding distributes requests evenly and does not guarantee user continuity.
Which type of policy can an Auto Scaling Group use to increase resources when average CPU usage passes a certain threshold?
Explanation: A target tracking policy adjusts the size of the group based on a defined metric, such as average CPU usage. Fixed termination policies are not used for scaling based on metrics. Data replication and error forwarding policies do not control the number of resources in use based on performance data. Only the correct option relates to automatic scaling by monitoring particular metrics.
Why is it recommended to distribute your Auto Scaling group across multiple Availability Zones?
Explanation: Spreading resources across zones ensures continued service in the event one zone becomes unavailable, supporting high availability. Fast backups are unrelated to zone distribution. The number of health checks is not typically affected by multi-zone deployment. Disabling internet access is a security measure unconnected to the distribution of Auto Scaling groups.
Which load balancer type operates at the transport layer and is commonly used for routing TCP traffic?
Explanation: Network load balancers work at the transport layer (Layer 4) and are best suited for TCP traffic management. Application load balancers work at the application layer (Layer 7). The terms 'Object router balancer' and 'Packet session balancer' do not describe standard types of load balancers. The distinction is important for choosing the right load balancing method for different traffic types.
What happens to servers in an Auto Scaling Group when the group scales down due to reduced traffic?
Explanation: When scaling down, unnecessary servers are terminated, which reduces costs and resources. Backup operations and simultaneous reboots are not automatically initiated by scaling down. Rerouting traffic directly to users would bypass the infrastructure which is not secure or practical. The correct answer represents efficient resource management.
Which advantage does load balancing provide for applications experiencing a sudden surge in user traffic?
Explanation: By spreading traffic among multiple servers, load balancing helps prevent any single server from being overwhelmed, maintaining application stability and availability. Compressing user data and generating logs are unrelated to traffic distribution. Disabling services is a resource-saving measure but not directly linked to the core benefit of load balancing.
What is the primary purpose of the cooldown period in an Auto Scaling Group after a scaling activity occurs?
Explanation: A cooldown period delays further scaling activities, allowing the system to stabilize and metrics to update after a scaling action. Disk encryption, session resets, and firewall settings are unrelated to the timing of scaling events. This prevents inefficient or excessive scaling based on outdated information, which could lead to resource wastage or instability.