Explore essential concepts of auto scaling groups in cloud computing, focusing on how EC2 fleets automatically adjust to demand. This quiz reviews scaling policies, group configurations, and best practices to optimize resource usage and uptime.
What is the primary purpose of an auto scaling group for EC2 instances in a cloud environment?
Explanation: The main function of an auto scaling group is to increase or decrease the number of instances as needed to meet demand. This helps ensure availability and cost efficiency. Increasing data encryption refers to a security measure, not scaling. Maintaining a static number ignores changing requirements. Scheduling maintenance is a different management task.
When configuring an auto scaling group, what does the minimum size parameter define?
Explanation: Minimum size sets a baseline, ensuring a minimum number of instances are always running. Disk size and network bandwidth are unrelated to this parameter. The number of users allowed is a separate concern handled by application logic, not scaling groups.
If your application's CPU load often exceeds 80%, which scaling policy would help maintain performance?
Explanation: Adding an instance helps handle increased load, which maintains performance. Adding storage may not address CPU load issues. Sending alerts is helpful for monitoring but does not fix the problem automatically. Reducing the minimum size could make things worse by lowering resources.
What does the desired capacity specify in an auto scaling group configuration?
Explanation: Desired capacity reflects how many instances you ideally want running, and the auto scaling group works to maintain this number. Storage size, billing limits, and choosing a network port are managed elsewhere and are not controlled by the desired capacity setting.
What action does an auto scaling group typically take when it detects an unhealthy instance?
Explanation: Upon detecting an issue, the group removes the bad instance and starts a new one to maintain service. Changing the IP address or increasing size doesn't resolve unhealthy instance problems. Disabling monitoring would likely hide rather than fix the issue.
Which of the following metrics could trigger a scaling out event in an auto scaling group?
Explanation: High CPU usage is a common metric to trigger scaling since it reflects resource strain. Password expiration and help desk tickets are user and support related, not infrastructure performance. Disk encryption status affects security, not load-based scaling.
What happens if an auto scaling group reaches its maximum size setting during a high demand period?
Explanation: Once the maximum is reached, no more instances are added, which helps control costs and resource usage. The group cannot increase its own maximum automatically. Terminating oldest instances or switching hardware do not address scaling constraints.
Why is a launch template or launch configuration necessary when creating an auto scaling group?
Explanation: A launch template provides needed details like image, instance type, and security rules for new instances. Network encryption, domain name setup, and backup scheduling are managed elsewhere, not by the launch template.
What is the benefit of using multiple availability zones in an auto scaling group?
Explanation: Distributing instances across zones improves resilience in case one zone faces issues. It does not increase instance power or disable important features. Preventing user access contradicts the purpose of scaling and reliability.
What does a 'scaling in' event do in the context of an auto scaling group?
Explanation: Scaling in decreases the number of instances to save costs when fewer resources are needed. Adding instances is scaling out, not in. Switching billing types and upgrading software are separate operations unrelated to the scaling in concept.