Cloud migration using EMR

Category

Case Study

Author

Wissen Team

Published

May 24, 2024

Business Need:

Locomotive Industry vertical has a wide range of usage pattern in which business has some peak hours and non-peak hours. Business requirement is to have a specific architecture where it can handle various types of Loads during the day and night. It should provide a facility to scale up and scale down based on the usage of the system.

Solution and Approach:

AWS MapReduce (EMR) provides the feasibility to scale up and scale down the resources based on the CPU/Memory/Requests on the cluster. Based on Minimum and Maximum Servers combination in the EMR configuration, Number of Servers can be directly scaled up or down between the Minimum and Maximum numbers provided during the installation.

With AWS EMR in place, locomotive industry vertical can handle the distinct Loads like Low or High resource consumption data loads. S3 is used as backend storage for EMR clusters, which provides the backup facility by default as it has versioning maintained.

AWS EMR is very fast in scaling up/down as it might not take more than 20 Mins. Resources like CPU/Memory/Compute Power will be used efficiently without the Manual Intervention, with the Architecture shown below.

Benefits:

Administration and Operational efforts reduced more than 40% as compared to the Traditional Warehouse Maintenance.

Cost Optimization has been achieved by adopting EMR, AWS Managed Service as resources will be scaled down during the Idle Time of the cluster. It led to reduce the overall cost of cluster to 28%.

AWS EMR is having MapReduce (Query Engine) as default, which gave 60% faster Analytics when compared to previous way of handling.