AWS Elastic Map Reduce (EMR) Service

From GM-RKB

An AWS Elastic Map Reduce (EMR) Service is a distributed data analytics platform service that is offered by Amazon Web Services (AWS) and runs managed Apache Hadoop clusters.



References

2016

  • https://en.wikipedia.org/wiki/Apache_Hadoop#Amazon_Elastic_MapReduce
    • Elastic MapReduce (EMR)[1] was introduced by Amazon.com in April 2009. Provisioning of the Hadoop cluster, running and terminating jobs, and handling data transfer between EC2(VM) and S3(Object Storage) are automated by Elastic MapReduce. Apache Hive, which is built on top of Hadoop for providing data warehouse services, is also offered in Elastic MapReduce.[2]

      Support for using Spot Instances[3] was later added in August 2011.[4] Elastic MapReduce is fault-tolerant for slave failures,[5] and it is recommended to only run the Task Instance Group on spot instances to take advantage of the lower cost while maintaining availability.
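
The recommendation above (Spot pricing for the Task Instance Group only) can be sketched as a small configuration builder. This is a minimal illustration rather than a definitive setup: the helper name `build_instance_groups`, the instance type `m5.xlarge`, and the node counts are assumptions chosen for the example. The dictionary keys (`InstanceRole`, `Market`, `InstanceType`, `InstanceCount`) follow the instance-group shape accepted by the boto3 EMR client's `run_job_flow` call, but no AWS call is made here.

```python
def build_instance_groups(core_count=2, task_count=4, instance_type="m5.xlarge"):
    """Build an instance-group list where only the TASK group uses Spot.

    The function name, default counts, and instance type are hypothetical
    values for illustration; adjust them to the actual workload.
    """
    return [
        {
            # The master node coordinates the cluster, so keep it On-Demand.
            "Name": "Master",
            "InstanceRole": "MASTER",
            "Market": "ON_DEMAND",
            "InstanceType": instance_type,
            "InstanceCount": 1,
        },
        {
            # Core nodes hold HDFS data; losing them risks data loss,
            # so they also stay On-Demand.
            "Name": "Core",
            "InstanceRole": "CORE",
            "Market": "ON_DEMAND",
            "InstanceType": instance_type,
            "InstanceCount": core_count,
        },
        {
            # Task nodes only run computation, so a reclaimed Spot
            # instance costs retried work rather than stored data.
            "Name": "Task",
            "InstanceRole": "TASK",
            "Market": "SPOT",
            "InstanceType": instance_type,
            "InstanceCount": task_count,
        },
    ]


groups = build_instance_groups()
spot_roles = [g["InstanceRole"] for g in groups if g["Market"] == "SPOT"]
print(spot_roles)
```

A list like `groups` would typically be supplied as `Instances={"InstanceGroups": groups}` alongside the other required arguments of `run_job_flow`; those arguments (cluster name, release label, IAM roles, and so on) are omitted here because the source does not specify them.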
