I'm trying to evaluate the differences between running Hadoop on EC2 myself and using Elastic MapReduce (EMR). Here are the pros and cons I can think of:
Elastic MapReduce => Better support from Amazon, no need to administer the cluster, more expensive (?)
EC2 + Hadoop => More control over your Hadoop configuration, cheaper (?)
Has anyone benchmarked the performance of EC2 + Hadoop against EMR? Is there a significant difference in cost for large cluster deployments? What other differences exist?
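To make the cost part of the question concrete, this is roughly how I am framing the comparison. As far as I understand, EMR is billed as a per-instance-hour fee on top of the underlying EC2 price. The rates below are made-up placeholders, not real AWS prices, and `cluster_cost` is just a hypothetical helper of mine. It also ignores the admin time that a self-managed cluster needs.

```python
def cluster_cost(nodes, hours, ec2_rate, emr_surcharge=0.0):
    """Total cost in dollars for a cluster billed per instance-hour.

    emr_surcharge is the extra per-instance-hour fee EMR adds on top of
    the EC2 rate; leave it at 0.0 for a self-managed EC2 + Hadoop cluster.
    """
    return nodes * hours * (ec2_rate + emr_surcharge)


# Placeholder rates (assumptions for illustration, NOT real AWS prices).
EC2_RATE = 0.50        # dollars per instance-hour
EMR_SURCHARGE = 0.10   # dollars per instance-hour, added by EMR

nodes, hours = 100, 24  # hypothetical large cluster running for one day

self_managed = cluster_cost(nodes, hours, EC2_RATE)
emr = cluster_cost(nodes, hours, EC2_RATE, EMR_SURCHARGE)

print(f"EC2 + Hadoop: ${self_managed:.2f}")
print(f"EMR:          ${emr:.2f}")
print(f"Difference:   ${emr - self_managed:.2f}")
```

With this framing the gap grows linearly with node count and running time, so what I really want to know is whether the surcharge is offset by anything (performance, less admin work) at scale.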
Copyright notice: Content author: OckhamsRazor. Reproduced under the CC BY-SA 4.0 license with a link to the original source and this disclaimer.
Link to original article: https://stackoverflow.com/questions/15177908/hadoop-on-ec2-vs-elastic-map-reduce