Configuring hadoop manually on ec2
NickName:RTF Ask DateTime:2013-04-11T00:31:13

Configuring hadoop manually on ec2

Can someone please point me in the direction of any resources that will help me manually setup/configure Hadoop (1.0.4) on EC2. I agree that there are lots of resources for accomplishing this using tools, services etc. but what I'm looking for is some help figuring out what modifications to manually make to the conf/*.xml files for both slaves and master in order to get Hadoop working.

Right now, I have 5 ec2 instances running and all of them are capable of running hadoop jobs individually in psuedo-distributed mode. So, I need to turn one into the master and the rest into slaves by way of configuring the conf files, such that the slaves know where the namenode and jobtracker is and the master knows about all the slaves.

My understanding is that I will also have to configure the EC2 security group of the instances so that they can all talk to one another on the right port. I think I'm OK with this.

Can anyone help me out with the configuration part, or point me towards something that might help?

Copyright Notice:Content Author:「RTF」,Reproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/15931458/configuring-hadoop-manually-on-ec2

More about “Configuring hadoop manually on ec2” related questions

Configuring hadoop manually on ec2

Can someone please point me in the direction of any resources that will help me manually setup/configure Hadoop (1.0.4) on EC2. I agree that there are lots of resources for accomplishing this using...

Show Detail

Hadoop on Amazon EC2 : Job tracker not starting properly

We are running Hadoop on Amazon EC2 cluster. We start the master, slaves and attach the ebs volumes and finally waiting for hadoop jobtracker, tasktracker etc to start and we have timeout of 3600 s...

Show Detail

configuring Hadoop on OS X got the error

I got the error: line 144: /Library/Java/JavaVirtualMachines/jdk1.7.0_51.jdk/Contents/Home/bin/java: No such file or directory After the configuration following http://www.talendbyexample.com/

Show Detail

Setting up multi node Hadoop cluster automatically

I have an EC2 image that I made with Hadoop installed. However, I set it up to be roleless upon instantiation (it isn't a slave or a master). In order to start a Hadoop cluster I launch as many ins...

Show Detail

Configuring hadoop 2.5 in eclipse

I'm trying to configure map-reduce in eclipse indigo with hadoop version 2.5. I downloaded hadoop 2.5 source and added all the libraries in the eclipse project. While trying to run the project, it...

Show Detail

getting started with the latest version of hadoop and ec2

I am following Hadoop in Action to get started with hadoop with ec2. I'm running on ubuntu and have downloaded and installed the latest version of Hadoop. I am hitting a road block at this command:

Show Detail

Script to set up Hadoop on EC2

The script to setup Hadoop on EC2 as described in https://wiki.apache.org/hadoop/AmazonEC2 has been removed from recent hadoop release. Google points me to an alternative http://whirr.apache.org/...

Show Detail

Hadoop on EC2 vs Elastic Map Reduce

I'm trying to evaluate the differences between these two options. Here are some pros and cons I can think of : Elastic Map Reduce => Better support from Amazon, No need to administer cluster, More

Show Detail

Hadoop initial setup on AWS EC2?

I am just a newbie of Hadoop. does anyone know how to setup hadoop on EC2(micro) instance? or It would be helpful if anyone knows good reference on-line links for Hadoop on EC2. Thank you

Show Detail

firewall configuring on EC2 in AWS

Few days ago I configured firewall on EC2 in AWS. But, the problem was when I configure firewall in EC2, that server went down. Then when I remove this firewall, then the server went up again. but ...

Show Detail