Where can I find AMI for Hadoop on EC2?
NickName:Ashwin Ask DateTime:2013-10-09T07:49:59

I am trying to set up Hadoop permanently on Amazon EC2. Currently, every morning I launch EC2 instances and set up Hadoop on them. Is there any way I can avoid this tedious step? I am looking for a Hadoop image that can be loaded on EC2 to make things easier.

I know I can use EMR for Hadoop services, but I don't know how to start an EMR (Hadoop) cluster without submitting a job flow. I need a Hadoop cluster without any jobs running on it.

Ultimately, my aim is to run bioinformatics applications such as Distmap and Seal. These applications have many dependencies, so I need a free Hadoop cluster (one with no jobs running) on which to set up the environment before running them. I hope it's clear what I am trying to do.

Thanks.

Copyright Notice:Content Author:「Ashwin」,Reproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/19260557/where-can-i-find-ami-for-hadoop-on-ec2

Answers
andreimarinescu 2013-10-11T09:27:27

You can do one of the following:

Option 1. Start out with an EBS-backed EC2 instance running your favourite Linux distro. Install the Hadoop software that you need. Create one EC2 instance for each type of node you are going to need (master, slaves, etc.). You can then create your own AMIs in the AWS Console (right-click on the EC2 instance and click "Create AMI"), and launch as many instances as you need based on those AMIs. You can also create AMIs from instance-store-backed instances, but that means dumping everything to S3 and creating the AMI from there. There are a lot of tutorials about this available; please leave a comment if you need directions :)

Option 2. Start out with a Hadoop-based AMI, then repeat the steps above after applying your own configuration and adding your dependencies. I searched for Hadoop AMIs in the AWS Console and there are 48 in EU-West-1 (not sure what region you're working with).

Option 3. Start an EMR cluster in interactive mode. There is also an option to keep the cluster alive after it finishes its job flows. If you also set the EC2 keys for the EMR instances, you should be able to SSH into them and have a functional Hadoop cluster (not sure about the dependencies though; you might be better off rolling your own).

I hope I understood correctly what you're trying to achieve and that this helps a little.
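As a rough illustration of Options 1 and 3, here is a minimal Python sketch using boto3. It is not from the original answer, which only describes the AWS Console. It assumes boto3 is installed and credentials are configured. The image name `hadoop-node`, the cluster name `hadoop-sandbox`, the instance type `m5.xlarge`, and the helper function names are all hypothetical placeholders. The default EMR roles (`EMR_DefaultRole`, `EMR_EC2_DefaultRole`) are assumed to exist in your account, and the EMR release label is left to the caller. The key setting for the question is `KeepJobFlowAliveWhenNoSteps`, which asks EMR to keep the cluster running even though no steps were submitted.

```python
"""Sketch of Options 1 and 3 with boto3 (assumed installed and configured)."""


def create_image_args(instance_id, name):
    """Arguments for EC2 create_image: snapshot a configured instance as an AMI."""
    return {
        "InstanceId": instance_id,
        "Name": name,
        "Description": "Hadoop node with dependencies preinstalled",
    }


def run_instances_args(image_id, count, instance_type, key_name):
    """Arguments for EC2 run_instances: launch `count` nodes from the AMI."""
    return {
        "ImageId": image_id,
        "InstanceType": instance_type,
        "KeyName": key_name,
        "MinCount": count,
        "MaxCount": count,
    }


def keep_alive_cluster_args(name, key_name, release_label, node_count):
    """Arguments for EMR run_job_flow: a cluster with no steps that stays up."""
    return {
        "Name": name,
        "ReleaseLabel": release_label,
        "Applications": [{"Name": "Hadoop"}],
        "Instances": {
            "MasterInstanceType": "m5.xlarge",  # placeholder instance type
            "SlaveInstanceType": "m5.xlarge",   # placeholder instance type
            "InstanceCount": node_count,
            "Ec2KeyName": key_name,             # lets you SSH into the nodes
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",   # assumed to exist already
        "ServiceRole": "EMR_DefaultRole",       # assumed to exist already
    }


def bake_and_launch(instance_id, count, key_name, region):
    """Option 1: create an AMI from a configured instance, then launch from it."""
    import boto3  # imported lazily so the helpers above work without it

    ec2 = boto3.client("ec2", region_name=region)
    image = ec2.create_image(**create_image_args(instance_id, "hadoop-node"))
    image_id = image["ImageId"]
    # Block until the AMI is ready before launching instances from it.
    ec2.get_waiter("image_available").wait(ImageIds=[image_id])
    ec2.run_instances(
        **run_instances_args(image_id, count, "m5.xlarge", key_name)
    )
    return image_id


def start_idle_emr_cluster(key_name, release_label, region):
    """Option 3: start an EMR cluster that stays alive with no steps."""
    import boto3

    emr = boto3.client("emr", region_name=region)
    response = emr.run_job_flow(
        **keep_alive_cluster_args("hadoop-sandbox", key_name, release_label, 3)
    )
    return response["JobFlowId"]
```

Separating the argument builders from the API calls lets you inspect the request before anything is launched (and billed). A cluster started this way keeps running until you terminate it explicitly.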

