Hadoop Professionals

A Community for Hadoop Users

This network is a place to discuss and learn Hadoop, Solr, Katta, Map Reduce, Machine Learning and Big Data

Members

Latest Activity

vinoth459 posted a discussion

Problem in Hadoop-eclipse configuration

Hi all,I am currently trying to execute map/reduce using eclipse.I am referring this linkhttp://developer.yahoo.com/hadoop/tutorial/module3.htmlWhen i was trying to do  Step 4: Add the Server"Advanced" tab---> I could not able to see  hadoop.job.ugi .So,I couldn't move further, what should i do next ?I attached some image file , it will show how my eclipse looks now.Apart from this , is there anyway to connect eclipse and…See More
7 hours ago
Profile IconHuangBin, Morning Liu, AnExplorer and 1 more joined Hadoop Professionals
yesterday
vinoth459 updated their profile
yesterday
vinoth459 and Yerriswamy.Andra are now friends
yesterday
lavanya posted a discussion

Hadoop mapreduce , Too many files open , can not execute process.

I am using hadoop 1.0.0 In my reducer implementation R code is executed through Rcaller . In the middle of execution i got an error Too many file open handles . can not execute process. I checked Rcaller code it is not creating many open file handles. It seems mapreduce has many open handles. Could you please suggest me any solution for this problem,and why Too many file open handles?  See More
Tuesday
vinoth459 commented on Yerriswamy.Andra's blog post How to make connection between Hadoop and Eclipse
"Hi friend, I am also searching related on that. I got something. Please check this links, 1) http://code.google.com/p/hadoop-eclipse-plugin/downloads/detail?name=hadoop-0.20.1-eclipse-plugin.jar&can=2&q=2)…"
Monday
lavanya is now a member of Hadoop Professionals
May 8
Debarshi Bhattacharya posted a discussion

Chaining hadoop mapreduce jobs

I have two casesCase 1: I wish to run two mapreduce jobs as chaining say the output of one map will be the input of the otherCase 2: I wish to run two separate mapreduce jobs in a single mapreduce algorithmAny help in type of code will be really helpful. Urgent.....See More
May 7
Debarshi Bhattacharya is now a member of Hadoop Professionals
May 7
Profile Iconsambath, Nagamallikarjuna and Anamika joined Hadoop Professionals
Apr 30
vinoth459 posted a discussion

Problem in connectivity

Hi all,I am currently working related on loading data from hadoop to hive. I can load data from hadoop to hive in single node Hadoop environment. But in cluster ,I got some exception when I am trying to start hivethe error as follows:hduser@cloudx:~/hive/hive-0.8.1-bin/bin$ ./hiveWARNING: org.apache.hadoop.metrics.jvm.EventCounter is deprecated. Please use org.apache.hadoop.log.metrics.EventCounter in all the log4j.properties files.Logging initialized using configuration in…See More
Apr 26
vinoth459 replied to vinoth459's discussion Need execution steps
"thank you very much friend"
Apr 25
Profile Iconshamsheerkhan, Radhakrishnan and Mahesh Hiremath joined Hadoop Professionals
Apr 21
Terry Healy replied to Terry Healy's discussion Balancer fails running 1.0.1
"You were correct Ulul - thank you. I ran the start-balancer.sh with progressively smaller -threshold values and when I got down to 2 (%), it started working. Thanks for your help. -Terry"
Apr 20
Ulul replied to Terry Healy's discussion Balancer fails running 1.0.1
"Hi Terry, In my understanding, the threshold that the balancer considers before working is a difference between disk usage. You say you use between 93 and 351 blocks, that represents between 6 and 21 GB (with the default block size of 64 MB). If you…"
Apr 19
Ulul replied to vinoth459's discussion Need execution steps
"Hi Vinoth, Sorry for this long-to-come reply but I'm having a hard time trying to make the couchbase sqoop plugin working. I wrote a very simple program as an example of what you want to do (extract Couchbase data to put it into Hadoop). The…"
Apr 19

Photos

Loading…
  • Add Photos
  • View All
 

Help With Hadoop

A great place to learn Hadoop, and to tune your map reduce jobs.

Ask specific Hadoop questions here to get help from an expert :)

Forum

Problem in Hadoop-eclipse configuration

Started by vinoth459 7 hours ago.

Chaining hadoop mapreduce jobs

Started by Debarshi Bhattacharya May 7.

Events

Blog Posts

How to make connection between Hadoop and Eclipse

Hi Friends, 

                Very thanks to you for spending valuable time for me.

I had installed one single node cluster in psudo-distributed mode on linux(RHEL). i had executed one sample mapreduce program through hadoop commands.I got the output correctly. Please help me i want make new programs using eclipse IDE .how to make connection between them.i didn't find any hadoop-ecclipse plugin in contrib folder.please help me the…

Continue

Posted by Yerriswamy.Andra on April 4, 2012 at 8:34pm — 1 Comment

How to set classpath to 'hadoop jar' command line?

Hi,

I wrote a map/red job and tested it in local eclipse. Then I deployed the map/red to the saver linux machine where the hadoop cluster runs by: making my map/red into a jar and collecting all the dependencies in a lib directory. So the deployment is very simple like this:

 

lib

<my map/red jar>

Now, at the command line I tried:

  hadoop jar <my map/red jar> <my map/red class> -libjars lib/<the dependency jar>

But…

Continue

Posted by Yigang Chen on March 19, 2012 at 3:11pm — 3 Comments

Is there a way to assign a codename to a cluster?

Hi,

I'm totally new to hadoop admin. I recently set up a small hadoop cluster. The web map/red admin page shows something like:

172 Hadoop Map/Reduce Administration

 

I wonder if there a way to configure a codename for the cluster so that the admin page would show something like:

<the codename of the cluster> Map/Reduce Administration

 

Anyone knows how to (or not) do this?

Posted by Yigang Chen on March 8, 2012 at 9:36am

re-start hadoop cluster and lost nodes

Hi,

I recently built a small hadoop cluster (hadoop 1.0.0, 4 nodes), and had it run for a while. Then I stopped the cluster to modify the hadoop-env.sh file on all 4 nodes. Specifically I increased the heap size by uncomment

# export HADOOP_HEAPSIZE=2000

to

export HADOOP_HEAPSIZE=2000

When I restarted the cluster, the Hasoop Map/Reduce Administration page showed 0 nodes:

 

Running Map…
Continue

Posted by Yigang Chen on March 8, 2012 at 9:00am — 1 Comment

I want to know the Way of Learning Hadoop

Hi All,this is Yerriswamy.I am beginner in  Hadoop.i am very happy to learn Hadoop.But i need some help from you.i want  to know how to start and what are the major concepts,it means the flow of learning Hadoop.pls help me.

Posted by yerriswamy on February 8, 2012 at 11:03pm — 1 Comment

Yahoo Hadoop Developer Blog

Hadoop Summit 2011 – A Different Approach

Hadoop Summit 2011 is over. If you saw this tweet ”#hadoopsummit planned for 1,500. upped on demand to 1,600. finally accommodated 1,700. ran out of space, good problem to have. :-),” then you probably got an idea of how exciting and mobbed the conference was this year. With folks dropping by from coast-to-coast, and quite [...]

Fourth Annual Hadoop Summit: The Countdown Begins!

On June 29, Yahoo! will host the 4th annual Hadoop Summit at the Santa Clara Convention Center. Hadoop Summit 2011 brings together some of the most influential thought leaders in the space - from Yahoo, Facebook, IBM, NetApp, and others. Jay Rossiter, Senior Vice President of the Yahoo! Cloud Platform Group will open the show [...]

Slides from eric14 talks @ #IbmBigData

Hi Folks, Here are my slides from the IBM big data symposium. This was a good event. IBM announced a new release of their Apache Hadoop based Big Insights platform. It is great to hear their commitment to Apache. Yahoo was there talking about our experiences and uses of Hadoop. I got a lot of [...]

Hadoop Summit CFP closing tomorrow!

Stack and I are the track organizers for the community track at the Hadoop Summit this year. The community track is for presentations on roadmap, developments and features in Apache Hadoop. So if you've added a new feature to Hadoop and want to publicize it to the world's largest and most important Hadoop conference, please [...]

Call for participation in the Hadoop Summit Research Track

Hadoop Summit is a great annual gathering of developers to talk about all things Hadoop. The attendance is great, we are expecting 2000 this year; the presentations are excellent; and the hallway conversations are a great way to meet new people and come up with new ideas. This environment is especially great if you have [...]

Cloudera Hadoop Blog

Meet the Presenter: Todd Lipcon

Today’s interview features Todd Lipcon, software engineer for Cloudera. Todd will be presenting Optimizing MapReduce Job Performance at Hadoop Summit. Question: Tell us about your current role and how you interact with Apache Hadoop? Todd: I’m a software engineer on Cloudera’s platform engineering team, where I spend most of my time contributing code to open [...]

Cloudera Manager 4.0 Beta released

We’re happy to announce the Beta release of Cloudera Manager 4.0.  This version of Cloudera Manager includes support for CDH4 Beta2 and several new features for both the Free edition and the Enterprise edition. Please try it out and send your comments to beta@cloudera.com. As always, we look forward to your feedback. 

CDH3 update 4 is now available

We are happy to officially announce the general availability of CDH3 update 4. This update consists primarily of reliability enhancements as well as a number of minor improvements. First, there have been a few notable HBase updates. In this release, we’ve upgraded Apache HBase to upstream version 0.90.6, improving system robustness and availability. Also, some [...]

Meet the Presenters: Aaron Myers from Cloudera and Suresh Srinivas from Hortonworks

This was originally posted on the Hadoop Summit 2012 blog. Today’s “Meet the Presenters” interview features two speakers: Aaron Myers from Cloudera and Suresh Srinivas from Hortonworks. Aaron and Suresh will be presenting on HDFS NameNode High Availability, one of the hottest topics in the Apache Hadoop space today. Question: Tell us about your current role and [...]

Announcing Apache Hive 0.9.0

This past Monday marked the official release of Apache Hive 0.9.0. Users interested in taking this release of Hive for a spin can download a copy from the Apache archive site. The following post is a quick summary of new features and improvements users can expect to find in this update of the popular data warehousing system for Hadoop. The 0.9.0 release continues the trend of [...]
 
 
 



Groups

Badge

Loading…

© 2012   Created by Jason Venner.

Badges  |  Report an Issue  |  Terms of Service