Hadoop Professionals

A Community for Hadoop Users

Anantha Srivathsa N
  • Male
  • Pune, Maharashtra
  • India
Share on Facebook Share Twitter

Anantha Srivathsa N's Groups

Anantha Srivathsa N's Discussions

ETL on Hadoop

Hi All,I am looking for integrating an ETL tool with Hadoop. Can someone suggest me the concerns as I have done some background work on basic ETL operations which can be performed for efficient…Continue

Tags: Integration, Hadoop, Hive, ETL

Started Dec 12, 2010

Gifts Received

Gift

Anantha Srivathsa N has not received any gifts yet

Give a Gift

 

Welcome, Anantha Srivathsa N!

Latest Activity

Anantha Srivathsa N commented on Yigang Chen's blog post re-start hadoop cluster and lost nodes
"Please check the logs, you may find the problem which is causing you to restart the cluster... secondly, HEAP SIZE depends on the size of RAM as well. Please configure accordingly, but I don't think this creating problem."
Mar 14
Anantha Srivathsa N joined Jason Venner's group
Thumbnail

HBase Users

A group for HBase users to share use cases, solutions and problems.
Feb 19
Anantha Srivathsa N updated their profile
Nov 27, 2011
Anantha Srivathsa N replied to Jonathan Viccary's discussion JAVA_HOME not set
"Hadoop-0.21.0 is unstable version. Please use the current stable version. http://hadoop.apache.org/common/releases.html"
Jun 17, 2011
Anantha Srivathsa N replied to Jonathan Viccary's discussion JAVA_HOME not set
""source ~/.bash_profile" in the terminal  "
Jun 17, 2011
Anantha Srivathsa N replied to Jonathan Viccary's discussion JAVA_HOME not set
"There is a file called .bash-profile please add the following lines in it at the bottom of the file export JAVA_HOME=<give the path excluding bin> export PATH=$JAVA_HOME/bin:$PATH   and then source this .bash_profile and your JAVA_HOME is…"
Jun 17, 2011
Anantha Srivathsa N updated their profile photo
May 30, 2011
Anantha Srivathsa N replied to Paulo Henrique Ramos's discussion Configuration of Hadoop and a Virtual Machine
"Hi Paulo, This may help u There are a few approaches to configure your Hadoop MapReduce: You can use MRUnit (http://www.cloudera.com/hadoop-mrunit) to write tests for your MapReduce. You'll be able to do this within Eclipse, so it's easy…"
Apr 6, 2011
Anantha Srivathsa N posted a blog post

How Big Data is analyzed using Hadoop

 IntroductionHadoop is rapidly becoming the technology of choice for enterprises that need to effectively collect, store and process large amounts of structured and complex data. The Pentaho BI project is open source application software for enterprise reporting, analysis, dashboard, data mining, workflow and ETL capabilities for business intelligence needs. Business Intelligence is a process for increasing the competitive advantage of a business by intelligent use of available data in…See More
Dec 21, 2010
Anantha Srivathsa N posted a blog post

How Big Data is analyzed using Hadoop

 IntroductionHadoop is rapidly becoming the technology of choice for enterprises that need to effectively collect, store and process large amounts of structured and complex data. The Pentaho BI project is open source application software for enterprise reporting, analysis, dashboard, data mining, workflow and ETL capabilities for business intelligence needs. Business Intelligence is a process for increasing the competitive advantage of a business by intelligent use of available data in…See More
Dec 20, 2010
Anantha Srivathsa N posted a discussion

ETL on Hadoop

Hi All,I am looking for integrating an ETL tool with Hadoop. Can someone suggest me the concerns as I have done some background work on basic ETL operations which can be performed for efficient performance for a large cluster.Currently I have setup a 3 node cluster and Hive is on top of it to extract the data from the HDFS.Any suggestion would be helpful. Thank YouSee More
Dec 12, 2010
Anantha Srivathsa N replied to Madhu's discussion Sample Test Data to Run MapReduce Jobs
"Hi Thanda, You can use any files which contains data. You need not go to any specific data for performing your operation. Hadoop supports structured and Unstructured data."
Nov 19, 2010
Anantha Srivathsa N replied to Sangeetha Sundar's discussion Setting a hadoop cluster
"Hi Sangita Sundar, As per your log files, port 9001 is already allocated to some other service in jobtracker machine. Because of this, datanode & tasktracker are unable to connect to the masternode. Can you please check that."
Nov 19, 2010
Anantha Srivathsa N replied to Balaji's discussion Regarding Hadoop history log files which are in .crc format
"Hi Balaji B, The history web UI is accessible from job tracker web UI. The history files are also logged to user specified directory hadoop.job.history.user.location which defaults to job output directory. The files are stored in…"
Nov 3, 2010
Anantha Srivathsa N replied to Arati Mahimane's discussion How does secondary namenode work?
"Hi, I suppose you need to check your secondary namenode log file where you can observe the errors or warnings. As I have seen log and identified that hadoop saves or updates fsimage file and edits log file after every checkpoint with its location in…"
Sep 21, 2010
Anantha Srivathsa N replied to Arati Mahimane's discussion How does secondary namenode work?
"Hi, Which hadoop version are you using. Because, hadoop-0.20.2 will give you those files immediately when you run cluster mode of hadoop. Instead But coming to version of hadoop-0.21.0, secondary namenode is being replaced by checkpoint node and…"
Sep 21, 2010

Profile Information

Hadoop Experience Level
Intermediate
Current Project
Currently I am working in TCS for CEG-Open Source group which focusses on Open Source products. We do explore the Open Source products like Compiere for ERP, Eucalyptus for Cloud, Portals like Liferay, Alfresco, BI like Jasper Reports, Pentaho and many more
Available for Consulting
Yes
Search Expertise
Beginner
HBase Expertise
Beginner

Anantha Srivathsa N's Blog

How Big Data is analyzed using Hadoop





 

Introduction

Hadoop is rapidly becoming the technology of choice for enterprises that need to effectively collect, store and process large amounts of structured and complex data.

 

The Pentaho BI project is open…

Continue

Posted on December 19, 2010 at 11:00pm

Comment Wall

You need to be a member of Hadoop Professionals to add comments!

Join Hadoop Professionals

  • No comments yet!
 
 
 



Groups

© 2012   Created by Jason Venner.

Badges  |  Report an Issue  |  Terms of Service