Hadoop Professionals

A Community for Hadoop Users

This network is a place to discuss and learn Hadoop, Solr, Katta, Map Reduce, Machine Learning and Big Data


Latest Activity


  • Add Photos
  • View All

Help With Hadoop

A great place to learn Hadoop, and to tune your map reduce jobs.

Ask specific Hadoop questions here to get help from an expert :)


Online and Classroom Training on Big Data Technology 1 Reply

Started by Mehul Singh. Last reply by jaemsmikky Sep 22, 2014.

Executive briefing on Big Data with Hadoop MasterClass 1 Reply

Started by Mehul Singh. Last reply by jaemsmikky Sep 22, 2014.

Hadoop and BIG DATA Online Training And Placement Assistance 1 Reply

Started by praveenreddy. Last reply by jaemsmikky Sep 19, 2014.


Blog Posts

Big Data Hadoop Online Training with Certification

Big data is a buzzword, or catch-phrase, used to describe a massive volume of both structured and unstructured data that is so large that it's difficult to process using traditional database and software techniques. In most enterprise scenarios the data is too big or it moves too fast or it exceeds current processing capacity. Big data has the potential to help companies improve operations and make faster, more intelligent decisions.

Features of Big…


Posted by MikeWilliam on March 10, 2015 at 9:48pm

Hadoop and BIG DATA Online Training

Qatestingonlinetraining.com is a placeholder of H2Kinfosys renders online training program for big data course. We teach various concepts of big data and its advanced techniques. You can learn in a non-stressful environment with the advantage of self-paced training. Many students have benefitted through our online training programs and got placed in top companies. More emphasis will be given to the beginners and non-technical background learners. Our online courses make our students to get…


Posted by praveenreddy on September 18, 2014 at 9:37pm

Hadoop Online and Classroom Training


UNICOM is starting  new  Hadoop Online Training Batch  from 11th Oct 2013. This training will help you to understand Big Data Hadoop Ecosystem and will give hands on trainings. We also provide Hadoop Classroom Trainings.


For more details you can drop a Email at contact@unicomlearning.com or you can give us a call at  +91-9538878795



Training Wing - UNICOM

Visit us at …


Posted by Mehul Singh on September 16, 2013 at 2:01am

How to install small HDP cluster of two nodes in windows server 2012

This document helps you to install small HDP cluster  of two nodes.

One node is master which will be running as NameNode and Job Tracker and one node is slave which will be running as data node and task tracker.


To get into the details of prerequisites, we need to go through following link.



Posted by Mahabubur Rahaman on May 5, 2013 at 10:33am

Intellipaat provides Hadoop online Training

Intellipaat provides Hadoop online Training.

Hi, We will start a new Hadoop Developer batch from 11th May’13.

Course Content Link: http://intellipaat.com/courses/big-data/#HadoopDevelopers-1

Interested candidates please drop an email for registration at sales@intellipaat.com or give us a call.


Sales Intellipaat Team

Mob: 91-9019368913

Visit us at…


Posted by soniya on May 4, 2013 at 4:08am

Yahoo Hadoop Developer Blog

Cloudera Hadoop Blog

How-to: Build a Machine-Learning App Using Sparkling Water and Apache Spark

Thanks to Michal Malohlava, Amy Wang, and Avni Wadhwa of H20.ai for providing the following guest post about building ML apps using Sparkling Water and Apache Spark on CDH. The Sparkling Water project is nearing its one-year anniversary, which means Michal Malohlava, our main contributor, has been very busy for the better part of this [...]

Continuous Distribution Goodness-of-Fit in MLlib: Kolmogorov-Smirnov Testing in Apache Spark

Thanks to former Cloudera intern Jose Cambronero for the post below about his summer project, which involved contributions to MLlib in Apache Spark. Data can come in many shapes and forms, and can be described in many ways. Statistics like the mean and standard deviation of a sample provide descriptions of some of its important [...]

Kudu: New Apache Hadoop Storage for Fast Analytics on Fast Data

This new open source complement to HDFS and Apache HBase is designed to fill gaps in Hadoop’s storage layer that have given rise to stitched-together, hybrid architectures. The set of data storage and processing technologies that define the Apache Hadoop ecosystem are expansive and ever-improving, covering a very diverse set of customer use cases used [...]

RecordService: For Fine-Grained Security Enforcement Across the Hadoop Ecosystem

This new core security layer provides a unified data access path for all Hadoop ecosystem components, while improving performance. We’re thrilled to announce the beta availability of RecordService, a distributed, scalable, data access service for unified access control and enforcement in Apache Hadoop. RecordService is Apache Licensed open source that we intend to transition to [...]

How-to: Prepare Your Apache Hadoop Cluster for PySpark Jobs

Proper configuration of your Python environment is a critical pre-condition for using Apache Spark’s Python API. One of the most enticing aspects of Apache Spark for data scientists is the API it provides in non-JVM languages for Python (via PySpark) and for R (via SparkR). There are a few reasons that these language bindings have [...]




© 2015   Created by Jason Venner.

Badges  |  Report an Issue  |  Terms of Service