Tag Archives: Hadoop

Apache Hadoop Getting Started Example

1. Introduction This is an in-depth article on getting started with Apache Hadoop. Hadoop is an open-source project whose ecosystem includes tools such as Pig, Hive, HBase, Phoenix, Spark, ZooKeeper, Flume, Sqoop, Oozie, and Storm, and which is packaged by vendors such as Cloudera. MapReduce is the part of Hadoop used for big data processing. 2. Apache Hadoop Getting Started Hadoop is an open-source framework for distributed ...

Prerequisites for Learning Hadoop

In this article, we look in depth at the prerequisites for learning and working with Hadoop. We will cover what you must know and what the industry generally recommends knowing before you start learning Hadoop. 1. Introduction Apache Hadoop is the entry point or ...

Apache Hadoop Wordcount Example

In this example, we will demonstrate the Word Count example in Hadoop. Word count is the basic example for understanding the Hadoop MapReduce paradigm: we count the number of occurrences of each word in an input file and output a list of the words, each with its count. 1. Introduction Hadoop ...
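
The counting described above can be sketched in plain Java. This is only an in-memory illustration of the map and reduce steps, not the code from the linked article and not Hadoop's MapReduce API; the class name `WordCountSketch`, its method names, and the choice to lowercase words and split on non-word characters are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, Hadoop-free sketch of the word count idea.
public class WordCountSketch {

    // "Map" step: emit a (word, 1) pair for every word in one input line.
    // Assumption: words are lowercased and split on non-word characters.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                pairs.add(Map.entry(token, 1));
            }
        }
        return pairs;
    }

    // "Reduce" step: sum the counts emitted for each distinct word.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            counts.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return counts;
    }

    // Runs the map step over every line, then reduces all emitted pairs.
    static Map<String, Integer> countWords(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        Map<String, Integer> counts =
                countWords(List.of("the quick brown fox", "The lazy dog"));
        // Print each word with its count, tab-separated.
        counts.forEach((word, count) -> System.out.println(word + "\t" + count));
    }
}
```

In a real Hadoop job the two steps would run as separate mapper and reducer tasks across the cluster, with the framework grouping the pairs by word between them; here a single list stands in for that grouping.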

Hadoop Hello World Example

1. Introduction In this post, we feature a comprehensive Hadoop Hello World Example. Hadoop is an Apache Software Foundation project. It is an open-source implementation inspired by Google MapReduce and the Google File System. It is designed for distributed processing of large data sets across a cluster of systems, often running on standard commodity hardware. Hadoop is designed with an assumption ...
