Apache Hadoop

Hadoop Sequence File Example

In this article, we will look at the Hadoop Sequence File format. Sequence Files are one of the Apache Hadoop-specific file formats and store data as serialized key-value pairs. We will look into the details of Hadoop Sequence Files in the subsequent sections. 1. Introduction Apache Hadoop supports text files, which are quite commonly used for storing the ...

Read More »

The Best Hadoop Analytics Solutions

Data analytics using Hadoop is one of the most important requirements in business today, owing to the amount of data being generated and the value businesses can derive from it. We will look at some of the best Hadoop analytics solutions available on the market for data analysis. ...

Read More »

How Does Hadoop Work

Apache Hadoop is open source software for distributed computing that can process large amounts of data and return results faster using a reliable and scalable architecture. Apache Hadoop runs on top of a commodity hardware cluster consisting of multiple systems, which can range from a couple of systems to thousands of systems. This cluster and involvement of multiple systems ...

Read More »

The Hadoop Ecosystem Explained

In this article, we will go through the Hadoop Ecosystem and see what it consists of and what its different projects are able to do. 1. Introduction Apache Hadoop is an open source platform managed by the Apache Software Foundation. It is written in Java and is able to process large amounts of data (generally called Big Data) in distributed ...

Read More »

Big Data Hadoop Tutorial for Beginners

This tutorial is for beginners who want to start learning about Big Data and the Apache Hadoop Ecosystem. It introduces the different concepts of Big Data and Apache Hadoop, which will set the foundation for further learning. Table Of Contents 1. Introduction 2. Big Data? 2.1 Examples of Big Data. 3. Characteristics of Big Data 3.1 ...

Read More »

Prerequisites for Learning Hadoop

In this article, we will dig deep to understand the prerequisites for learning and working with Hadoop. We will see what is required and what the industry standard suggests you know before you start learning Hadoop. 1. Introduction Apache Hadoop is the entry point or ...

Read More »

Hadoop Mapreduce Combiner Example

In this example, we will learn about Hadoop Combiners. Combiners are highly useful functions offered by Hadoop, especially when we are processing large amounts of data. We will understand combiners using a simple question. 1. Introduction The Hadoop Combiner class is an optional class in the MapReduce framework which is added in between the Map class and the Reduce class ...

Read More »

Apache Hadoop as a Service Options

In this article, we will have a look at the available options for using Hadoop as a service, aka HDaaS. Implementing a Hadoop cluster on in-house infrastructure is a complex task in itself and needs a dedicated, expert team. To address this complexity, many vendors provide cloud implementations of Hadoop clusters, and we will have a ...

Read More »

Apache Hadoop Hue Tutorial

In this tutorial, we will learn about Hue. This is a basic tutorial to start understanding what Hue is and how it can be used in the Hadoop and Big Data Ecosystem. 1. Introduction First of all, what is Hue? Hue is an open source web interface for analyzing data with any Apache Hadoop based ...

Read More »

Apache Hadoop Administration Tutorial

In this tutorial, we will look into the administration responsibilities and how to administer a Hadoop cluster. 1. Introduction Apache Hadoop administration includes Hadoop Distributed File System (HDFS) administration as well as MapReduce administration. We will look into both aspects. MapReduce administration means the admin needs to monitor the running applications and tasks, application status, node configurations for running MapReduce ...

Read More »