Installing Hadoop On Ubuntu – Single node cluster

Apache Hadoop is a fault – tolerant open-source software framework used for the distributed processing of Big Data across clusters of computers.

There are two components of Hadoop.

  • HDFS -Hadoop Distributed File System (NameNode, DataNode)
  • MapReduce (Job Tracker, Task Tracker)

Installation steps: This tutorial describes the installation of Hadoop single node cluster on Ubuntu platform.

Continue reading

Introduction to Linux terminal commands

If you are using Linux distribution (like Ubuntu), you have to learn some basic commands to work with terminal. You can perform routine tasks very quickly. For example you can enter into various directories or search for files etc from the terminal itself. These commands will also be helpful when you write BASH Script. Following are the list of basic commands you need to know if you are a beginner!

Continue reading