Tag «Local(Standalone)Mode»

BigData Investigation 8 – Using Hadoop Streaming on Hadoop Cluster in Local (Standalone) Mode

In this post I will explain how to run the Hadoop Streaming utility on a Hadoop cluster in Local (Standalone) Mode. Hadoop Streaming uses executables or scripts to create a MapReduce job and submits the job to a Hadoop cluster. In an earlier post I explained how to download and install Apache Hadoop in …

BigData Investigation 7 – Installing Apache Hadoop in Local (Standalone) Mode

In this post I will explain how to download Apache Hadoop and install it on CentOS 7 Linux in Local (Standalone) Mode. In earlier posts I used the Cloudera Quickstart VM to describe how to create MapReduce applications with Python and Hadoop Streaming. Using pre-configured Hadoop clusters like the Cloudera Quickstart VM is convenient …

BigData Investigation 6 – Hadoop Cluster Modes

In the last post (BigData Investigation 5 – MapReduce with Python and Hadoop Streaming) we came across different Hadoop cluster modes. This post explains the three modes in which a Hadoop cluster can be configured. Fully-Distributed Mode allows Hadoop clusters to be configured with anywhere from a few nodes to thousands …