If you've landed on this page, I know your feelings. Wanna know how it feels when it's done? Ride on, it's like Roller Coaster... You have successfully configured a single-node cluster in 7 easy steps . Good! But you are yet to taste the real essence of Hadoop. Recall that the primary purpose of Hadoop is to distribute a very lengthy task to more than one machines. This is exactly what we are going to do, but the only difference is that we will be doing so in Virtual machines. Step 1: Networking We have several things to do first with the existing VM, beginning with disabling IPv6. This is a recommendation because Hadoop currently does not support IPv6 according to their official Wiki . In order to do so, you will have to modify a fine named /etc/sysctl.conf : - Launch your Virtual Machine from Virtualbox - Open your terminal and run: $ sudo nano /etc/sysctl.conf - Add the following lines at the end of the file: # Disable ipv6 net.ipv6.conf.all.disable_ipv6 =...