Hello everyone, in this article we are going to learn how to set up Cloudera Quickstart VM on a windows machine. When we start learning Hadoop, we need its installation on a server or standalone system to practice. Though setting up Hadoop on a single machine is not very difficult, there is an easier way to set up Hadoop environment quickly. We can use Cloudera’s Quickstart VMs. These VMs have Hadoop preconfigured and they are free and quick to set up.
For Cloudera Quickstart VM to run on a single system, it should have at least 8 GB of RAM. We can run this on a system with 4GB ram, but performance will be abysmal.
After this, we will need to download hypervisor. The hypervisor is a piece of software on which wraps our VM and runs on a host machine. We can set up Cloudera Quickstart VM using three hypervisors.
1. VMWare (need to pay for a license after free trial)
2. Virtualbox (opensource)
3. KVM (opensource)
You can choose any of these hypervisors. In this article, we are going to use Virtualbox. You can download Virtualbox from here. Choose windows host and download Virtualbox. Once the download is complete, install Virtualbox on your machine.
Cloudera Quickstart VM
After that, we need to download VM from Cloudera. You can get that from this link. On this page, in platform choose Virtualbox and click on GET IT NOW. Then it will ask you to fill up small form. Once you fill up these details and accept terms and conditions, VM download will begin automatically. After the download is complete, we need to extract VM. We can use 7zip for this.
Setting up VM
Now start Virtualbox and click on the Settings icon and then System and choose motherboard tab. There you can set how much RAM Virtualbox can use. Set this value to 4GB or more.
After that, click on File-> Import Appliance. From new window, browse to location to where you have extracted Quickstart VM in an earlier step. Choose *.ovf file. Then click on Next then on import.
It starts importing Cloudera VM on Virtualbox. You can run this VM by clicking on the Start icon in Virtualbox.
Validating VM Set up
Once VM starts running, we can validate it by logging to hue. The default username and password for this all Cloudera VMs is cloudera, cloudera respectively. You can check different services like Hive, Pig from hue.
In this article, we have set up Cloudera Quickstart VM on windows. We will use the same for our future tutorials and Hadoop practice. If you have faced any issues with this setup, then please ask me in the comment section below. See you in the next article.
I am passionate about data analytics, machine learning, and artificial intelligence. Recently I have started blogging about my experience while learning these exciting technologies.