In this tutorial, we will discuss the steps to install HDFS on Kali Linux Latest. Hadoop Distributed File System (HDFS) is a distributed file system that is designed to run on commodity hardware. It provides a scalable and fault-tolerant storage system for data-intensive applications. HDFS is a core component of the Hadoop ecosystem.
Before we begin with the installation process, ensure that you have the following prerequisites in place:
Follow the steps given below to install HDFS on Kali Linux:
First, extract the Hadoop installation package that you have downloaded from the official website. Open the terminal and navigate to the directory containing the downloaded package. Use the following command to extract the package:
tar -xzvf hadoop-2.x.x.tar.gz
Replace 2.x.x with the version of Hadoop that you have downloaded.
Next, you need to set up environment variables for Hadoop. Open the .bashrc file using the following command:
nano ~/.bashrc
Add the following lines at the end of the file:
export HADOOP_HOME=/path/to/hadoop
export PATH=$PATH:$HADOOP_HOME/bin
Replace /path/to/hadoop with the path where you have extracted the Hadoop installation package.
Save and close the file by pressing Ctrl+X followed by Y.
Now, you need to configure HDFS by editing the hdfs-site.xml file located in the hadoop-2.x.x/etc/hadoop directory. Open the file using the following command:
nano /path/to/hadoop/etc/hadoop/hdfs-site.xml
Add the following lines between the <configuration> and </configuration> tags:
<property>
  <name>dfs.replication</name>
  <value>1</value>
</property>
<property>
  <name>dfs.namenode.name.dir</name>
  <value>/path/to/namenode/directory</value>
</property>
<property>
  <name>dfs.datanode.data.dir</name>
  <value>/path/to/datanode/directory</value>
</property>
Replace /path/to/namenode/directory with the directory where you want to store the NameNode data and /path/to/datanode/directory with the directory where you want to store the DataNode data.
Save and close the file by pressing Ctrl+X followed by Y.
Before starting HDFS, you need to format the NameNode. Use the following command to format the NameNode:
hdfs namenode -format
Finally, you can start HDFS using the following command:
start-dfs.sh
This will start the NameNode and DataNode daemons. You can verify the status of the daemons using the following command:
jps
This command will display a list of running Java processes, including the NameNode and DataNode daemons.
Congratulations! You have successfully installed HDFS on Kali Linux. You can now use HDFS to store and access data for your data-intensive applications.
If you want to self-host in an easy, hands free way, need an external IP address, or simply want your data in your own hands, give IPv6.rs a try!
Alternatively, for the best virtual desktop, try Shells!