Hadoop Distributed File System (HDFS) is a distributed file system that provides scalable and reliable storage for big data applications. In this tutorial, we will learn how to install HDFS on POP! OS Latest.
Before starting the installation process, ensure that:
java -version
. If you do not have JDK installed, you can install it using the command sudo apt install default-jdk
.Follow the below steps to install HDFS on your system:
Open your terminal and run the command sudo apt update
to update the system packages.
Next, we need to download hadoop from the Apache website. Run the command below to download the latest stable release of Hadoop:
wget https://www.apache.org/dist/hadoop/common/hadoop-3.3.1/hadoop-3.3.1.tar.gz
Extract the downloaded tarball file by running the command tar -xzf hadoop-3.3.1.tar.gz
.
Move the extracted folder to the /usr/local
directory by running the command sudo mv hadoop-3.3.1 /usr/local/hadoop
.
Next, we need to set up the environment variables for Hadoop. Open your terminal and run the command sudo nano /etc/environment
to open the environment file in nano editor. Add the following lines to the end of the file:
HADOOP_HOME=/usr/local/hadoop
PATH=$PATH:$HADOOP_HOME/bin
PATH=$PATH:$HADOOP_HOME/sbin
Save and close the file by pressing CTRL + X
, then Y
, and then Enter
.
Finally, we need to format the HDFS file system. To do this, run the command below:
hdfs namenode -format
The above command will format the HDFS file system.
Congratulations! You have successfully installed HDFS on your POP! OS Latest system.
In this tutorial, we have learned how to install HDFS on POP! OS Latest. HDFS is an essential component for distributed storage in a Hadoop environment. Now that you have installed HDFS, you can start exploring big data applications that make use of it.
If you want to self-host in an easy, hands free way, need an external IP address, or simply want your data in your own hands, give IPv6.rs a try!
Alternatively, for the best virtual desktop, try Shells!