🔗Related content
You can find the related notebook at:
You can find the related video at:
You can find the related repo at:
🐱🏍GitHub
You can connect with me at:
Resume 🧾
I will install Hadoop and use a Python library to write a job that answers the question: how many rows exist for each rating?
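To make the goal concrete before installing anything, here is a minimal pure-Python sketch of the map and reduce idea, with no Hadoop involved. It assumes a tab-separated input with the rating in the third column, which is a hypothetical layout, and the names `mapper`, `reducer` and `run_job` are only illustrative, not part of any library.

```python
from collections import defaultdict


def mapper(line):
    """Emit (rating, 1) for one row. Assumes tab-separated fields
    with the rating in the third column (hypothetical layout)."""
    fields = line.rstrip("\n").split("\t")
    yield fields[2], 1


def reducer(rating, counts):
    """Sum all the 1s emitted for a single rating."""
    yield rating, sum(counts)


def run_job(lines):
    """Simulate the shuffle step locally: group mapper output by key,
    then call the reducer once per key."""
    grouped = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            grouped[key].append(value)

    result = {}
    for key in sorted(grouped):
        for rating, total in reducer(key, grouped[key]):
            result[rating] = total
    return result


# Made-up sample rows: user_id, item_id, rating, timestamp
sample_rows = [
    "1\t10\t3\t0",
    "2\t11\t3\t0",
    "3\t12\t1\t0",
    "4\t13\t2\t0",
    "5\t14\t1\t0",
]

print(run_job(sample_rows))  # {'1': 2, '2': 1, '3': 2}
```

On a real cluster Hadoop does the grouping between the map and reduce steps; here `run_job` only imitates it in memory.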
1st – Install Hadoop 🐘
I use the following command, but you can change it to get the latest version:
!wget https://downloads.apache.org/hadoop/common/hadoop-3.3.4/hadoop-3.3.4.tar.gz
If you need another version, you can find it at https://downloads.apache.org/hadoop/common/ and then replace the version in the previous command.
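If you change versions often, a small helper can build the URL for you. This is only a sketch: the function name `hadoop_download_url` is hypothetical, and it assumes every release follows the same folder pattern as the 3.3.4 link above.

```python
HADOOP_BASE_URL = "https://downloads.apache.org/hadoop/common"


def hadoop_download_url(version):
    """Build the tarball URL for a Hadoop version, assuming the
    hadoop-<version>/hadoop-<version>.tar.gz pattern used above."""
    return f"{HADOOP_BASE_URL}/hadoop-{version}/hadoop-{version}.tar.gz"


print(hadoop_download_url("3.3.4"))
# https://downloads.apache.org/hadoop/common/hadoop-3.3.4/hadoop-3.3.4.tar.gz
```

You can then pass the printed URL to `wget` in place of the hard-coded one.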
2nd – Unzip and copy 🔓
I use the following command:
!tar -xzvf hadoop-3.3.4.tar.gz && cp -r hadoop-3.3.4/ /usr/local/
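If you prefer to stay in Python instead of calling the shell, the same two steps can be sketched with the standard library. The function name `extract_and_copy` is hypothetical, and the sketch assumes the archive contains a single top-level folder, as the Hadoop tarball does.

```python
import shutil
import tarfile
from pathlib import Path


def extract_and_copy(archive_path, work_dir, install_root):
    """Extract a .tar.gz into work_dir, then copy its top-level
    folder into install_root. Returns the final folder path."""
    work_dir = Path(work_dir)
    install_root = Path(install_root)
    work_dir.mkdir(parents=True, exist_ok=True)

    with tarfile.open(archive_path, "r:gz") as archive:
        # Assumption: the first entry sits under the single top-level folder
        top_level = archive.getnames()[0].split("/")[0]
        archive.extractall(work_dir)

    target = install_root / top_level
    # dirs_exist_ok needs Python 3.8+; it lets the copy be repeated safely
    shutil.copytree(work_dir / top_level, target, dirs_exist_ok=True)
    return target


# Usage mirroring the command above (not run here):
# extract_and_copy("hadoop-3.3.4.tar.gz", ".", "/usr/local")
```

Writing into `/usr/local` usually needs elevated permissions, the same as with the `cp` command.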
3rd – Set up Hadoop’s Java ☕
I use the following command: