清理实验环境
[hadoop@server1 hadoop]$ sbin/stop-yarn.sh [hadoop@server1 hadoop]$ sbin/stop-dfs.sh [hadoop@server1 hadoop]$ jps 1583 Jps server1-3 [hadoop@server1 tmp]$ rm -rf *准备环境
在server2 server3 server4 server5中安装jdk并配置环境变量 [hadoop@server3 ~]$ tar zxf jdk-8u181-linux-x64.tar.gz [hadoop@server3 ~]$ ln -s jdk1.8.0_181/ java [hadoop@server3 ~]$ vim .bash_profile [hadoop@server3 ~]$ source .bash_profile [hadoop@server3 ~]$ jps 1647 Jps server2 server3 server4 server5 上安装hadoop [hadoop@server3 ~]$ tar zxf hadoop-3.0.3.tar.gz [hadoop@server3 ~]$ ln -s hadoop-3.0.3 hadoop [hadoop@server1 hadoop]$ vim hadoop-env.sh 54 export JAVA_HOME=/home/hadoop/java搭建zookeeper 添加从节点信息
[hadoop@server2 ~]$ tar zxf zookeeper-3.4.9.tar.gz [hadoop@server2 ~]$ cd zookeeper-3.4.9 [hadoop@server2 zookeeper-3.4.9]$ ls bin dist-maven LICENSE.txt src build.xml docs NOTICE.txt zookeeper-3.4.9.jar CHANGES.txt ivysettings.xml README_packaging.txt zookeeper-3.4.9.jar.asc conf ivy.xml README.txt zookeeper-3.4.9.jar.md5 contrib lib recipes zookeeper-3.4.9.jar.sha1 [hadoop@server2 zookeeper-3.4.9]$ cd conf/ [hadoop@server2 conf]$ ls configuration.xsl log4j.properties zoo_sample.cfg [hadoop@server2 conf]$ cp zoo_sample.cfg zoo.cfg [hadoop@server2 conf]$ vim zoo.cfg server.1=172.25.76.2:2888:3888 server.2=172.25.76.3:2888:3888 server.3=172.25.76.4:2888:3888 [hadoop@server2 conf]$ scp zoo.cfg server3:/home/hadoop/zookeeper-3.4.9/conf hadoop@server3's password: zoo.cfg 100% 1015 1.0KB/s 00:00 [hadoop@server2 conf]$ scp zoo.cfg server4:/home/hadoop/zookeeper-3.4.9/conf hadoop@server4's password: zoo.cfg配置id 并启动zookeeper
[hadoop@server2 conf]$ mkdir /tmp/zookeeper [hadoop@server3 conf]$ mkdir /tmp/zookeeper [hadoop@server4 conf]$ mkdir /tmp/zookeeper [hadoop@server2 conf]$ echo 1 > /tmp/zookeeper/myid [hadoop@server2 zookeeper-3.4.9]$ bin/zkServer.sh start ZooKeeper JMX enabled by default Using config: /home/hadoop/zookeeper-3.4.9/bin/../conf/zoo.cfg Starting zookeeper ... STARTED [hadoop@server3 ~]$ echo 2 > /tmp/zookeeper/myid [hadoop@server3 zookeeper-3.4.9]$ bin/zkServer.sh start ZooKeeper JMX enabled by default Using config: /home/hadoop/zookeeper-3.4.9/bin/../conf/zoo.cfg Starting zookeeper ... STARTED [hadoop@server4 ~]$ echo 3 > /tmp/zookeeper/myid [hadoop@server4 ~]$ cd zookeeper-3.4.9 [hadoop@server4 zookeeper-3.4.9]$ bin/zkServer.sh start ZooKeeper JMX enabled by default Using config: /home/hadoop/zookeeper-3.4.9/bin/../conf/zoo.cfg Starting zookeeper ... STARTED查看状态
[hadoop@server2 zookeeper-3.4.9]$ bin/zkServer.sh status ZooKeeper JMX enabled by default Using config: /home/hadoop/zookeeper-3.4.9/bin/../conf/zoo.cfg Mode: follower [hadoop@server3 zookeeper-3.4.9]$ bin/zkServer.sh status ZooKeeper JMX enabled by default Using config: /home/hadoop/zookeeper-3.4.9/bin/../conf/zoo.cfg Mode: leader [hadoop@server4 zookeeper-3.4.9]$ bin/zkServer.sh status ZooKeeper JMX enabled by default Using config: /home/hadoop/zookeeper-3.4.9/bin/../conf/zoo.cfg Mode: follower [hadoop@server2 bin]$ ./zkCli.sh 回车进入命令行在server1 上配置hadoop
[hadoop@server1 hadoop]$ vim core-site.xml <configuration> <property> <name>fs.defaultFS</name> <value>hdfs://masters</value> </property> <property> <name>ha.zookeeper.quorum</name> <value>172.25.76.2:2181,172.25.76.3:2181,172.25.76.4:2181</value> </property> </configuration> [hadoop@server1 hadoop]$ vim hdfs-site.xml <configuration> <property> <name>dfs.replication</name> <value>3</value> </property> <property> <name>dfs.nameservices</name> <value>masters</value> </property> <property> <name>dfs.ha.namenodes.masters</name> <value>h1,h2</value> </property> <property> <name>dfs.namenode.rpc-address.masters.h1</name> <value>172.25.76.1:9000</value> </property> <property> <name>dfs.namenode.http-address.masters.h1</name> <value>172.25.76.1:9870</value> </property> <property> <name>dfs.namenode.rpc-address.masters.h2</name> <value>172.25.76.5:9000</value> </property> <property> <name>dfs.namenode.shared.edits.dir</name> <value>qjournal://172.25.76.2:8485;172.25.76.3:8485;172.25.76.4:8485/masters</value> </property> <property> <name>dfs.journalnode.edits.dir</name> <value>/tmp/journaldata</value> </property> <property> <name>dfs.ha.automatic-failover.enabled</name> <value>true</value> </property> <property> <name>dfs.client.failover.proxy.provider.masters</name> <value>org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider</value> </property> <property> <name>dfs.ha.fencing.methods</name> <value> sshfence shell(/bin/true) </value> </property> <property> <name>dfs.ha.fencing.ssh.connect-timeout</name> <value>30000</value> </property> </configuration>在server 2 3 4 上启动 zookeeper 集群节点
[hadoop@server2 hadoop]$ bin/hdfs --daemon start journalnode WARNING: /home/hadoop/hadoop-3.0.3/logs does not exist. Creating. [hadoop@server2 hadoop]$ jps 1920 Jps 1768 QuorumPeerMain 1902 JournalNode [hadoop@server3 hadoop]$ bin/hdfs --daemon start journalnode WARNING: /home/hadoop/hadoop-3.0.3/logs does not exist. Creating. [hadoop@server3 hadoop]$ jps 1808 Jps 1677 QuorumPeerMain 1790 JournalNode [hadoop@server4 hadoop]$ bin/hdfs --daemon start journalnode WARNING: /home/hadoop/hadoop-3.0.3/logs does not exist. Creating. [hadoop@server4 hadoop]$ jps 1462 Jps 1432 JournalNode 1322 QuorumPeerMain向server5传文件配置高可用
[hadoop@server1 hadoop]$ pwd /home/hadoop/hadoop [hadoop@server1 hadoop]$ bin/hdfs namenode -format [hadoop@server1 hadoop]$ scp -r /tmp/hadoop-hadoop 172.25.76.5:/tmp/ [hadoop@server1 hadoop]$ pwd /home/hadoop/hadoop/etc/hadoop [hadoop@server1 hadoop]$ scp core-site.xml hdfs-site.xml hadoop@172.25.76.5:/home/hadoop/hadoop/etc/hadoop格式化zookeeper
[hadoop@server1 hadoop]$ bin/hdfs zkfc -formatZK打开hdfs集群
[hadoop@server1 hadoop]$ sbin/start-dfs.sh Starting namenodes on [server1 server5] server1: namenode is running as process 2365. Stop it first. Starting datanodes Starting journal nodes [172.25.76.2 172.25.76.3 172.25.76.4] 172.25.76.3: journalnode is running as process 1790. Stop it first. 172.25.76.2: journalnode is running as process 1902. Stop it first. 172.25.76.4: journalnode is running as process 1432. Stop it first. Starting ZK Failover Controllers on NN hosts [server1 server5] [hadoop@server1 hadoop]$ jps 2711 DFSZKFailoverController 4235 Jps 2365 NameNode [hadoop@server5 ~]$ jps 1328 NameNode 1398 DFSZKFailoverController 1485 Jps
关闭server1 的NameNode
[hadoop@server1 hadoop]$ kill 2365 [hadoop@server1 hadoop]$ jps 4261 Jps 2711 DFSZKFailoverController在server1上传文件成功 通过server5
[hadoop@server1 hadoop]$ bin/hdfs dfs -mkdir -p /user/hadoop [hadoop@server1 hadoop]$ bin/hdfs dfs -mkdir input [hadoop@server1 hadoop]$ bin/hdfs dfs -put etc/hadoop/* input再次打开server1
[hadoop@server1 hadoop]$ bin/hdfs --daemon start namenode [hadoop@server1 hadoop]$ jps 4500 NameNode 2711 DFSZKFailoverController 4572 Jps