Installing a Hadoop 2.2.0 cluster on Ubuntu 12
10 March 2014
author:xiajun
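I wrote this post mainly to record my own installation process and the pitfalls I ran into. The preliminaries for 2.2.0 are the same as for 1.x (JDK, passwordless SSH login, and so on); see the 1.1.0 installation article for those.
Installation:
1. Download Hadoop 2.2.0
http://apache.fayea.com/apache-mirror/hadoop/common/stable/ (Note: on a 32-bit machine you can download the prebuilt hadoop-2.2.0.tar.gz. On a 64-bit machine, unfortunately, you need to download the source package and build it yourself, because the native libraries in the prebuilt package are 32-bit.)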
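The choice between the two downloads can be sketched as a small shell helper. This is only an illustration: the function name `pick_package` is hypothetical, and the source archive name `hadoop-2.2.0-src.tar.gz` is an assumption inferred from the `hadoop-2.2.0-src` directory used in the build step.

```shell
#!/bin/sh
# pick_package BITS: print which Hadoop 2.2.0 archive to download.
# BITS is the word size of the machine (32 or 64).
# Hypothetical helper; the source archive name is an assumption.
pick_package() {
    case "$1" in
        32) echo "hadoop-2.2.0.tar.gz" ;;      # prebuilt binary can be used directly
        64) echo "hadoop-2.2.0-src.tar.gz" ;;  # source package, to be built locally
        *)  echo "unknown word size: $1" >&2; return 1 ;;
    esac
}
```

On Linux it could be called as `pick_package "$(getconf LONG_BIT)"`.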
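2. Build
If Maven is not installed, install it first; version 3.0.5 is recommended. JDK 1.6 is also recommended, otherwise the build runs into many problems.
Building Hadoop also requires protobuf 2.5 and cmake, so install both before starting.
Installing protobuf 2.5:
tar -zxf protobuf-2.5.0.tar.gz   # version 2.5 is recommended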
cd protobuf-2.5.0
./configure --prefix=/usr   # note: on Ubuntu it must be installed under /usr
make check
make install
Installing cmake:
sudo apt-get install cmake   # this is how to install it on Ubuntu
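Before starting the build it can save time to confirm that the prerequisites above (JDK, Maven, protobuf, cmake) are actually on the PATH. A minimal sketch, assuming a POSIX shell; the function name `missing_tools` is hypothetical.

```shell
#!/bin/sh
# missing_tools TOOL...: print each named command that is not on PATH.
# Returns 0 when every tool is present, 1 otherwise.
# Hypothetical helper, not part of Hadoop or Maven.
missing_tools() {
    status=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool"
            status=1
        fi
    done
    return "$status"
}
```

For this build it could be run as `missing_tools java mvn protoc cmake`. If nothing is printed, `protoc --version` should additionally report `libprotoc 2.5.0`.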
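Building Hadoop:
Unpack the Hadoop 2.2.0 source archive and change into hadoop-2.2.0-src
mvn package -Pdist,native -DskipTests -Dtar   # takes about 10 minutes
When the build finishes, the result is at hadoop-2.2.0-src/hadoop-dist/target/hadoop-2.2.0.tar.gz
3. Install
Copy the built archive to the server and unpack it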
tar -zvxf hadoop-2.2.0.tar.gz
cd hadoop-2.2.0/etc/hadoop
Edit the four files below. Each entry is one <property> element, written here as name = value.
Edit core-site.xml:
fs.defaultFS = hdfs://master:9000
io.file.buffer.size = 131072
hadoop.tmp.dir = /usr/local/jd/hadoop-2.2.0/tmp (description: A base for other temporary directories.)
hadoop.proxyuser.root.hosts = master
hadoop.proxyuser.root.groups = *
Edit hdfs-site.xml:
dfs.namenode.name.dir = /usr/local/jd/hadoop-2.2.0/namenode
dfs.datanode.data.dir = /usr/local/jd/hadoop-2.2.0/data
dfs.replication = 2
dfs.permissions = false
Edit yarn-site.xml:
yarn.resourcemanager.address = master:18040
yarn.resourcemanager.scheduler.address = master:18030
yarn.resourcemanager.resource-tracker.address = master:18025
yarn.resourcemanager.admin.address = master:18041
yarn.resourcemanager.webapp.address = master:8088
yarn.nodemanager.local-dirs = /usr/local/jd/hadoop-2.2.0/my
yarn.nodemanager.log-dirs = /usr/local/jd/hadoop-2.2.0/logs
yarn.nodemanager.log.retain-seconds = 10800
yarn.nodemanager.remote-app-log-dir = /logs
yarn.nodemanager.remote-app-log-dir-suffix = logs
yarn.log-aggregation.retain-seconds = -1
yarn.log-aggregation.retain-check-interval-seconds = -1
yarn.nodemanager.aux-services = mapreduce_shuffle
yarn.nodemanager.aux-services.mapreduce_shuffle.class = org.apache.hadoop.mapred.ShuffleHandler
Edit mapred-site.xml:
mapreduce.framework.name = yarn
mapreduce.jobhistory.address = master:10020
mapreduce.jobhistory.webapp.address = master:19888
mapreduce.jobhistory.intermediate-done-dir = /mr-history/tmp
mapreduce.jobhistory.done-dir = /mr-history/done
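In the actual files, each of these settings is a `<property>` element with `<name>` and `<value>` children inside a `<configuration>` root. As a rough sketch, the core-site.xml settings (fs.defaultFS through hadoop.proxyuser.root.groups) could be written from the shell as follows. The function name `write_core_site` and its directory argument are hypothetical; the values are the ones from this post and need adjusting to your own host names and paths. The other three files follow the same pattern.

```shell
#!/bin/sh
# write_core_site DIR: write a core-site.xml with the values from this post
# into DIR (for example $HADOOP_HOME/etc/hadoop).
# Hypothetical helper; it overwrites any existing core-site.xml in DIR.
write_core_site() {
    cat > "$1/core-site.xml" <<'EOF'
<?xml version="1.0"?>
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://master:9000</value>
  </property>
  <property>
    <name>io.file.buffer.size</name>
    <value>131072</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/usr/local/jd/hadoop-2.2.0/tmp</value>
    <description>A base for other temporary directories.</description>
  </property>
  <property>
    <name>hadoop.proxyuser.root.hosts</name>
    <value>master</value>
  </property>
  <property>
    <name>hadoop.proxyuser.root.groups</name>
    <value>*</value>
  </property>
</configuration>
EOF
}
```

Called as `write_core_site /usr/local/jd/hadoop-2.2.0/etc/hadoop`, it replaces the default file in the configuration directory, so back up the original first if you have edited it.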
Edit yarn-env.sh
export JAVA_HOME=/usr/local/jd/jdk1.6.0_25
Edit hadoop-env.sh
export JAVA_HOME=/usr/local/jd/jdk1.6.0_25
export HADOOP_HOME=/usr/local/jd/hadoop-2.2.0
Configure environment variables
export JAVA_HOME=/usr/local/jd/jdk1.6.0_25
export PATH=$JAVA_HOME/bin:$PATH
export HADOOP_HOME=/usr/local/jd/hadoop-2.2.0
export HADOOP_COMMON_HOME=$HADOOP_HOME
export HADOOP_HDFS_HOME=$HADOOP_HOME
export HADOOP_MAPRED_HOME=$HADOOP_HOME
export HADOOP_YARN_HOME=$HADOOP_HOME
export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin:$HADOOP_HOME/lib
export HADOOP_OPTS="-Djava.library.path=$HADOOP_HOME/lib"
export JAVA_LIBRARY_PATH=$HADOOP_HOME/lib/native
Start:
./sbin/start-all.sh
http://master:8088 shows running jobs (ResourceManager web UI)
http://master:50070 shows the datanodes (NameNode web UI)
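Besides the web pages, the JDK's `jps` tool lists the running Java processes, which gives a quick way to see whether the daemons came up. The sketch below reads `jps` output on standard input and prints the expected daemons that are missing. The function name `missing_daemons` is hypothetical, and it assumes a `grep` that supports `-w`. On a brand-new cluster, HDFS normally has to be formatted once on the master with `hdfs namenode -format` before the first start.

```shell
#!/bin/sh
# missing_daemons NAME...: read `jps` output on stdin and print every
# expected daemon name that does not appear in it.
# Returns 0 when all are present, 1 otherwise. Hypothetical helper.
missing_daemons() {
    running=$(cat)
    status=0
    for name in "$@"; do
        # jps prints "<pid> <MainClass>"; -w matches the class name as a
        # whole word, so NameNode does not match SecondaryNameNode.
        if ! printf '%s\n' "$running" | grep -qw "$name"; then
            echo "$name"
            status=1
        fi
    done
    return "$status"
}
```

On the master this could be run as `jps | missing_daemons NameNode SecondaryNameNode ResourceManager`, and on each slave as `jps | missing_daemons DataNode NodeManager`.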