(Python version, 較新)Python+Spark 2.0+Hadoop機器學習與大數據分析實戰,林大貴,出版商: 博碩,出版日期: 2016-10-03,語言: 繁體中文,ISBN: 9864341537,ISBN-13: 9789864341535
部落格 http://pythonsparkhadoop.blogspot.tw ,
Hadoop Multi-Node SetUp (I627 PC : RAM 2GB, HD: 20 GB)
- VirtualBox & Lunix Installization (Python+Spark 2.0+Hadoop機器學習與大數據分析實戰,林大貴)
Virtual Box
-
VirtualBox Download
VirtualBox Download(old)
-
1_VirtualBox_DirSetup
VM : 檔案 (File)/喜好設定(Preferences)/VirDefault Directory/Your Own Subdirectory(Backup)
- VM => Name:Hadoop, TypeL Linux, Verions: Ubuntu (64 bit)
Hardware: RAM: 4GB, HD :100 GB
Check "Your Own Subdirectory(Backup)"
-
Ubuntu_Download
OS:
ubuntu-14.04.5-desktop-amd64.iso14.04 LTS(支援至 2019 年 04 月) (LTS: Long Term Support)(Ok)
16.04 LTS (unknown)(支援至 2021 年 04 月)
OS: Linux Ubuntu 14.04
(check:disable)(Download update while installing)
(check: ?)(Install this third-party software)
UserName: hduser
{reboot}r
安裝 Guest Additions
{裝置/插入 Guest Addtion CD image}
{reboot}
設定共用剪貼簿: {Devices/Shared Clipboard/bidirection}
設定最佳下載伺服器(for apt-get){系統設定/系統設定值=> Software& Updates/Select Best Server}:
Chapter Ubuntu Linux Installization
Single-Node:
第4章 Hadoop 2.6 Single Node Cluster 安裝指令
Hadoop 官網
http://apache.stu.edu.tw/hadoop/common/hadoop-2.6.4/hadoop-2.6.4-src.tar.gz (Non-available)
http://apache.stu.edu.tw/hadoop/common/hadoop-2.6.5/hadoop-2.6.5-src.tar.gz
Multi-Node :第5章 Hadoop 2.6 Multi Node Cluster安裝指令
第6章. Hadoop HDFS命令介紹
第7章. Hadoop MapReduce介紹