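- Hadoop+Spark大數據巨量分析與機器學習整合開發實戰 (Hadoop+Spark big data analytics and machine learning integrated development in practice), author: 林大貴, publisher: 博碩, publication date: 2015/11/0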
- 10-1 Scala IDE
- 10-2 Download libraries
- mkdir -p ~/workspace/Lib
- sudo cp /usr/local/spark/lib/spark-assembly-1.4.0-hadoop2.6.0.jar ~/workspace/Lib
- cd ~/workspace/Lib
- Download joda-time
- wget http://www.java2s.com/Code/JarDownload/joda/joda-time-2.2.jar.zip
- unzip -j joda-time-2.2.jar.zip
- Download jfreechart
- wget http://www.java2s.com/Code/JarDownload/jfreechart/jfreechart-1.0.3.jar.zip
- unzip -j jfreechart-1.0.3.jar.zip
- Download jcommon
- wget http://www.java2s.com/Code/JarDownload/jcommon/jcommon-1.0.16.jar.zip
- unzip -j jcommon-1.0.16.jar.zip
- rm *.zip
- ll
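The three downloads in 10-2 follow the same wget / unzip -j / rm pattern, so they can be folded into one helper. The following is only a sketch: the function names `jar_url`, `fetch_jar` and `fetch_all_jars` are hypothetical (they are not from the book), and it assumes the java2s URLs listed above are still reachable.

```shell
# Hypothetical helpers, not from the book; they only wrap the commands listed in 10-2.

# Build the java2s download URL for a jar.
# $1 = directory on the site (e.g. joda), $2 = jar name without ".jar.zip"
jar_url() {
  printf 'http://www.java2s.com/Code/JarDownload/%s/%s.jar.zip\n' "$1" "$2"
}

# Download one zipped jar into the current directory,
# unpack it without its internal paths (unzip -j), then delete the zip.
fetch_jar() {
  wget "$(jar_url "$1" "$2")" &&
    unzip -j "$2.jar.zip" &&
    rm "$2.jar.zip"
}

# Fetch the three libraries from 10-2 into ~/workspace/Lib.
# The body runs in a subshell, so the caller's working directory is unchanged.
fetch_all_jars() (
  mkdir -p ~/workspace/Lib && cd ~/workspace/Lib || exit 1
  fetch_jar joda joda-time-2.2 &&
    fetch_jar jfreechart jfreechart-1.0.3 &&
    fetch_jar jcommon jcommon-1.0.16
)
```

Running `fetch_all_jars` and then `ls -l ~/workspace/Lib` should list the three jars. Unlike the single `rm *.zip` above, each zip is removed only after its own unzip succeeds, so a failed download leaves its zip in place for inspection.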
- 10-3 Eclipse
- 10-7 Download WordCount test data
- cd ~/workspace/WordCount
- mkdir data
- cd data
- wget http://www.gutenberg.org/cache/epub/5000/pg5000.txt
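Before feeding the downloaded text to a WordCount program, a quick shell count can confirm that the file arrived and give a rough baseline. This is a sketch only: `word_counts` is a hypothetical helper name, and splitting on whitespace is an assumption that may not match the tokenization the book's WordCount program uses.

```shell
# Hypothetical helper: count whitespace-separated words read from stdin.
# Prints "word count" pairs, most frequent first, ties in byte order.
word_counts() {
  tr -s '[:space:]' '\n' |        # one word per line, runs of whitespace squeezed
    grep -v '^$' |                # drop the empty line left by leading whitespace
    LC_ALL=C sort |               # group identical words together
    uniq -c |                     # prefix each distinct word with its count
    LC_ALL=C sort -k1,1nr -k2,2 | # highest count first, then by word
    awk '{print $2, $1}'          # reorder to "word count"
}
```

For example, `word_counts < ~/workspace/WordCount/data/pg5000.txt | head` shows the most frequent words in the downloaded file.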
- 10-16 Download sample programs (jdwang: YouTube)
- Sample programs
- cd ~/
- wget http://www.drmaster.com.tw/download/example/MP21517_sample.zip
- ll MP21517_sample.zip
- unzip MP21517_sample.zip
- ll wordcount
- ll SparkExample
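The two `ll` commands above are a manual check that the archive unpacked into the expected directories. A scripted version might look like the sketch below; `check_dirs` is a hypothetical helper name, and the directory names are simply the ones listed in these notes.

```shell
# Hypothetical helper: report whether each given directory exists.
# Returns 0 when all exist, 1 when at least one is missing.
check_dirs() {
  missing=0                 # plain (global) variable, kept POSIX-compatible
  for d in "$@"; do
    if [ -d "$d" ]; then
      echo "ok: $d"
    else
      echo "missing: $d"
      missing=1
    fi
  done
  return "$missing"
}
```

After `unzip MP21517_sample.zip`, running `cd ~/ && check_dirs wordcount SparkExample` reports on both directories and returns a non-zero status if either is absent.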