
[GitHub] Scala and Spark for Big Data Analytics


#1 (OP)
Reader's, posted 2017-8-13 04:41:57


Scala and Spark for Big Data Analytics


This is the code repository for Scala and Spark for Big Data Analytics, published by Packt. It contains all the supporting project files necessary to work through the book from start to finish.

About the Book

This book is divided into three parts. The first part introduces Scala programming, covering the fundamentals you need to program with Spark. The second part introduces Spark and the design choices behind it, and shows you how to perform data analysis with it. Finally, the book moves on to advanced Spark topics, such as monitoring, configuration, debugging, testing, and deployment.

Instructions and Navigation

All of the code is organized into folders. Each folder starts with a number followed by the application name. For example, Chapter02.

The code will look like the following:

package com.chapter11.SparkMachineLearning
import org.apache.spark.mllib.feature.StandardScalerModel
import org.apache.spark.mllib.linalg.{ Vector, Vectors }
import org.apache.spark.sql.{ DataFrame }
import org.apache.spark.sql.SparkSession

To follow this book, you need basic to medium-level knowledge of the Java programming language. A basic knowledge of concurrency concepts is also helpful.

Related Products

Suggestions and Feedback

Click here if you have any feedback or suggestions.

https://github.com/PacktPublishing/Scala-and-Spark-for-Big-Data-Analytics





Keywords: Analytics, Big Data, GitHub, Spark


#2
Reader's, posted 2017-8-13 04:42:57
package com.chapter3.ScalaFP

import scala.collection._
import scala.collection.mutable.Buffer
import scala.collection.mutable.HashMap

object CollectionExample {
  def main(args: Array[String]) {
    val x = 10
    val y = 15
    val z = 19

    // One constructor call for each major type in the collections hierarchy
    Traversable(1, 2, 3)
    Iterable("x", "y", "z")
    Map("x" -> 10, "y" -> 13, "z" -> 17)
    Set("Red", "Green", "Blue")
    SortedSet("Hello,", "world!")
    Buffer(x, y, z)
    IndexedSeq(0.0, 1.0, 2.0)
    LinearSeq(x, y, z)
    List(2, 6, 10)
    HashMap("x" -> 20, "y" -> 19, "z" -> 16)

    // map builds a new collection of the same kind
    val list = List(1, 2, 3) map (_ + 1)
    println(list)

    val set = Set(1, 2, 3) map (_ * 2)
    println(set)

    val list2 = List(x, y, z).map(x => x * 3)
    println(list2)
  }
}
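For reference, only the last three statements print anything; the run should output:

List(2, 3, 4)
Set(2, 4, 6)
List(30, 45, 57)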

#3
Reader's, posted 2017-8-13 04:43:41
package com.chapter3.ScalaFP

object ListScala {
  def main(args: Array[String]) {
    val eventList = List(2, 4, 6, 8, 10) // A simple list
    val mappedList = eventList.map(x => x * 2) // Map each value by multiplying it by 2
    println("Original list: " + eventList)
    println("Mapped list: " + mappedList)

    // Use map to build a list of Options from a function
    def func(x: Int) = if (x > 4) Some(x) else None
    val newList = eventList.map(x => func(x))
    println("New list: " + newList)
  }
}
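For reference, the expected output:

Original list: List(2, 4, 6, 8, 10)
Mapped list: List(4, 8, 12, 16, 20)
New list: List(None, None, Some(6), Some(8), Some(10))

Note that map keeps one element per input, so values that fail the predicate remain as None; eventList.flatMap(func) would instead flatten the Options away, leaving List(6, 8, 10).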

#4
Reader's, posted 2017-8-13 04:44:14
package com.chapter3.ScalaFP

object MonadiacExample {
  def main(args: Array[String]) {
    // Monadic example 1: count down from 10 by 2
    for (x <- 10 until (0, -2))
      yield x
    // Monadic example 2: keep only the even values
    for (x <- 1 to 10 if x % 2 == 0)
      yield x
    // Monadic example 3: all pairs (x, y) with y < x
    for (x <- 1 to 10; y <- 1 until x)
      yield (x, y)
    // The same pairs, written with flatMap/map directly
    (1 to 10).flatMap(i => (1 until i).map(j => (i, j)))
  }
}
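None of these comprehensions print anything; each just builds a sequence. The point is that the compiler rewrites for/yield into map, flatMap, and withFilter calls, which is why the last line produces the same pairs as example 3. A minimal sketch (not from the repository) of how the first two examples desugar:

// example 1: Vector(10, 8, 6, 4, 2)
(10 until (0, -2)).map(x => x)
// example 2: Vector(2, 4, 6, 8, 10)
(1 to 10).withFilter(x => x % 2 == 0).map(x => x)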

#5
Reader's, posted 2017-8-13 04:44:57
package com.chapter3.ScalaFP

import java.io.IOException
import java.io.FileReader
import java.io.FileNotFoundException

object TryCatch {
  def main(args: Array[String]) {
    try {
      val f = new FileReader("data/data.txt")
    } catch {
      // The more specific exception type must be matched first
      case ex: FileNotFoundException => println("File not found exception")
      case ex: IOException => println("IO Exception")
    } finally {
      println("Finally block always executes")
    }
  }
}
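A more functional alternative is scala.util.Try, which turns the exception into a value you can pattern match on. A minimal sketch (not from the book's repository; the object name TryExample is made up):

package com.chapter3.ScalaFP

import java.io.FileReader
import scala.util.{Try, Success, Failure}

object TryExample {
  def main(args: Array[String]) {
    // Try catches non-fatal exceptions and wraps them in Failure
    Try(new FileReader("data/data.txt")) match {
      case Success(reader) => println("File opened"); reader.close()
      case Failure(ex)     => println("Could not open file: " + ex.getMessage)
    }
  }
}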

#6
Reader's, posted 2017-8-13 04:45:37
package com.chapter3.ScalaFP

object filterExample {
  def main(args: Array[String]) {
    val range = List.range(1, 10)
    println(range)
    // Keep only the odd values
    val odds = range.filter(x => x % 2 != 0)
    println("Odd values: " + odds)
    // Keep only the even values
    val even = range.filter(x => x % 2 == 0)
    println("Even values: " + even)
  }
}
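When you need both halves, the standard library's partition applies the predicate once and returns both lists at the same time. A minimal sketch (not from the repository):

val (odds, evens) = List.range(1, 10).partition(x => x % 2 != 0)
// odds:  List(1, 3, 5, 7, 9)
// evens: List(2, 4, 6, 8)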

#7
Reader's, posted 2017-8-13 04:45:56
package com.chapter3.ScalaFP

object flatMapExample {
  def main(args: Array[String]) {
    val eventList = List(2, 4, 6, 8, 10) // A simple list
    println("Original list: " + eventList)
    // Use map: each element becomes a nested List
    def around(x: Int) = List(x - 1, x, x + 1)
    val newList1 = eventList.map(x => around(x))
    println("New list from map: " + newList1)
    // Use flatMap: the nested lists are flattened into one
    val newList2 = eventList.flatMap(x => around(x))
    println("New list from flatMap: " + newList2)
  }
}
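For reference, the expected output, showing how flatMap flattens the nested lists that map produces:

Original list: List(2, 4, 6, 8, 10)
New list from map: List(List(1, 2, 3), List(3, 4, 5), List(5, 6, 7), List(7, 8, 9), List(9, 10, 11))
New list from flatMap: List(1, 2, 3, 3, 4, 5, 5, 6, 7, 7, 8, 9, 9, 10, 11)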

#8
Reader's, posted 2017-8-13 04:46:46
package com.chapter3.ScalaFP

object mapExample {
  // A reduceByKey for local collections, mimicking Spark's pair-RDD operation:
  // group the pairs by key, then reduce the values within each group
  implicit class MapReduceTraversable[T, N](val traversable: Traversable[(T, N)]) {
    def reduceByKey(f: (N, N) => N) =
      traversable.par.groupBy(_._1).mapValues(_.map(_._2)).mapValues(_.reduce(f))
  }

  def main(args: Array[String]) {
    val eventList = List(2, 4, 6, 8, 10) // A simple list
    println("Original list: " + eventList)
    // Use map
    val newList1 = eventList.map(x => x * 2)
    println(newList1)
    def func(x: Int) = if (x > 4) Some(x) else None
    val newList2 = eventList.map(x => func(x))
    println(newList2)

    // Sequential vs. parallel reduce: the parallel version may combine
    // elements in a different order
    val myList = List(1, 1, 1, 1, 1, 1, 1)
    val reduce = myList.reduce { (x, y) => println(s"$x+$y=${x + y}"); x + y }
    println()
    val parReduce = myList.par.reduce { (x, y) => println(s"$x+$y=${x + y}"); x + y }
    println()

    // Word-count-style example using the implicit reduceByKey above
    val fruits = List("apple", "apple", "orange", "apple", "mango", "orange", "apple", "apple", "apple", "apple")
    val reducebyKeyValue = fruits.map(f => (f, 1)).reduceByKey(_ + _)
    println(reducebyKeyValue)
  }
}
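This local reduceByKey mirrors the operation Spark provides on pair RDDs. A minimal sketch of the same word count on Spark itself (assuming Spark 2.x with a local master; the object name SparkReduceByKey is made up, not from the book's repository):

package com.chapter3.ScalaFP

import org.apache.spark.sql.SparkSession

object SparkReduceByKey {
  def main(args: Array[String]) {
    val spark = SparkSession.builder.master("local[*]").appName("ReduceByKey").getOrCreate()
    val fruits = spark.sparkContext.parallelize(
      List("apple", "apple", "orange", "apple", "mango", "orange"))
    // Same pattern: pair each word with 1, then reduce by key
    val counts = fruits.map(f => (f, 1)).reduceByKey(_ + _)
    counts.collect().foreach(println)
    spark.stop()
  }
}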

#9
军旗飞扬, posted 2017-8-13 06:31:02
Thanks for sharing, OP!

#10
lianqu, posted 2017-8-15 10:32:39
