搜索
人大经济论坛 附件下载

附件下载

所在主题:
文件名:  Intro to Apache Spark Stanford University.pdf
资料下载链接地址: https://bbs.pinggu.org/a-2000864.html
附件大小:
[hide]
  • stanford.edu/~rezab/sparkclass
[/hide]
Spark ClassHosted by Stanford ICME
August 13-15, 2014
Clark Center Auditorium, Stanford University


Organizers:
Reza Zadeh | Matei Zaharia | Ion Stoica


A three-day class on distributed computing, using the high-speed cluster programming framework, Spark. Throughout the class, there will be hands-on exercises with computing resources provided by the organizers.

The class will include introductions to the many Spark features, case studies from current users, best practices for deployment and tuning, future development plans, and hands-on exercises.

Clark Center Auditorium
James H. Clark Center
Stanford, CA 94305
Wednesday August 13th to 15th, 2014

Please register here: Spark class registration

Hands-on Exercises

Please download the course materials here and slides


Course Prerequisites:


  • Laptop with WiFi capabilities
  • Java 6 or 7
Schedule

Day 1 (10am-4pm, lunch break 12:30-1:30pm)

An introduction to Distributed Computing and Spark (Reza Zadeh) [slides]

Hands-on exercises (Paco Nathan): [slides]

  • Installing Spark, then running a first app
  • Theory of operation, major abstractions
  • Historical background
  • Writing/running several example apps
  • Review of the API in Scala, Python, Java

Language Clustering Demo

Databricks Cloud Demo

Day 2 (10am-4pm, lunch break 12:30-1:30pm)

Hands-on exercises (Paco Nathan): [slides]

  • Review: coding assignment
  • Extended Spark examples
  • Unified engine across batch, iterative, SQL, ML, etc.
  • Software development lifecycle: build, deploy, monitor
  • Tooling: Maven, SBT, IPython notebook, etc.
  • Production case studies
  • Other resources for learning

Installing the Cassandra / Spark OSS Stack

Additional materials and exercises

Day 3 (10am-2:30pm, lunch break 11:30-1pm)

  • MLlib and Distributing the Singular Value Decomposition (Reza Zadeh) [slides]
  • Towards an Optimizer for MLbase (Ameet Talwalkar) [slides]
  • Graph Processing with the GraphX library (Ankur Dave) [slides]
  • Spark Streaming (Tathagata Das) [slides]





    熟悉论坛请点击新手指南
下载说明
1、论坛支持迅雷和网际快车等p2p多线程软件下载,请在上面选择下载通道单击右健下载即可。
2、论坛会定期自动批量更新下载地址,所以请不要浪费时间盗链论坛资源,盗链地址会很快失效。
3、本站为非盈利性质的学术交流网站,鼓励和保护原创作品,拒绝未经版权人许可的上传行为。本站如接到版权人发出的合格侵权通知,将积极的采取必要措施;同时,本站也将在技术手段和能力范围内,履行版权保护的注意义务。
(如有侵权,欢迎举报)
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

GMT+8, 2026-1-12 01:46