搜索
人大经济论坛 附件下载

附件下载

所在主题:
文件名:  Lecture Notes Big Data Programming.zip
资料下载链接地址: https://bbs.pinggu.org/a-2000891.html
附件大小:
Syllabus - CS378 - Big Data Programming
Spring 2015
MW 9:30 - 11:00 WAG 214
Unique: 52022
Description

The map-reduce programming paradigm is a fundamental tool used in processing large data sets, and is supported in current tools such as Hadoop and MongoDB. Apache Spark offers another programming paradigm for processing large data sets. In this course you will gain an understanding of the concepts embodied in map-reduce, and will investigate how map-reduce is used to address various problems in processing and analyzing large data sets. This course will explore map-reduce as implemented in Hadoop, as well as the associated distributed file system (HDFS). In this course you will gain an understanding of the concepts offered and supported in Spark, and will investigate how to apply these concepts to address various problems including those you addressed using map-reduce.

Objectives

Upon completing this course, the student will be able to design and implement map-reduce programs for various large data set processing tasks, and will be able to deisgn and implement programs using Apache Spark.

Prerequisites

Data structures, Java programming experience.

Textbooks

  • Required: MapReduce Design Patterns, by Donald Miner and Adam Shook
    • O'Reilly Media
    • Print ISBN: 978-1-4493-2717-0 | ISBN 10: 1-4493-2717-6
    • Ebook ISBN: 978-1-4493-4197-8 | ISBN 10: 1-4493-4197-7
  • Required: Learning Spark, by Holden Karau, Andy Konwinsky, Patrick Wendell, Matei Zaharia
    • O'Reilly Media
    • Print ISBN: 978-1-4493-5862-4 | ISBN 10: 1-4493-5862-4
    • Ebook ISBN: 978-1-4493-5860-0 | ISBN 10: 1-4493-5860-8
  • Recommended: Hadoop: The Definitive Guide, 3rd Edition, by Tom White
    • O'Reilly Media/Yahoo Press
    • Print ISBN: 978-1-4493-1152-0 | ISBN 10: 1-4493-1152-0
    • Ebook ISBN: 978-1-4493-1151-3 | ISBN 10: 1-4493-1151-2

Instructor

David Franke
Email: dfranke@cs.utexas.edu
Office: GDC 4.706
Office Hours:

  • M 11:00 AM - 12:00 PM
  • T 12:00 PM - 1:00 PM
  • By appointment
[hide][/hide]




    熟悉论坛请点击新手指南
下载说明
1、论坛支持迅雷和网际快车等p2p多线程软件下载,请在上面选择下载通道单击右健下载即可。
2、论坛会定期自动批量更新下载地址,所以请不要浪费时间盗链论坛资源,盗链地址会很快失效。
3、本站为非盈利性质的学术交流网站,鼓励和保护原创作品,拒绝未经版权人许可的上传行为。本站如接到版权人发出的合格侵权通知,将积极的采取必要措施;同时,本站也将在技术手段和能力范围内,履行版权保护的注意义务。
(如有侵权,欢迎举报)
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

GMT+8, 2026-1-10 19:13