楼主: olderp
1975 8

【独家发布】跟上时代学点儿Big Data 知识---海量数据集的挖掘英文原版P341 [推广有奖]

贵宾

已卖:13410份资源

学术权威

92%

还不是VIP/贵宾

-

TA的文库  其他...

大数据-想说爱你并不容易

经济类童书推荐

有趣演讲

威望
3
论坛币
436686 个
通用积分
104.8211
学术水平
302 点
热心指数
371 点
信用等级
304 点
经验
213028 点
帖子
5540
精华
3
在线时间
5291 小时
注册时间
2007-1-19
最后登录
2025-12-2

楼主
olderp 发表于 2013-10-21 21:46:16 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Mining of Massive Datasets.pdf (1.99 MB, 需要: 3 个论坛币)
跟上时代学点儿Big Data 知识---
海量数据集的挖掘
英文原版P341
得嘱咐两句,本书研究的可是“very large amounts of data, that is, data so large  it does not fit in main memory. Because of the emphasis on size, many of our  examples are about the Web or data derived from the Web.”,即海量数据集。
What the Book Is About
At the highest level of description, this book is about data mining. However,
it focuses on data mining of very large amounts of data, that is, data so large
it does not fit in main memory. Because of the emphasis on size, many of our
examples are about the Web or data derived from the Web. Further, the book
takes an algorithmic point of view: data mining is about applying algorithms
to data, rather than using data to “train” a machine-learning engine of some
sort. The principal topics covered are:
1. Distributed file systems and map-reduce as a tool for creating parallel
algorithms that succeed on very large amounts of data.
2. Similarity search, including the key techniques of minhashing and localitysensitive
hashing.
3. Data-stream processing and specialized algorithms for dealing with data
that arrives so fast it must be processed immediately or lost.
4. The technology of search engines, including Google’s PageRank, link-spam
detection, and the hubs-and-authorities approach.
5. Frequent-itemset mining, including association rules, market-baskets, the
A-Priori Algorithm and its improvements.
6. Algorithms for clustering very large, high-dimensional datasets.
7. Two key problems for Web applications: managing advertising and recommendation
systems.

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Big data 海量数据 英文原版 Data 数据集 英文原版 emphasis Because derived highest

已有 1 人评分经验 论坛币 收起 理由
苹果六人行 + 60 + 10 精彩帖子

总评分: 经验 + 60  论坛币 + 10   查看全部评分

本帖被以下文库推荐

沙发
繁清(未真实交易用户) 发表于 2013-10-21 21:49:32
顶一个
一切皆有可能!

藤椅
soarice(未真实交易用户) 发表于 2013-10-23 05:50:01
看看撒,虽然下载不了,我没有论坛比萨。。。

板凳
DHLcynthia(未真实交易用户) 发表于 2013-10-23 06:48:15
啦啦啦啦啦啦

报纸
ef2001(未真实交易用户) 发表于 2013-10-28 15:50:05

地板
tintindchen(真实交易用户) 发表于 2013-12-14 23:44:43
谢谢楼主分享

7
jinhuang922(未真实交易用户) 发表于 2013-12-24 10:00:45
很好 跟上时代的步伐!谢谢分享
已有 1 人评分论坛币 收起 理由
olderp + 1 鼓励积极发帖讨论

总评分: 论坛币 + 1   查看全部评分

8
vynchi(未真实交易用户) 发表于 2013-12-24 15:03:12
赞啊

9
edwinfung(未真实交易用户) 发表于 2014-4-1 13:23:04
Thanks for sharing

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
jg-xs1
拉您进交流群
GMT+8, 2025-12-30 03:47