楼主: cwh2008
2203 3

[Hadoop] Hadoop 2.x Administration Cookbook [推广有奖]

  • 0关注
  • 1粉丝

硕士生

5%

还不是VIP/贵宾

-

威望
0
论坛币
291 个
通用积分
0.0600
学术水平
0 点
热心指数
0 点
信用等级
0 点
经验
1090 点
帖子
95
精华
0
在线时间
140 小时
注册时间
2008-9-3
最后登录
2023-2-11

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Hadoop is a distributed system with a large ecosystem, which is growing
at an exponential rate, and hence it becomes important to get a grip on
things and do a deep dive into the functioning of a Hadoop cluster in
production. Whether you are new to Hadoop or a seasoned Hadoop
specialist, this recipe book contains recipes to deep dive into Hadoop
cluster configuration and optimization.
What this book covers
Chapter 1, Hadoop Architecture and Deployment, covers Hadoop's
architecture, its components, various installation modes and important
daemons, and the services that make Hadoop a robust system. This chapter
covers single-node and multinode clusters.
Chapter 2, Maintaining Hadoop Cluster – HDFS, wraps the storage layer
HDFS, block size, replication, cluster health, Quota configuration, rack
awareness, and communication channel between nodes.
Chapter 3, Maintaining Hadoop Cluster – YARN and MapReduce, talks
about the processing layer in Hadoop and the resource management
framework YARN. This chapter covers how to configure YARN
components, submit jobs, configure job history server, and YARN
fundamentals.
Chapter 4, High Availability, covers high availability for a Namenode and
Resourcemanager, ZooKeeper configuration, HDFS storage-based
policies, HDFS snapshots, and rolling upgrades.
Chapter 5, Schedulers, talks about YARN schedulers such as fair and
capacity scheduler, with detailed recipes on configuring Queues, Queue
ACLs, configuration of users and groups, and other Queue administration
commands.
Chapter 6, Backup and Recovery, covers Hadoop metastore, backup and
restore procedures on a Namenode, configuration of a secondary
Namenode, and various ways of recovering lost Namenodes. This chapter
also talks about configuring HDFS and YARN logs for troubleshooting.
Chapter 7, Data Ingestion and Workflow, talks about Hive configuration
and its various modes of operation. This chapter also covers setting up
Hive with the credential store and highly available access using
ZooKeeper. The recipes in this chapter give details about the process of
loading data into Hive, partitioning, bucketing concepts, and configuration
with an external metastore. It also covers Oozie installation and Flume
configuration for log ingestion.
Chapter 8, Performance Tuning, covers the performance tuning aspects of
29
HDFS, YARN containers, the operating system, and network parameters,
as well as optimizing the cluster for production by comparing benchmarks
for various configurations.
Chapter 9, Hbase and RDBMS, talks about HBase cluster configuration,
best practices, HBase tuning, backup, and restore. It also covers migration
of data from MySQL to HBase and the procedure to upgrade HBase to the
latest release.
Chapter 10, Cluster Planning, covers Hadoop cluster planning and the best
practices for designing clusters are, in terms of disk storage, network,
servers, and placement policy. This chapter also covers costing and the
impact of SLA driver workloads on cluster planning.
Chapter 11, Troubleshooting, Diagnostics, and Best Practices, talks about
the troubleshooting steps for a Namenode and Datanode, and diagnoses
communication errors. It also covers details on logs and how to parse them
for errors to extract important key points on issues faced.
Chapter 12, Security, covers Hadoop security in terms of data encryption,
in-transit encryption, ssl configuration, and, more importantly, configuring
Kerberos for the Hadoop cluster. This chapter also covers auditing and a
recipe on securing ZooKeeper.

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Cookbook ADMINI Hadoop ration ATION

Hadoop 2.x Administration Cookbook.pdf

25.69 MB

需要: 3 个论坛币  [购买]

Hadoop 2.x Administration cookbook

本帖被以下文库推荐

沙发
军旗飞扬 发表于 2017-10-28 21:54:46 |只看作者 |坛友微信交流群
感谢分享

使用道具

藤椅
西门高 发表于 2017-11-3 22:33:36 |只看作者 |坛友微信交流群
谢谢分享

使用道具

板凳
franky_sas 发表于 2017-11-5 12:58:28 |只看作者 |坛友微信交流群

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-26 18:49