Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2. HDFS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Goals and Motivation 7
Design 8
Daemons 9
Reading and Writing Data 11
The Read Path 12
The Write Path 13
Managing Filesystem Metadata 14
Namenode High Availability 16
Namenode Federation 18
Access and Integration 20
Command-Line Tools 20
FUSE 23
REST Support 23
3. MapReduce . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
The Stages of MapReduce 26
Introducing Hadoop MapReduce 33
Daemons 34
When It All Goes Wrong 36
YARN 37
4. Planning a Hadoop Cluster . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Picking a Distribution and Version of Hadoop 41
Apache Hadoop 41
Cloudera’s Distribution Including Apache Hadoop 42
Versions and Features 42
HadoopOperations.pdf
(3.5 MB, 需要: 10 个论坛币)
WebCorpusConstruction.pdf
(2.12 MB, 需要: 5 个论坛币)
TheIntelligentWeb.pdf
(3.32 MB, 需要: 5 个论坛币)
TheDataWarehouseToolkit3rdEdition.pdf
(5.84 MB, 需要: 5 个论坛币)
SurfaceComputingandCollaborativeAnalysisWork.pdf
(7.1 MB, 需要: 5 个论坛币)
SublinearAlgorithmsforBigDataApplications.pdf
(1.92 MB, 需要: 5 个论坛币)
SplunkOperationalIntelligenceCookbook.pdf
(15.85 MB, 需要: 5 个论坛币)
SoftwareEngineeringDesignTheoryandPractice.pdf
(3.39 MB, 需要: 5 个论坛币)
Securing Hadoop.pdf
(4.05 MB, 需要: 5 个论坛币)
Scaling Big Data with Hadoop and Solr.pdf
(2.53 MB, 需要: 5 个论坛币)
RealTimeBigDataAnalytics.pdf
(4.72 MB, 需要: 5 个论坛币)
RealTimeAnalytics.pdf
(4.24 MB, 需要: 5 个论坛币)
ProjectManagementwithSAPProjectSystem3rdedition.pdf
(42.27 MB, 需要: 5 个论坛币)
PythonforFinance.pdf
(13.56 MB, 需要: 5 个论坛币)
ProgrammingPig.pdf
(4.44 MB, 需要: 5 个论坛币)
Professional Hadoop Solutions.pdf
(8.17 MB, 需要: 5 个论坛币)
Programming Hive.pdf
(3.85 MB, 需要: 5 个论坛币)
ProblemSolvingandDataAnalysisUsingMinitab.pdf
(18.3 MB, 需要: 5 个论坛币)
ProbabilisticGraphicalModelsPrinciplesandApplications.pdf
(8.49 MB, 需要: 5 个论坛币)
Pro Apache Hadoop, 2nd Edition.pdf
(6.27 MB, 需要: 5 个论坛币)
Pro Hadoop.pdf
(6.89 MB, 需要: 5 个论坛币)
Practical Hadoop Security.pdf
(5.03 MB, 需要: 5 个论坛币)
PracticalDataAnalysis.pdf
(9.96 MB, 需要: 5 个论坛币)
PigDesignPatterns.pdf
(1.98 MB, 需要: 5 个论坛币)
PerceptualImageCodingwithDiscreteCosineTransform.pdf
(3.7 MB, 需要: 5 个论坛币)
NetworkSecurityThroughDataAnalysis.pdf
(9.29 MB, 需要: 5 个论坛币)
PentahoKettleSolutions.pdf
(16.08 MB, 需要: 5 个论坛币)
MondrianinAction.pdf
(7.56 MB, 需要: 5 个论坛币)
MulticriteriaDecisionAnalysis.pdf
(5.97 MB, 需要: 5 个论坛币)
ModelsandAnalysisforDistributedSystems.pdf
(4.01 MB, 需要: 5 个论坛币)
MasteringSplunk.pdf
(8.15 MB, 需要: 5 个论坛币)
ModelBasedDevelopmentApplications.pdf
(4.05 MB, 需要: 5 个论坛币)
MapReduceDesignPatterns.pdf
(5.46 MB, 需要: 5 个论坛币)
Mastering Hadoop.pdf
(4.89 MB, 需要: 5 个论坛币)
MakingSenseofDataI2ndEdition.pdf
(8.03 MB, 需要: 5 个论坛币)
LinearandNonlinearProgramming4edition.pdf
(5.39 MB, 需要: 5 个论坛币)
LearningStorm.pdf
(2.5 MB, 需要: 5 个论坛币)
LearningSPARQL2ndEdition.pdf
(15.28 MB, 需要: 5 个论坛币)
LearningSPARQL.pdf
(7.31 MB, 需要: 5 个论坛币)
LearningInformaticaPowerCenter9x.pdf
(11.42 MB, 需要: 5 个论坛币)
Learning Hadoop 2.pdf
(2.61 MB, 需要: 5 个论坛币)
LatestAdvancesinInductiveLogicProgramming.pdf
(5.21 MB, 需要: 5 个论坛币)
KnowledgeNeedsandInformationExtraction.pdf
(4.43 MB, 需要: 5 个论坛币)
KNIMEEssentials.pdf
(2.56 MB, 需要: 5 个论坛币)
InvitationtoComputerScience6thedition.pdf
(43.17 MB, 需要: 5 个论坛币)
BigDataImperatives.pdf
(8.63 MB, 需要: 5 个论坛币)
BigDataGlossary.pdf
(4.71 MB, 需要: 5 个论坛币)
BigDataForDummies.pdf
(4.44 MB, 需要: 5 个论坛币)
BigDataBigInnovation.pdf
(3.07 MB, 需要: 5 个论坛币)
BigDataAPrimer.pdf
(7.27 MB, 需要: 5 个论坛币)
BigDataApplicationArchitectureQA.pdf
(4.74 MB, 需要: 5 个论坛币)
AuthorizationsinSAP.pdf
(19.8 MB, 需要: 5 个论坛币)
ApacheAccumuloforDevelopers.pdf
(4.87 MB, 需要: 5 个论坛币)
Apache Hive Essentials.pdf
(1.86 MB, 需要: 5 个论坛币)
Apache Hadoop YARN.pdf
(7.49 MB, 需要: 5 个论坛币)
AnalyzingtheAnalyzers.pdf
(2.32 MB, 需要: 5 个论坛币)
AManagersGuidetoDataWarehousing.pdf
(2.7 MB, 需要: 5 个论坛币)
AlgorithmsfromandforNatureandLife.pdf
(8.62 MB, 需要: 5 个论坛币)
AgileDataScience.pdf
(11.54 MB, 需要: 5 个论坛币)
AccessDataAnalysisCookbook.pdf
(11.36 MB, 需要: 5 个论坛币)



雷达卡





京公网安备 11010802022788号







