楼主: oliyiyi
912 2

Top Hadoop Interview Questions & Answers [推广有奖]

版主

泰斗

0%

还不是VIP/贵宾

-

TA的文库  其他...

计量文库

威望
7
论坛币
271951 个
通用积分
31269.3519
学术水平
1435 点
热心指数
1554 点
信用等级
1345 点
经验
383775 点
帖子
9598
精华
66
在线时间
5468 小时
注册时间
2007-5-21
最后登录
2024-4-18

初级学术勋章 初级热心勋章 初级信用勋章 中级信用勋章 中级学术勋章 中级热心勋章 高级热心勋章 高级学术勋章 高级信用勋章 特级热心勋章 特级学术勋章 特级信用勋章

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

本帖隐藏的内容

Q1. What exactly is Hadoop?
A1. Hadoop is a Big Data framework to process huge amount of different types of data in parallel to achieve performance benefits.

Q2. What are 5 Vs of Big Data ?
A2. Volume – Size of the data
Velocity – Speed of change of data
Variety – Different types of data : Structured, Semi-Structured, Unstructured data.

Q3. Give me examples of Unstructured data.
A3. Images, Videos, Audios etc.

Q4. Tell me about Hadoop file system and processing framework.
A4. Hadoop files system is called as HDFS – Hadoop distributed file system. It consists of Name Node, Data Node and Secondary Name Node.
Hadoop processing framework is known as MapReduce. It caters Map and Reduce tasks that get scheduled in parallel to achieve efficiency.

Q5/ What is High Availability feature in Hadoop2.
A5. In Hadoop 2 Passive Name Node is introduced to avoid NameNode becoming single point of failure. This results into High Availability of Hadoop cluster.

Q6. What is Federation.
A6. Federation is introduced in Hadoop 2 to cater multiple NameNodes in Hadoop cluster. This makes NameNode horizontally scalable and allows to cater huge amount of Meta Data.

Q7. What is MetaData ?
A7. MetaData is data about data. Name Node caters MetaData in Hadoop cluster – information about files in HDFS.

Q8. What are the main components in Hadoop Eco-System and what are their functions ?
A8. Here is a list of Hadoop Eco-System components –
1. HDFS – distributed File System
2. MapReduce – programming paradigm – based on Java
3. Pig- to process and analyse the structured and semi-structured data
4. Hive – to process and analyse structured data
5. HBASE – NOSQL database
6. SQOOP – Import/Export structured data
7. Oozie – Scheduler

Q9. Tell me some major benefits of Hadoop?
A9. Some major benefits of Hadoop are –
a. Cost-Effective
b. Ability to handle multiple data types
c. Ability to handle big data
d. Common platform for machine learning/business intelligence/datawarehousing etc.

Q10. How Hadoop is cost-effective?
A10. Hadoop is used with commodity hardware and is open-source. So, it provides a cost-effective solution from both hardware and software fronts.

Q11. What is the block size in Hadoop?
A11. Block size in Hadoop 1 is 64 kb and in Hadoop 2 is 128 kb.

Q12. Please tell me the NameNode port number
A12. Its 50070.

Q13. What is the default replication factor in HDFS ?
A13. Default replication factor is 3.

Q14. What is the command to change the replication factor ?
A14. Replication factor can be changed using SETREP command.

Q15. Tell me two most commonly used commands in HDFS.
A15. Get command and put command.

Q16. What are the common types of NOSQL data bases ?
A16. These are –
a. Columnar database.
b. Document database.
c. Graph database.

Q17. Give me an example of document database ?
A17. MongoDB.

Q18. Give me the examples of Columnar database ?
A18. Cassandra and HBASE.

Q19. Tell me about the execution modes of Apache Pig.
A19. Pig can be executed in local and MapReduce modes.

Q20. How would you import data from MYSQL into HDFS ?
A 20. Using Sqoop.

Q21. What are the Hadoop features extended to its eco-system components ?
A 21. High Availability, Horizontal Scalability and Replication/Data Redundancy.


To read original article, click here.



二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:questions Interview question answers Hadoop framework different examples Answers achieve

已有 1 人评分论坛币 学术水平 热心指数 信用等级 收起 理由
janyiyi + 15 + 9 + 9 + 9 精彩帖子

总评分: 论坛币 + 15  学术水平 + 9  热心指数 + 9  信用等级 + 9   查看全部评分

缺少币币的网友请访问有奖回帖集合
https://bbs.pinggu.org/thread-3990750-1-1.html
沙发
smartlife 在职认证  发表于 2017-2-22 07:57:58 |只看作者 |坛友微信交流群

使用道具

藤椅
cws_24 发表于 2017-2-27 08:00:18 来自手机 |只看作者 |坛友微信交流群
oliyiyi 发表于 2017-2-21 18:24
**** 本内容被作者隐藏 ****
谢谢分享

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-26 10:22