Q1. What exactly is Hadoop?
Q2. What are the 5 Vs of Big Data?
Q3. Give some examples of unstructured data.
Q4. Tell me about the Hadoop file system and processing framework.
Q5. What is the High Availability feature in Hadoop 2?
Q6. What is Federation?
Q7. What is metadata?
Q8. What are the main components in the Hadoop ecosystem and what are their functions?
Q9. Tell me some major benefits of Hadoop.
Q10. How is Hadoop cost-effective?
Q11. What is the block size in Hadoop?
Q12. What is the NameNode port number?
Q13. What is the default replication factor in HDFS?
Q14. What is the command to change the replication factor?
Q15. Name two of the most commonly used commands in HDFS.
Q16. What are the common types of NoSQL databases?
Q17. Give an example of a document database.
Q18. Give examples of columnar databases.
Q19. Tell me about the execution modes of Apache Pig.
Q20. How would you import data from MySQL into HDFS?
Q21. Which Hadoop features are extended to its ecosystem components?
A1. Hadoop is an open-source Big Data framework for storing and processing huge volumes of different types of data in parallel across clusters of commodity hardware.
A2. Volume – the size of the data.
Velocity – the speed at which data is generated and changes.
Variety – the different types of data: structured, semi-structured, and unstructured.
Veracity – the trustworthiness and quality of the data.
Value – the usefulness of the data for business insight.
A3. Images, videos, audio files, etc.
A4. The Hadoop file system is called HDFS – the Hadoop Distributed File System. It consists of a NameNode, DataNodes, and a Secondary NameNode.
The Hadoop processing framework is known as MapReduce. It splits work into Map and Reduce tasks that are scheduled in parallel across the cluster to achieve efficiency.
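As an illustration of the Map and Reduce phases (using standard Unix tools, not Hadoop itself), the classic word-count flow can be sketched as a pipeline: `tr` plays the mapper (emitting one word per line), `sort` stands in for the shuffle, and `uniq -c` acts as the reducer. The sample input is made up for the example.

```shell
echo "big data big hadoop data big" |
  tr ' ' '\n' |   # map: emit one (word) record per line
  sort |          # shuffle: group identical keys together
  uniq -c         # reduce: count the occurrences of each key
```

Each stage works on a stream of records independently, which is exactly what lets real MapReduce distribute the stages across many machines.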
A5. Hadoop 2 introduces a Standby (passive) NameNode so that the NameNode is no longer a single point of failure. This results in High Availability of the Hadoop cluster.
A6. Federation is introduced in Hadoop 2 to support multiple NameNodes in a Hadoop cluster, each managing a portion of the filesystem namespace. This makes the NameNode layer horizontally scalable and allows the cluster to handle a much larger amount of metadata.
A7. Metadata is data about data. The NameNode manages the metadata of the Hadoop cluster – information about the files and directories stored in HDFS.
A8. Here is a list of Hadoop ecosystem components –
1. HDFS – distributed file system
2. MapReduce – Java-based programming paradigm for parallel processing
3. Pig – to process and analyse structured and semi-structured data
4. Hive – to process and analyse structured data using SQL-like queries
5. HBase – NoSQL database
6. Sqoop – to import/export structured data between HDFS and relational databases
7. Oozie – workflow scheduler
A9. Some major benefits of Hadoop are –
a. Cost-Effective
b. Ability to handle multiple data types
c. Ability to handle big data
d. Common platform for machine learning, business intelligence, data warehousing, etc.
A10. Hadoop runs on commodity hardware and is open-source, so it provides a cost-effective solution on both the hardware and software fronts.
A11. The default block size in Hadoop 1 is 64 MB and in Hadoop 2 it is 128 MB.
A12. It is 50070 – the default NameNode web UI port in Hadoop 2 (changed to 9870 in Hadoop 3).
A13. The default replication factor is 3.
A14. The replication factor can be changed using the setrep command.
A15. The get and put commands, which copy files out of and into HDFS respectively.
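Assuming a running cluster, these HDFS commands can be sketched as follows; the file and directory names here are hypothetical placeholders:

```shell
hdfs dfs -put report.csv /user/demo/report.csv   # copy a local file into HDFS
hdfs dfs -get /user/demo/report.csv ./copy.csv   # copy an HDFS file back to the local filesystem
hdfs dfs -setrep -w 2 /user/demo/report.csv      # change the file's replication factor to 2
```

The -w flag on setrep makes the command wait until the new replication level is actually reached.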
A16. These are –
a. Columnar database.
b. Document database.
c. Graph database.
d. Key-value database.
A17. MongoDB is a widely used document database.
A18. Cassandra and HBase.
A19. Pig can be executed in two modes: local mode, which runs against the local filesystem, and MapReduce mode, which runs on the Hadoop cluster against HDFS.
A20. Using Sqoop, which transfers data between relational databases such as MySQL and HDFS.
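A typical Sqoop import can be sketched as below; the host, database, credentials, table, and target directory are all placeholders for this example:

```shell
sqoop import \
  --connect jdbc:mysql://dbhost:3306/sales \
  --username dbuser -P \
  --table orders \
  --target-dir /user/demo/orders \
  --num-mappers 4
```

Sqoop turns the import into a MapReduce job; --num-mappers controls how many parallel tasks read slices of the table.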
A21. High Availability, horizontal scalability, and replication/data redundancy.