Test your knowledge of Hadoop topics
Posted on
November 28, 2018
by
How much do you about the mission critical Hadoop tools that are unlocking the next generation of advances in Big Data?

Big data is a big deal in IT these days, and the Apache Hadoop framework is one of the tools making it possible to manage all the data.  An assortment of utilities — all open-source — Hadoop makes it possible to use a network of computers for scalable, distributed storage, and big data processing.

How well do you know the elements, syntax, and semantics of Hadoop? What follows is a self-test of 25 questions not centered around any one certification but based on the general concepts documented at the Apache Hadoop homepage.

In all cases, pick the best answer(s) to each question. The answers appear at the end of the questions. Good luck!

1. In a Hadoop cluster, which two machines are typically exclusively designated as master (choose two)?
Web App Proxy
NameNode
MapReduce Job History
Common
ResourceManager

2. Which Hadoop command is used to recursively copy file or directories?
distcp
fs
reccopy
copier

3. What will the command hadoop jar do?
Create an archive file
Create a self-contained workspace
Run a jar file
Track usage in a log file

4. Which of the following files should be used to list all slave hostnames or IP addresses, one per line?
etc/hadoop/slaves
bin/hadoop/hosts
etc/hosts/slaves
D. bin/slaves/hosts

5. Which of the following commands gets a delegation token from a NameNode?
hdfs dfs
hdfs fetchdt
hdfs token
hdfs delegate

6. With Hadoop Key Management Server, if no ACL is configured for a specific key AND no default ACL is configured AND no whitelist key ACL is configured for the requested operation, then access will be:
Allowed
Verified
Denied
Restricted

7. Which of the following is NOT one of the regularly implemented Hadoop modules?
HDFS
Common
YARN
Tone
MapReduce

8. HDFS cache directives are identified by a unique, non-repeating integer ID of what size?
64-bit
32-bit
16-bit
8-bit

9. Which command permanently delete files in checkpoints older than the retention threshold and creates a new checkpoint?
hadoop fs -delete
hadoop fs -censor
hadoop fs -expunge
hadoop fs -remove

10. Which of the following is an Apache alternative to MapReduce that can be used to analyze data in HDFS?
Cassie
NoSQL
Pig
Society

11. Which of the following commands can be used to show computed Hadoop environment variables?
hadoop envvars
hadoop environ
hadoop show
hadoop digress

12. Which of the following of Hadoop’s Java configuration files are NOT site-specific?
etc/hadoop/core-site.xml
hdfs-default.xml
etc/hadoop/yarn-site.xml
etc/hadoop/mapred-site.xml

13. Hadoop KMS (Key Management Server) supports HTTP SPNEGO ______ authentication and HTTPS secure transport.
MD5
TLS
LDAP
Kerberos

14. Which of the following must be installed on a Linux host before Hadoop can be set up in a single node cluster?
HTML5 and rsh
Java and Cassandra
Java and ssh
XML and HTML5

15. Which of the following is NOT a Hadoop recognized audience?
Public
Private
Limited-Private
Community

Please visit GoCertify to attempt the remaining 10 questions of this quiz.

ANSWERS

1. B and E
2. A
3. C
4. A
5. B
6. C
7. D
8. A
9. C
10. C
11. A
12. B
13. D
14. C
15. D

About the Author

Emmett Dulaney is a professor at Anderson University and the author of several books including Linux All-in-One For Dummies and the CompTIA Network+ N10-008 Exam Cram, Seventh Edition.

Posted to topic:
Certification

Important Update: We have updated our Privacy Policy to comply with the California Consumer Privacy Act (CCPA)

CompTIA IT Project Management - Project+ - Advance Your IT Career by adding IT Project Manager to your resume - Learn More