Hadoop Input and Output: Questions and Answers

Question 1. ________ instances are encoded using the number of bytes declared in the schema.
  1.    Fixed
  2.    Enum
  3.    Unions
  4.    Maps
Explanation:-
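Answer: Option 1 -> Fixed


In Avro's binary encoding, a fixed instance is written using exactly the number of bytes declared in its schema. In the JSON encoding, every type except unions is encoded the same way as field default values.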
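
The difference between a fixed value and a variable-length bytes value in Avro's binary encoding can be sketched in a few lines of Python. This is a minimal illustration, not the Avro library: the helper names (encode_long, encode_bytes, encode_fixed) and the "Checksum" schema are hypothetical, and the sketch assumes the zig-zag variable-length integer coding that Avro uses for length prefixes.

```python
def encode_long(n: int) -> bytes:
    """Zig-zag, variable-length coding of a 64-bit integer (hypothetical helper)."""
    z = (n << 1) ^ (n >> 63)  # zig-zag: small magnitudes map to small unsigned values
    out = bytearray()
    while z & ~0x7F:          # more than 7 bits remain: emit a continuation byte
        out.append((z & 0x7F) | 0x80)
        z >>= 7
    out.append(z)
    return bytes(out)


def encode_bytes(data: bytes) -> bytes:
    """Variable-length bytes: a length prefix followed by the payload."""
    return encode_long(len(data)) + data


def encode_fixed(data: bytes, size: int) -> bytes:
    """Fixed: exactly `size` bytes, as declared in the schema, with no prefix."""
    if len(data) != size:
        raise ValueError(f"expected {size} bytes, got {len(data)}")
    return data


digest = bytes(range(16))                 # stand-in for a 16-byte checksum
# Assumed schema: {"type": "fixed", "name": "Checksum", "size": 16}
fixed_encoded = encode_fixed(digest, 16)
# Assumed schema: "bytes"
bytes_encoded = encode_bytes(digest)

print(len(fixed_encoded), len(bytes_encoded))
```

The fixed value occupies the declared 16 bytes, while the bytes value needs one extra byte for its length prefix.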



Question 2. Thrift resolves possible conflicts through _________ of the field.
  1.    Name
  2.    Static number
  3.    UID
  4.    None of the mentioned
Explanation:-
Answer: Option 2 -> Static number


Thrift identifies each field by a static number, whereas Avro resolves possible conflicts through the name of the field.



Question 3. ________ permits data written by one system to be efficiently sorted by another system.
  1.    Complex Data type
  2.    Order
  3.    Sort Order
  4.    All of the mentioned
Explanation:-
Answer: Option 3 -> Sort Order


Avro binary-encoded data can be efficiently ordered without deserializing it to objects.
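
The idea of ordering encoded data without deserializing it can be sketched for strings. The Python below is an illustration, not Avro's comparator: the helper names are hypothetical, and it assumes Avro's string encoding of a zig-zag variable-length length prefix followed by UTF-8 bytes. Because UTF-8 byte order matches Unicode code point order, comparing the raw payload bytes gives the same result as comparing the decoded strings.

```python
def encode_long(n: int) -> bytes:
    """Zig-zag, variable-length coding of a 64-bit integer (hypothetical helper)."""
    z = (n << 1) ^ (n >> 63)
    out = bytearray()
    while z & ~0x7F:
        out.append((z & 0x7F) | 0x80)
        z >>= 7
    out.append(z)
    return bytes(out)


def decode_long(buf: bytes, pos: int = 0):
    """Inverse of encode_long; returns (value, position after the integer)."""
    shift = 0
    z = 0
    while True:
        byte = buf[pos]
        pos += 1
        z |= (byte & 0x7F) << shift
        if not byte & 0x80:
            break
        shift += 7
    return (z >> 1) ^ -(z & 1), pos


def encode_string(s: str) -> bytes:
    payload = s.encode("utf-8")
    return encode_long(len(payload)) + payload


def payload_of(encoded: bytes) -> bytes:
    """Return the raw UTF-8 bytes, skipping the length prefix; no str is built."""
    length, start = decode_long(encoded)
    return encoded[start:start + length]


words = ["pear", "apple", "Zebra", "éclair", "fig"]
encoded = [encode_string(w) for w in words]

# Sort the encoded values by their payload bytes only.
ordered = sorted(encoded, key=payload_of)

# Decoding happens here purely to display the result.
ordered_words = [payload_of(e).decode("utf-8") for e in ordered]
print(ordered_words)
```

Sorting on the full encoded bytes would be wrong, because the length prefix would be compared first; the key function therefore skips it.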



Question 4. ________ are a way of encoding structured data in an efficient yet extensible format.
  1.    Thrift
  2.    Protocol Buffers
  3.    Avro
  4.    None of the mentioned
Explanation:-
Answer: Option 2 -> Protocol Buffers


Google uses Protocol Buffers for almost all of its internal RPC protocols and file formats.



Question 5. Which of the following is the slowest compression technique?
  1.    LZO
  2.    Bzip2
  3.    Gzip
  4.    All of the mentioned
Explanation:-
Answer: Option 2 -> Bzip2


Of all the available compression codecs in Hadoop, Bzip2 is by far the slowest.
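
The speed and size trade-off between codecs can be explored outside Hadoop with Python's standard library, which ships gzip and bzip2 bindings (LZO is not in the standard library, so it is left out). This is a rough sketch, not a benchmark of Hadoop's codec classes: the sample log line and the measure helper are made up for illustration, and timings depend on the machine and the data.

```python
import bz2
import gzip
import time

# Hypothetical sample input: a repetitive log line, about 2 MB in total.
data = b"2024-01-01 INFO request served in 12 ms\n" * 50_000


def measure(compress, decompress):
    """Return (compressed size in bytes, seconds spent compressing)."""
    start = time.perf_counter()
    packed = compress(data)
    elapsed = time.perf_counter() - start
    if decompress(packed) != data:  # the round trip must be lossless
        raise RuntimeError("round trip failed")
    return len(packed), elapsed


results = {
    "gzip": measure(gzip.compress, gzip.decompress),
    "bzip2": measure(bz2.compress, bz2.decompress),
}

for name, (size, seconds) in results.items():
    print(f"{name:>5}: {size} bytes in {seconds:.3f} s")
```

Running the sketch on data that resembles the real workload gives a better feel for the trade-off than any single published number.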



Question 6. Gzip (short for GNU zip) generates compressed files that have a _________ extension.
  1.    .gzip
  2.    .gz
  3.    .gzp
  4.    .g
Explanation:-
Answer: Option 2 -> .gz


You can use the gunzip command to decompress files that were created by a number of compression utilities, including Gzip.
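
A .gz file can also be produced and read back programmatically. The Python sketch below uses the standard gzip module; the file name events.log.gz and the payload are hypothetical. Python's gzip module does not append the extension by itself, so the sketch names the file explicitly, following the same convention as the gzip command-line tool.

```python
import gzip
import os
import tempfile

payload = b"first line\nsecond line\n"

# Hypothetical file name; the .gz suffix follows the gzip convention.
path = os.path.join(tempfile.mkdtemp(), "events.log.gz")

with gzip.open(path, "wb") as handle:  # write gzip-compressed bytes
    handle.write(payload)

with open(path, "rb") as handle:       # gzip files start with bytes 0x1f 0x8b
    magic = handle.read(2)

with gzip.open(path, "rb") as handle:  # the in-process equivalent of gunzip
    restored = handle.read()

print(path, magic.hex(), restored == payload)
```

A file written this way can be decompressed with the gunzip command as well.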



Question 7. The ____________ class extends and implements several Hadoop-supplied interfaces.
  1.    AvroReducer
  2.    Mapper
  3.    AvroMapper
  4.    None of the mentioned
Explanation:-
Answer: Option 3 -> AvroMapper


AvroMapper is used to provide the ability to collect or map data.



Question 8. Which of the following formats is the most compression-aggressive?
  1.    Partition Compressed
  2.    Record Compressed
  3.    Block-Compressed
  4.    Uncompressed
Explanation:-
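Answer: Option 3 -> Block-Compressed


Block compression collects keys and values into blocks and compresses each block as a whole, so it is the most aggressive of the SequenceFile formats. A SequenceFile key-value list can be just a Text/Text pair, and it is written to the file during the initialization that happens in the SequenceFile.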
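
Why the block-compressed format tends to compress harder than the record-compressed one can be seen with a toy model. The Python below is only a sketch: it uses zlib on made-up records and ignores the real SequenceFile framing (headers, lengths, sync markers). It assumes the documented behavior that record compression compresses each value on its own, while block compression gathers keys and values into separate buffers and compresses each buffer as a unit.

```python
import zlib

# Hypothetical records: 200 small key/value pairs with similar values.
records = [(f"key-{i:04d}".encode(), b"status=OK region=us-east")
           for i in range(200)]

uncompressed = sum(len(k) + len(v) for k, v in records)

# Record-compressed: every value is compressed individually, keys stay raw.
record_compressed = sum(len(k) + len(zlib.compress(v)) for k, v in records)

# Block-compressed: keys and values are batched separately and each batch
# is compressed as one unit, so repetition across records is exploited.
all_keys = b"".join(k for k, _ in records)
all_values = b"".join(v for _, v in records)
block_compressed = len(zlib.compress(all_keys)) + len(zlib.compress(all_values))

print(uncompressed, record_compressed, block_compressed)
```

Small values barely shrink when compressed one at a time, and can even grow because of per-stream overhead, which is why batching many records into a block pays off.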



Question 9. __________ support storing a copy of data at a particular instant of time.
  1.    Data Image
  2.    Datanots
  3.    Snapshots
  4.    All of the mentioned
Explanation:-
Answer: Option 3 -> Snapshots


One usage of the snapshot feature may be to roll back a corrupted HDFS instance to a previously known good point in time.



Question 10. Automatic restart and ____________ of the NameNode software to another machine is not supported.
  1.    failover
  2.    end
  3.    scalability
  4.    All of the mentioned
Explanation:-
Answer: Option 1 -> failover


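If the NameNode machine fails, manual intervention is necessary. (Later Hadoop releases added an optional High Availability configuration that supports automatic failover.)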