LatestHadoop admin interview questions and answers:
1. What is Edge Node? Why choose two edge nodes in a cluster?
Basically, Edge Nodes are end-user connectivity purposes like an interface between cluster and client.
One Edge node is a single point if the edge node goes down another edge node will connect that’s why we use two edge nodes.
2. If you have four master nodes what are services are installed?
In master node 1: installed, Name node, Secondary node Hive server, Resource manager one zookeeper
In master node 2: HBase master, Oozie server
In master node 3: Hue, spark, three zookeeper
In master node 4: High availability
3. Tell me about default block size of Hadoop and Unix?
The default block size of HDFS is 128MB
The default block size of Unix is 4kb
4. What are security measures that are implemented in the Hadoop cluster?
LDAP is the first level authentication
Kerberos for the second level authentication
Sentry for role-based authorization to data and metadata stored on Hadoop cluster
Knox, who access the cluster to provide security like a gateways
Ranger is to provide security across Hadoop eco-system folder access and data authorization
5. What about data transmitted over the network data in transit how do you secure the data?
By using encrypted data transmitted over the networks and also using SSL certifications and HTTPS and some other protocols also.
6. What are the types of accounts used in the Hadoop cluster?
Service account: This account belongs to create in the active directory, within the Hadoop cluster access the jobs and applications.
Technical account: This account related to access from outside clients for application related for example Java client to Hive access.
Business user account: This account belongs to some business users want to access the Hadoop cluster.
Admin account: highly privileged account for giving credentials for users from active directory
Local account: This account belongs to Unix based for active directory principals.