Wednesday 1 July 2015

Hadoop Framework

Hadoop Framework:
Architecture:
Master/Slave

Servers in a cluster must always be connected in a LAN
But all servers in a LAN may not be in a cluster.

500GB stored in 59 seconds by Yahoo.

The file system that manages storage across a network of machines is called Distributed File System.

Replication is done to avoid data loss.

Replication can be full and Partial.

Replication is directly proportional to Fetching ease which directly proportional to wastage of storage space which is finally inversly proportional to updation ease and time.

HDFS is a layer above the OS' file system.

The principle is Write once read many times.

Primary and Secondary nodes are desiginated During replication.

No comments:

Post a Comment