So far we have covered the following topics in Big Data:
In this blog we will discuss ZooKeeper.
In the Hadoop world, the recurring theme is distribution. What if you want to build your own distributed application?
You then have to worry about centralized configuration, synchronization, and serialization.
ZooKeeper is a distributed coordination service for distributed applications. It acts as a centralized repository.
What is ZooKeeper?
- Distributed coordination service for distributed applications
- Used for synchronization, serialization, and coordination
- Handles the 'nitty-gritty' side of distributed application development
- Applications use these services to coordinate distributed processing
- Coordination is error prone
- Race conditions, deadlocks, partial failures, inconsistencies
- Typical uses: configuration, message queue
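To make the "coordination is error prone" point concrete, here is a minimal sketch of a race condition (a lost update) and one way to avoid it. It is a toy, in-memory illustration only: the class VersionedValue and the functions lost_update and safe_update are hypothetical names invented for this post and are not part of the ZooKeeper API. The idea it shows, rejecting a write when the version has changed since it was read, is similar in spirit to the version check ZooKeeper applies on updates.

```python
class VersionedValue:
    """Toy shared value with a version counter (hypothetical, not ZooKeeper)."""

    def __init__(self, value):
        self.value = value
        self.version = 0

    def read(self):
        # Return the value together with the version it was read at.
        return self.value, self.version

    def write_unchecked(self, new_value):
        # Blind write: overwrites whatever is currently stored.
        self.value = new_value
        self.version += 1

    def write_if_version(self, new_value, expected_version):
        # Conditional write: succeeds only if nobody wrote since we read.
        if self.version != expected_version:
            return False
        self.value = new_value
        self.version += 1
        return True


def lost_update():
    """Two clients increment a counter without coordination."""
    counter = VersionedValue(0)
    a_val, _ = counter.read()           # client A reads 0
    b_val, _ = counter.read()           # client B reads 0
    counter.write_unchecked(a_val + 1)  # A writes 1
    counter.write_unchecked(b_val + 1)  # B also writes 1; A's update is lost
    return counter.value


def safe_update():
    """Same interleaving, but each write checks the version it read."""
    counter = VersionedValue(0)
    a_val, a_ver = counter.read()
    b_val, b_ver = counter.read()
    counter.write_if_version(a_val + 1, a_ver)        # A succeeds
    if not counter.write_if_version(b_val + 1, b_ver):
        # B's stale write is rejected, so B re-reads and retries.
        b_val, b_ver = counter.read()
        counter.write_if_version(b_val + 1, b_ver)
    return counter.value
```

With this interleaving, lost_update() ends with the counter at 1 even though two increments ran, while safe_update() ends at 2 because the stale write is detected and retried.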
Now let's talk about ZooKeeper architecture.
If we look closely at the diagram above:
- Container for data and other nodes
- Stores stat information and user data (up to 1 MB)
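The two points above appear to describe a znode: a node that holds a small amount of data and can also contain other nodes. A rough way to picture this is the in-memory model below. It is only a sketch under assumptions: ZNode, lookup, MAX_DATA_BYTES, and the fields kept in stat are hypothetical names chosen for illustration, not the real ZooKeeper data structures or client API, and the 1 MB limit is simply taken from the text above.

```python
import time

MAX_DATA_BYTES = 1024 * 1024  # assumed 1 MB cap on user data, per the text


class ZNode:
    """Toy model of a znode: holds data, stat info, and child nodes."""

    def __init__(self, name, data=b""):
        self._check_size(data)
        self.name = name
        self.data = data
        self.children = {}  # child name -> ZNode
        now = time.time()
        # Illustrative subset of bookkeeping fields (hypothetical names).
        self.stat = {"version": 0, "ctime": now, "mtime": now, "num_children": 0}

    @staticmethod
    def _check_size(data):
        if len(data) > MAX_DATA_BYTES:
            raise ValueError("znode data exceeds 1 MB")

    def set_data(self, data):
        # Replace the user data and bump the version in the stat.
        self._check_size(data)
        self.data = data
        self.stat["version"] += 1
        self.stat["mtime"] = time.time()

    def add_child(self, name, data=b""):
        # A znode is also a container, so it can hold other znodes.
        if name in self.children:
            raise KeyError("child already exists: " + name)
        child = ZNode(name, data)
        self.children[name] = child
        self.stat["num_children"] = len(self.children)
        return child


def lookup(root, path):
    """Walk a slash-separated path such as /app/config from the root."""
    node = root
    for part in path.strip("/").split("/"):
        if part:
            node = node.children[part]
    return node
```

For example, creating root = ZNode(""), then root.add_child("app").add_child("config", b"timeout=30"), makes lookup(root, "/app/config").data return b"timeout=30", which mirrors how a path addresses a node in the hierarchy.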
Znode types