Apache Kafka 3.3 Replaces ZooKeeper with the New KRaft Consensus Protocol

Oct 26, 2022

The Apache Software Foundation has released Apache Kafka 3.3.1 with many new features and improvements. Most notably, it is the first release to mark the KRaft (Kafka Raft) consensus protocol as production ready. KRaft has been in development for several years: it shipped as early access in Kafka 2.8 and as a preview in Kafka 3.0.

KRaft is the consensus protocol that lets Apache Kafka manage its metadata directly. It simplifies Kafka’s architecture by consolidating responsibility for metadata into Kafka itself, removing the need for a third-party tool such as Apache ZooKeeper. KRaft mode improves partition scalability and resiliency, and it simplifies deployment because Kafka can now run standalone.
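To make the standalone deployment more concrete, the sketch below renders a few of the settings that a combined broker/controller node uses in KRaft mode. The property names (`process.roles`, `node.id`, `controller.quorum.voters`, `listeners`, `controller.listener.names`) are real Kafka settings. The helper names, hostnames, and ports are assumptions made for illustration, and the output is not a complete configuration (log directories and other settings are omitted).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ControllerNode:
    """One voter in the controller quorum (hypothetical helper type)."""
    node_id: int
    host: str
    port: int


def quorum_voters(nodes):
    """Format the voters as comma-separated 'id@host:port' entries."""
    return ",".join(f"{n.node_id}@{n.host}:{n.port}" for n in nodes)


def kraft_properties(node_id, nodes, broker_port=9092):
    """Render a minimal properties fragment for a combined broker/controller node."""
    me = next(n for n in nodes if n.node_id == node_id)
    settings = {
        # This node acts as both broker and controller; no ZooKeeper settings appear.
        "process.roles": "broker,controller",
        "node.id": str(node_id),
        "controller.quorum.voters": quorum_voters(nodes),
        "listeners": f"PLAINTEXT://:{broker_port},CONTROLLER://:{me.port}",
        "controller.listener.names": "CONTROLLER",
    }
    return "\n".join(f"{key}={value}" for key, value in settings.items())


if __name__ == "__main__":
    # Hostnames and ports are placeholders, not values from the release notes.
    nodes = [ControllerNode(i, f"kafka-{i}.example.internal", 9093) for i in (1, 2, 3)]
    print(kraft_properties(1, nodes))
```

The point of the sketch is what is absent: there is no ZooKeeper connection string, because the controller quorum is made up of Kafka nodes themselves.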

KRaft makes use of an event-based variant of the Raft consensus algorithm, hence its name.
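Raft-style protocols rely on majority agreement among the voters, which is why controller quorums are usually sized with an odd number of nodes. The arithmetic can be sketched as follows; the function names are illustrative and not part of any Kafka API.

```python
def majority(voters: int) -> int:
    """Smallest number of voters that forms a majority."""
    return voters // 2 + 1


def tolerated_failures(voters: int) -> int:
    """How many voters can be lost while a majority is still reachable."""
    return voters - majority(voters)


def is_committed(acks: int, voters: int) -> bool:
    """A record counts as committed once a majority of voters has stored it."""
    return acks >= majority(voters)


if __name__ == "__main__":
    for n in (1, 3, 4, 5):
        print(f"voters={n} majority={majority(n)} tolerated_failures={tolerated_failures(n)}")
```

Under this rule, going from three voters to four does not increase the number of failures the quorum can survive, while going to five does.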

The new quorum controller introduced with KRaft ensures that metadata is accurately replicated across the quorum. The active controller stores the metadata in an event-sourced log topic, while the other controllers in the quorum follow the active controller by responding to the events that it creates. The event log is periodically snapshotted so that it cannot grow indefinitely. When the active controller fails, the quorum controller, unlike the ZooKeeper-based controller, does not need to load state from ZooKeeper, because the cluster's internal state is already distributed in the metadata topic. This significantly shortens the unavailability window and improves the worst-case recovery time of the system.
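The event-sourcing idea in the paragraph above can be illustrated with a toy model. This is a minimal sketch under simplifying assumptions, not Kafka's implementation: the class and function names are hypothetical, records are plain key/value pairs, and a value of `None` stands for a deletion.

```python
from dataclasses import dataclass, field


def replay(state, records):
    """Apply records in order to a copy of the state; a None value deletes the key."""
    new_state = dict(state)
    for key, value in records:
        if value is None:
            new_state.pop(key, None)
        else:
            new_state[key] = value
    return new_state


@dataclass
class MetadataLog:
    """Toy append-only metadata log with snapshotting."""
    snapshot_state: dict = field(default_factory=dict)
    snapshot_offset: int = 0                      # offset of the first record not in the snapshot
    records: list = field(default_factory=list)   # records written after the snapshot

    @property
    def end_offset(self):
        return self.snapshot_offset + len(self.records)

    def append(self, key, value):
        self.records.append((key, value))
        return self.end_offset

    def take_snapshot(self):
        """Fold all records into the snapshot so the log does not grow without bound."""
        self.snapshot_state = replay(self.snapshot_state, self.records)
        self.snapshot_offset += len(self.records)
        self.records = []


class Follower:
    """A controller that builds its state by following the log."""

    def __init__(self):
        self.state = {}
        self.offset = 0

    def catch_up(self, log):
        if self.offset < log.snapshot_offset:
            # The records this follower needs were compacted away: start from the snapshot.
            self.state = dict(log.snapshot_state)
            self.offset = log.snapshot_offset
        pending = log.records[self.offset - log.snapshot_offset:]
        self.state = replay(self.state, pending)
        self.offset = log.end_offset


if __name__ == "__main__":
    log = MetadataLog()
    follower = Follower()
    log.append("topic-a", {"partitions": 3})
    follower.catch_up(log)
    log.append("topic-a", None)
    log.take_snapshot()
    follower.catch_up(log)
    print(follower.state, follower.offset)
```

Because every follower already holds the replayed state, a follower that takes over as the active controller in this model has nothing to reload from an external store, which is the property the article credits for the shorter recovery time.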

The image below shows a much faster shutdown of a Kafka cluster with two million partitions using the new quorum controller versus ZooKeeper.

Together, the KRaft consensus protocol and the quorum controller bring several benefits. They enable Kafka clusters to scale to millions of partitions through improved control-plane performance under the new metadata management. They improve stability and make Kafka easier to monitor, administer, and support. They give Kafka a single security model for the whole system, and they make controller failover near-instantaneous.

The Kafka community plans to deprecate ZooKeeper in the next release (3.4) and then remove it entirely in version 4.0.

In addition, Apache Kafka 3.3 brings other new features: metrics for metadata log processing errors, the ability for users to create delegation tokens for other users, and a strictly uniform sticky partitioner that distributes records more evenly across partitions.

For Kafka Streams, this release adds source/sink node metrics for consumed/produced throughput, adds the ability to pause and resume topologies, and consolidates the KStream transform() and process() methods. Kafka Connect adds exactly-once support for source connectors.

About the Author

Andrea Messetti

Andrea is a software architect at DXC Technology. Previously he worked at HP. Andrea is currently focusing on Java, cloud-native applications and microservices. He is passionate about every aspect related to Computer Science (ML, Blockchain, edge computing).

