DUE TO SPAM, SIGN-UP IS DISABLED. Goto Selfserve wiki signup and request an account.
Status
Current state: Accepted
Discussion thread: here
JIRA: here
Please keep the discussion on the mailing list rather than commenting on the wiki (wiki discussions get unwieldy fast).
Motivation
One of MM2's main features is syncing consumer group offset between clusters. This feature relies on two separate connectors
MirrorSourceConnector: this connector stores the translation of the new record offsets on target cluster inoffset_syncsinternal topic after mirroring the record itself. This translation is done atcommitmethod since 3.5 wherecommit()API stores the offsets in the source system, up to the offsets that have been returned bypoll(). The implementation of this API should block until the commit is complete. See KAFKA-14610 for more details.MirrorCheckpointConnector: this connector uses the new record offsets fromoffset_syncsand updatecheckpointsand__consumer_offsetstopics. This can be controlled byemit.checkpoints.enabledandsync.group.offsets.enabled
For use-cases where customers only run MM2 for replicating between clusters without the need for moving the consumer group offsets by running only MirrorSourceConnector. Part#1 (blocking until commit) of this feature still an issue as it adds cost to the progress of their replication. The only workaround reducing the latency cost is by increasing offset.lag.max to INT_MAX which will result that the offset will never going to be queued for offset sync (Check `PartitionState::update` for details on how MM2 decided if it should emit an offset sync or not) however this doesn't eliminate the cost of creating an internal topic.
This KIP proposes that we add a new config for MirrorSourceConnector to control enabling the offsets translation between clusters for replicated topics.
Public Interfaces
| Properties | Default | Description |
|---|---|---|
emit.offset-syncs.enabled | true | Whether to store the new offset of the replicated records in offset-syncs topic or not. MirrorCheckpointConnector will fail to start if emit.checkpoints.enabled and/or sync.group.offsets.enabled are enabled while |
Proposed Changes
Similar to
emit.checkpoints.enabledandemit.heartbeat.enabledthis KIP will addemit.offset-syncs.enabled. The config default value will be true to keep the current implementation.offset-syncs topic will not be created if
emit.offset-syncs.enabledset to falselogic to publish offset syncs in
commit()will not be executed ifemit.offset-syncs.enabledset to falseMirrorCheckpointConnectorvalidation will fail at start ifemit.offset-syncs.enableddisabled whileemit.checkpoints.enabledand/orsync.group.offsets.enabledare enabled.
Compatibility, Deprecation, and Migration Plan
emit.offset-syncs.enabled is set to true by default so any existing use-cases or use-cases that run this feature will do nothing. Only use-case that wish to fully disable this feature will need to set emit.offset-syncs.enabled to false.
Rejected Alternatives
Reuse
emit.checkpoints.enabledandsync.group.offsets.enabledand assume that if both are disabled then this feature needs to be fully disabled. I didn't want to rely onMirrorCheckpointConnectorconfigs to configureMirrorSourceConnector. And this might impact customers who enabledMirrorSourceConnectorwith intention to runMirrorCheckpointConnectorlater in the future