| ID | IEP-132 |
| Author | |
| Sponsor | |
| Created | |
| Status | |
Motivation
For now, Ignite cluster can be upgraded only by full cluster restart.
Procedure must be the following:
- Stop ALL cluster nodes.
- Update files on each node.
- Start nodes one by one.
Each version update means cluster unavailability for the end user.
This is extremely inconvenient, especially for the users who use Ignite as a primary data storage.
Other systems supports rolling upgrade feature when upgrade made node by node without unavailability period.
After implementing rolling upgrade, Ignite must support the following upgrade procedure:
- Stop one node.
- Upgrade files on the node.
- Start node.
- Repeat steps 1-3 for each node in cluster.
Let's see how rolling upgrade implemented in other systems:
Other systems
Many distributed open source systems has rolling upgrade feature, already.
We must study implemented approaches to gain some insights from them.
- Is there rolling upgrade feature?
- How it implemented?
- How it tested?
- When it tested: Each PR? Weekly? Only on release?
- Network message format.
- Serdes implementation.
- How many earlier releases can be upgraded.
Name | Supported | How it implemented? | How it tested? | When it tested? | Network message format | Serdes implementation | How many earlier releases can be upgraded? |
|---|
| Apache Cassandra | Yes (1), (2) | Server-client compatibility works similar to Ignite Thin Client protocol. | Compatibility checked with the special test framework (3) At a first glance, there are no special source code checks to ensure compatibility in day by day coding | On release or by request | POJO | Custom serdes implementation. |
|
| Apache Kafka | Yes (4) | Message formats checked on PR reivew. At a first glance, there are no special source code checks to ensure compatibility in day by day coding. But, all machinery to code compatible implemented in code generation framework (5) | Compatibility checked with the special test framework ducktest (6) | On release or by request. Kafka doesn't provide public resources to run ducktests. Run done by contributors or by confluent employers on private hosts | POJO | Custom serdes implementation. | All |
| Yugabyte | Yes (7) | checked on review (see commit message section "Upgrade/Rollback safety") (8) |
|
| plain objects | Protobuf |
|
| YDB | Yes | Message formats checked on PR rivew - grpc+protobuf helps to maintain compatibility | Compatibility checked with the special test framework (9) | On request. | plain objects | Protobuf |
|
| Cockroach DB | Yes |
|
|
| plain objects | Protobuf |
|
| Hazelcast | Yes (12) |
|
|
|
|
|
|
Description
Rolling Upgrade assumes that cluster can consist of Ignite nodes of different versions.
So Ignite must provide backward compatibility on network level and ability to work in mixed topology.
Development time checks, tests must be added to provide protection of incorrect patches.
Ignite codebase must be reorganized in a way to clearly distinguish those parts that require compatibility and those who don't.
Let's define clearly, what backward compatibility for the network messages means:
- New Ignite server MUST be able to read previous version of message.
- Old Ignite server MUST be able to read new version of message
- Newly added fields MUST be ignored.
- Removed fields MUST have default values.
Note, there are guide for PDS compatibility, already (11).
Let's list subsystems that must be reworked to provide compatibility:
- communication: Communication messages consists of two parts:
- Message format: message format itself. Communication API should be reworked to force backward compatible messages.
- User data: Message can store user data. User data format must be backward compatible.
- discovery:
- Message format: message format itself.
Discovery API must be reworked to force backward compatible messages.
- binary marshaller:
- Currently, binary marshaller code highly coupled with the other Ignite code.
We must modularize Binary infrastructure (10) and provide compatibility guarantees for each part of it.
- features:
- Framework to enable/disable features for mixed version clusters must be developed.
- affinity:
- Affinity function must the same for each online node version.
- management commands
All subsystem that must be compatible:
- PDS
- partition data
- WAL records
- Metadata
- binary meta
- marshaller
- Communication SPI
- Discovery SPI
- Binary Marshaller
- Affinity.
- Management API
- argument
- results
- tasks (class names).
- ThinClient Protocol
Compatibility Matrix
| Component | Type | Number of releases |
|---|
| Ignite public API | backward | 1 |
| PDS | backward | all |
| WAL | backward | all |
| Metadata | backward | all |
| Thin Client | full | all |
| Communication SPI | full | 1 |
| Discovery SPI | full | 1 |
| Binary Marshaller | backward | all |
| Management API | backward | 1 |
| Affinity | full | all |
| JDBC | backward | 1 |
| ODBC | backward | 1 |
| Rest | backward | 1 |
| CDC | full | 1 |
Current Ignite codebase
- Is there any primitives, building blocks to provide compatibility?
- Difficulties for day by day coding.
- java serialzation
- anonymous class names based on declaration order
- Compatibility testing.
- explicit tests.
- ability to run tests with random node versions
- New feature implementation, enabling.
- patterns for implementing commons cases: new version of algorithmes, testing against new versions.
- Scope of compatibility:
- Currently any third-party module can register own ports (messages?) and must be able to track compatibility.
Implementation phases
Phase 0 - Code Cleanup
Phase 1 - BinaryMarshaller modularization
- modularize BinaryMarshaller.
- create small jar for ignite thin client
- provide clear API for binary objects inside ignite-code and other modules.
Phase 2 - Communication SPI Compatibility
- Communication MessageWriter and MessageReader are aware of peers version, and then schema of a message.
- Distinguish serdes generation from POJO
- PR checks for
Messages ancestor change with the some warning, labels, etc. that can draw reviewer attention to the possible compatibility issues.
Phase 3 - Discovery SPI Compatibility
Serdes changes to provide compatibility? (current approach with JDK serialization provides some compatibility (13).
Is it enough for long-term compatibility support?
Serdes changes to restrict classes that can be sent over SPI.- PR checks for
Messages ancestor change with the some warning, labels, etc. that can draw reviewer attention to the possible compatibility issues.
Phase 4 - IgniteFeatures
- Write down clear rules to deal with the new features and not compatible enhancements.
- Support, if not, already this rules in IgniteFeatures framework.
Phase 5 - Management API
- Provide ability to change arg, result classes in compatible way.
- Possible approach is to reuse communication serdes framework.
- Other possibility is to migrate on BinaryObject as a arguments and results.
- PR checks for
Messages ancestor change with the some warning, labels, etc. that can draw reviewer attention to the possible compatibility issues.
Phase 6 - Testing
- ducktests to check upgrade procedure.
unit tests mode that start some nodes of previous version.
Phase 7 - Development process changes
Any change in public API MUST be in the form of IEP.
Any change in public API MUST be voted by two(three?) committers.- Documentation with clear description of development rules for all subsystems required to be compatible.
Alternative designs
- subsystem API versions (like in REST API)
- runtime component(IgniteProcessor, IgniteManager) upgrade with the dynamic class loading.
Reference Links
- https://www.datastax.com/learn/whats-new-for-cassandra-4/migrating-cassandra-4x
- https://docs.datastax.com/en/luna-cassandra/guides/upgrade/overview.html
- https://github.com/apache/cassandra-dtest/blob/trunk/upgrade_tests/README.md
- https://kafka.apache.org/documentation/#upgrade
- https://github.com/apache/kafka/blob/trunk/clients/src/main/resources/common/message/AlterPartitionResponse.json
- https://github.com/apache/kafka/blob/trunk/tests/kafkatest/tests/core/kraft_upgrade_test.py
- https://docs.yugabyte.com/preview/manage/upgrade-deployment/
- https://github.com/yugabyte/yugabyte-db/commit/e9ab17dea0d3b4f1673531c07404a290f9fbd8f2
- https://github.com/ydb-platform/ydb/blob/main/ydb/tests/functional/restarts/
- IEP-119 Binary infrastructure modularization
- PDS Compatibility Guide (WIP)
- https://hazelcast.com/products/rolling-upgrade/
- https://docs.oracle.com/en/java/javase/17/docs/specs/serialization/version.html#compatible-java-type-evolution
Tickets
| Key
|
Summary
|
T
|
Created
|
Updated
|
Due
|
Assignee
|
Reporter
|
P
|
Status
|
Resolution
|