Discussion thread: https://lists.apache.org/thread/670qw80wwfflgv3djqg4304xqy9y8l19
Vote thread: https://lists.apache.org/thread/2lqq021vyc98w3yly678s8lpv0o8vpz5
JIRA: CELEBORN-1492
Release: 0.6.0


Please keep the discussion on the mailing list rather than commenting on the wiki (wiki discussions get unwieldy fast).

...

  1. Start the Runner and Scheduler, then use CLI commands to submit verification plan files that simulate the following anomalies and verify Celeborn's stability:
    • Kill master process
    • Kill worker process
    • Worker directory not writable
    • Worker disk IO hang
    • High CPU load
    • Master node metadata corruption
  2. Mock the shuffle process and support implementing additional corner cases, so that each stage of shuffle can be tested and the workloads enriched.
  3. Provide a Helm chart to support deploying the chaos testing framework on Kubernetes.
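
To make the idea of a verification plan in step 1 concrete, here is a minimal sketch of how such a plan might be modeled and validated. Everything in it is an assumption for illustration: the line format (`anomaly target duration`), the anomaly identifiers, the node names, and the `parse_plan` helper are hypothetical and are not the framework's actual plan file format or CLI. Only the six anomaly types come from the list above.

```python
from dataclasses import dataclass
from enum import Enum


class Anomaly(Enum):
    # The six anomalies named in the test plan; the string identifiers are hypothetical.
    KILL_MASTER = "kill-master"
    KILL_WORKER = "kill-worker"
    WORKER_DIR_NOT_WRITABLE = "worker-dir-not-writable"
    WORKER_DISK_IO_HANG = "worker-disk-io-hang"
    HIGH_CPU_LOAD = "high-cpu-load"
    MASTER_METADATA_CORRUPTION = "master-metadata-corruption"


@dataclass(frozen=True)
class PlanStep:
    anomaly: Anomaly
    target: str            # hypothetical node name, e.g. "worker-1"
    duration_seconds: int  # how long the fault stays injected


def parse_plan(lines):
    """Parse 'anomaly target duration' lines into PlanStep objects.

    Blank lines and lines starting with '#' are skipped. An unknown anomaly
    name or a non-integer duration raises ValueError.
    """
    steps = []
    for number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split()
        if len(parts) != 3:
            raise ValueError(f"line {number}: expected 3 fields, got {len(parts)}")
        name, target, duration = parts
        steps.append(PlanStep(Anomaly(name), target, int(duration)))
    return steps


# A hypothetical plan that injects three of the listed anomalies in sequence.
plan = parse_plan([
    "# hypothetical verification plan",
    "kill-worker worker-1 30",
    "high-cpu-load worker-2 60",
    "kill-master master-0 15",
])
total = sum(step.duration_seconds for step in plan)
```

Rejecting unknown anomaly names at parse time means a mistyped plan fails before any fault is injected, which is usually preferable in a chaos test where a silently skipped step could make a run look healthier than it was.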

Rejected Alternatives

The chaos testing framework has no rejected alternatives.