...

2. Supports cross-DC joins by using Paxos to detect duplicates, etc.

 

Apache Apex (incubating)

Website: http://apex.incubator.apache.org/

  1. Pipeline processing architecture that supports both real-time and batch processing in one unified model.
  2. Architected for scalability, low-latency processing, high availability, and operability.
  3. Stateful fault tolerance: operator state is checkpointed automatically, without the user having to write code for it.
  4. Runs natively on YARN and HDFS, with a local mode for development.
  5. Rich library of pre-built operators (Malhar), including many adapters for message buses, databases, file systems, etc.
  6. Supports Kafka as a source and sink (at any point in the topology); the connector provides offset management for exactly-once semantics / idempotency.
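
The offset-based idempotency mentioned in the last item can be illustrated with a minimal, framework-independent sketch. It does not use the Apex or Kafka APIs; `IdempotentSink`, `replay`, and the in-memory `log` are hypothetical names chosen for illustration. The assumption is that the sink stores the last applied offset together with its data, so records replayed after recovery from a checkpoint are skipped rather than written twice.

```python
from dataclasses import dataclass, field


@dataclass
class IdempotentSink:
    """Toy sink (hypothetical) that keeps the last applied offset next to its data."""
    rows: list = field(default_factory=list)
    committed_offset: int = -1  # offset of the last record that was applied

    def write(self, offset: int, value: str) -> bool:
        # A replayed record has an offset at or below the committed one: skip it.
        if offset <= self.committed_offset:
            return False
        # In a real store these two updates would need to be one atomic transaction.
        self.rows.append(value)
        self.committed_offset = offset
        return True


def replay(log, sink, start, end):
    """Feed records with offsets in [start, end) to the sink; return how many were applied."""
    applied = 0
    for offset in range(start, end):
        if sink.write(offset, log[offset]):
            applied += 1
    return applied


log = ["a", "b", "c", "d"]  # stand-in for a partition of a message bus
sink = IdempotentSink()

# First run processes offsets 0..2, then the operator is assumed to fail.
first = replay(log, sink, 0, 3)

# Recovery restarts from an older checkpoint (offset 1), so offsets 1 and 2
# are delivered again; only offset 3 is new to the sink.
second = replay(log, sink, 1, 4)
```

After both runs the sink holds each record exactly once, even though offsets 1 and 2 were delivered twice. The design choice is to make duplicate delivery harmless at the sink rather than to prevent it upstream.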

Resource Manager Frameworks

...