This Confluence has been LDAP enabled, if you are an ASF Committer, please use your LDAP Credentials to login. Any problems file an INFRA jira ticket please.

Page tree
Skip to end of metadata
Go to start of metadata

This page describes a proposed Flink Improvement Proposal (FLIP) process for proposing a major change to Flink.

To create your own FLIP, click on "Create" on the header and choose "FLIP-Template" other than "Blank page".

Purpose

The purpose of FLIPs is to have a central place to collect and document planned major enhancements to Apache Flink. While JIRA is still the tool to track tasks, bugs, and progress, the FLIPs give an accessible high level overview of the result of design discussions and proposals. Think of FLIPs as collections of major design documents for user-relevant changes.

We want to make Flink a core architectural component for users. We also support a large number of integrations with other tools, systems, and clients. Keeping this kind of usage healthy requires a high level of compatibility between releases — core architectural elements can't break compatibility or shift functionality from release to release. As a result each new major feature or public API has to be done in a way that we can stick with it going forward.

This means when making this kind of change we need to think through what we are doing as best we can prior to release. And as we go forward we need to stick to our decisions as much as possible. All technical decisions have pros and cons so it is important we capture the thought process that lead to a decision or design to avoid flip-flopping needlessly.

Hopefully we can make these proportional in effort to their magnitude — small changes should just need a couple brief paragraphs, whereas large changes need detailed design discussions.

This process also isn't meant to discourage incompatible changes — proposing an incompatible change is totally legitimate. Sometimes we will have made a mistake and the best path forward is a clean break that cleans things up and gives us a good foundation going forward. Rather this is intended to avoid accidentally introducing half thought-out interfaces and protocols that cause needless heartburn when changed. Likewise the definition of "compatible" is itself squishy: small details like which errors are thrown when are clearly part of the contract but may need to change in some circumstances, likewise performance isn't part of the public contract but dramatic changes may break use cases. So we just need to use good judgement about how big the impact of an incompatibility will be and how big the payoff is.

What is considered a "major change" that needs a FLIP?

Any of the following should be considered a major change:

  • Any major new feature, subsystem, or piece of functionality
  • Any change that impacts the public interfaces of the project

What are the "public interfaces" of the project?


All of the following are public interfaces that people build around:

  • DataStream and DataSet API, including classes related to that, such as StreamExecutionEnvironment
  • Classes marked with the @Public annotation
  • On-disk binary formats, such as checkpoints/savepoints
  • User-facing scripts/command-line tools, i.e. bin/flink, Yarn scripts, Mesos scripts
  • Configuration settings
  • Exposed monitoring information


Not all compatibility commitments are the same. We need to spend significantly more time on public APIs as these can break code for users. They cause people to rebuild code and lead to compatibility issues in large multi-dependency projects (which end up requiring multiple incompatible versions). Configuration, monitoring, and command line tools can be faster and looser — changes here will break monitoring dashboards and require a bit of care during upgrades but aren't a huge burden.

For the most part monitoring, command line tool changes, and configs are added with new features so these can be done with a single FLIP.

What should be included in a FLIP?

A FLIP should contain the following sections:

  • Motivation: describe the problem to be solved
  • Proposed Change: describe the new thing you want to do. This may be fairly extensive and have large subsections of its own. Or it may be a few sentences, depending on the scope of the change.
  • New or Changed Public Interfaces: impact to any of the "compatibility commitments" described above. We want to call these out in particular so everyone thinks about them.
  • Migration Plan and Compatibility: if this feature requires additional support for a no-downtime upgrade describe how that will work
  • Rejected Alternatives: What are the other alternatives you considered and why are they worse? The goal of this section is to help people understand why this is the best solution now, and also to prevent churn in the future when old alternatives are reconsidered.

Who should initiate the FLIP?

Anyone can initiate a FLIP but you shouldn't do it unless you have an intention of getting the work done to implement it (otherwise it is silly).

Process

Here is the process for making a FLIP:

  1. Create a page which is a child of this one. Take the next available FLIP number and give your proposal a descriptive heading. e.g. "FLIP 42: Enable Flink Streaming Jobs to stop gracefully". If you don't have the necessary permissions for creating a new page, please ask on the development mailing list.
  2. Fill in the sections as described above
  3. Start a [DISCUSS] thread on the Apache mailing list. Please ensure that the subject of the thread is of the format [DISCUSS] FLIP-{your FLIP number} {your FLIP heading} The discussion should happen on the mailing list not on the wiki since the wiki comment system doesn't work well for larger discussions. In the process of the discussion you may update the proposal. You should let people know the changes you are making.
  4. Once the proposal is finalized call a [VOTE] to have the proposal adopted. These proposals are more serious than code changes and more serious even than release votes. The criteria for acceptance is lazy majority.
  5. Please update the FLIP wiki page, and the index below, to reflect the current stage of the FLIP after a vote. This acts as the permanent record indicating the result of the FLIP (e.g., Accepted or Rejected). Also report the result of the FLIP vote to the voting thread on the mailing list so the conclusion is clear.

FLIP round-up

Next FLIP Number: 57

Use this number as the identifier for your FLIP and increment this value.

Adopted/Accepted but unreleased FLIPs

FLIPTarget ReleaseLink to Discussion Thread
FLIP-1 : Fine Grained Recovery from Task Failures1.9http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-1-Fine-grained-recovery-from-task-failures-td12510.html
FLIP-16: Loop Fault ToleranceTBDhttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-13-Consistent-Processing-with-Loops-td14149.html
FLIP-23 - Model Serving  TBDhttps://lists.apache.org/thread.html/968a633aee01463dc844d35ffa9a9c9e46b638158a8d32f1c4b4dea2@%3Cdev.flink.apache.org%3E

FLIP-29: Support map/flatMap/aggregate/flatAggregate on TableAPI

TBDhttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Table-API-Enhancement-Outline-td25070.html
FLIP-30: Unified Catalog APIsTBDhttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Integrate-Flink-SQL-well-with-Hive-ecosystem-tt24538.html
FLIP-31: Pluggable Shuffle Manager1.9https://lists.apache.org/thread.html/73d7c62a6c6dc7d3fddda79701c71b921e635766737c1b33b84df234@%3Cdev.flink.apache.org%3E
FLIP-32: Restructure flink-table for future contributionsTBDhttps://lists.apache.org/thread.html/13c605185f81958ef63c1090f72c82ba979f90f283406f90217dd5db@%3Cdev.flink.apache.org%3E

FLIP-35: Support Chinese Documents and Website

TBDhttps://lists.apache.org/thread.html/1d079e7218b295048c990edcb9891b5935d02eeac0927c89696aad42@%3Cdev.flink.apache.org%3E
FLIP-37: Rework of the Table API Type System1.9

https://lists.apache.org/thread.html/e1eca89862d59f78ba86c9a2596a810fdb1d42bcba7356fbda51e60e@%3Cdev.flink.apache.org%3E

FLIP-39:Flink ML pipeline and ML libs

1.9https://mail-archives.apache.org/mod_mbox/flink-dev/201904.mbox/%3CCAAjCYUWJs6kaf-n4rewm3vY2LJC_sgrr8ExexnR9-yV%3DOhxGjQ%40mail.gmail.com%3E
FLIP-41: Unify Canonical Binary Format for Keyed StateTBDhttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-41-Unify-Keyed-State-Snapshot-Binary-Format-for-Savepoints-td29197.html
FLIP-43: State Processing API1.9

http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-42-Savepoint-Connector-td29231.html

FLIP-51: Rework of the Expression Design1.10http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-51-Rework-of-the-Expression-Design-td31653.html
FLIP-52 : Remove legacy Program interface.1.10 https://lists.apache.org/thread.html/0dbd0a4adf9ad00d6ad869dffc8820f6ce4c1969e1ea4aafb1dd0aa4@%3Cdev.flink.apache.org%3E

FLIPs under discussion

FLIPStateLink to Discussion Thread
FLIP-5: Only send data to each taskmanager once for broadcastsDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-5-Only-send-data-to-each-taskmanager-once-for-broadcasts-td12677.html
FLIP-14: crossGroup OperatorDiscuss
FLIP-15: Redesign Iterations (Scoping, Flow Control and Termination)Discusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-13-Consistent-Processing-with-Loops-td14149.html
FLIP-17 Side Inputs for DataStream APIAccepted
FLIP-18: Code Generation for improving sorting performanceDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-18-Code-Generation-for-improving-sorting-performance-tc16486.html
FLIP-21 - Improve object Copying for Streaming RuntimeDiscusshttps://lists.apache.org/thread.html/d1cd94a97eaba49cb05d8891b133374cd59021d6c5f04e9a19f0a6df@%3Cdev.flink.apache.org%3E
FLIP-22: Eager State DeclarationDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-22-Eager-State-Declaration-td18562.html
FLIP-26: Service AuthorizationDiscuss
FLIP-27: Refactor Source InterfaceDiscusshttps://lists.apache.org/thread.html/70484d6aa4b8e7121181ed8d5857a94bfb7d5a76334b9c8fcc59700c@%3Cdev.flink.apache.org%3E

FLIP-33: Standardize Connector Metrics

Discusshttps://lists.apache.org/thread.html/65078bad6e04bbbb7578d502e1e5d92026f13fd9648725f5b74ed330@%3Cdev.flink.apache.org%3E
FLIP-36: Support Interactive Programming in FlinkDiscusshttps://lists.apache.org/thread.html/5f4961f1dfe23204631fd6f2b3227724ce9831f462737f51742a52c1@%3Cdev.flink.apache.org%3E
FLIP-38: Python Table APIDiscusshttps://lists.apache.org/thread.html/08aad344e43a409c318d4bda67bac1fa7beecc6abd8539c53021d310@%3Cdev.flink.apache.org%3E
FLIP-40: Flink DriverDiscusshttps://lists.apache.org/thread.html/f5dd0920ffd1361513c1d73352788086773595777cae9d70713fc1ac@%3Cdev.flink.apache.org%3E
FLIP-44: Support Local Aggregation in FlinkDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-44-Support-Local-Aggregation-in-Flink-td29513.html
FLIP-45: Reinforce Job Stop SemanticDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-45-Reinforce-Job-Stop-Semantic-td30161.html
FLIP-46: Graceful Shutdown Handling by UDFsDiscusshttps://lists.apache.org/x/thread.html/dd3b941621645bd78e119c44667c18cfef23f89b8721eeb9f8926fc6@%3Cdev.flink.apache.org%3E
FLIP-47: Checkpoints vs. SavepointsDiscuss https://lists.apache.org/x/thread.html/dc5ab1fad1a38b58aa762303922c04f1310d49dbd3c3ac8264cbe27f@%3Cdev.flink.apache.org%3E
FLIP-49: Unified Memory Configuration for TaskExecutorsDiscuss http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-49-Unified-Memory-Configuration-for-TaskExecutors-td31436.html
FLIP-50: Spill-able Heap Keyed State BackendDiscusshttps://lists.apache.org/thread.html/a10cae910f92936783e59505d7b9fe71c5f66ceea4c1c287c87164ae@%3Cdev.flink.apache.org%3E
FLIP-53: Fine Grained Operator Resource ManagementDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-53-Fine-Grained-Resource-Management-td31831.html
FLIP-54: Evolve ConfigOption and ConfigurationDiscusshttps://lists.apache.org/thread.html/a56c6b52e5f828d4a737602b031e36b5dd6eaa97557306696a8063a9@%3Cdev.flink.apache.org%3E
FLIP-56: Dynamic Slot AllocationDiscusshttp://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-56-Dynamic-Slot-Allocation-td31960.html


Implemented and Released FLIPs

FLIPFirst Release VersionLink to Discussion Thread
FLIP-10: Unify Checkpoints and Savepoints1.2http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-10-Unify-Savepoints-and-Checkpoints-tt13149.html
FLIP-2 Extending Window Function Metadata1.3http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-2-Extending-Window-Function-Metadata-td12522.html
FLIP-3 - Improving Organization of Documentation1.2http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-3-Organization-of-Documentation-td12560.html
FLIP-4: Enhance Window Evictor1.2http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Enhance-Window-Evictor-in-Flink-td12406.html
1.5

http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-6-Flink-Deployment-and-Process-Model-Standalone-Yarn-Mesos-Kubernetes-etc-td12685.html

FLIP-7: Expose metrics to WebInterface1.2
FLIP-8: Rescalable Non-Partitioned State1.2http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-8-Rescalable-Non-Partitioned-State-tt12999.html
FLIP-11: Table API Stream Aggregations1.3http://mail-archives.apache.org/mod_mbox/flink-dev/201609.mbox/%3C4f43ebc4-7c5d-711a-5d9f-3e179bdcd33c%40apache.org%3E
FLIP-12: Asynchronous I/O Design and Implementation1.2
FLIP-13: SideOutputs in Flink1.3http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Discuss-FLIP-13-Side-Outputs-in-Flink-td14204.html

FLIP-19: Improved BLOB storage architecture

1.4http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-19-Improved-BLOB-storage-architecture-td18092.html
FLIP-20: Integration of SQL and CEP1.7

http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Integrating-Flink-Table-API-amp-SQL-with-CEP-td17964.html

FLIP-24 - SQL Client1.5http://mail-archives.apache.org/mod_mbox/flink-dev/201712.mbox/%3C8a9d718b-5dae-0fe2-1da6-a8d557d45582%40apache.org%3E
FLIP-25 TTL for State1.6https://lists.apache.org/thread.html/05d187544229d3cbb46270a9e1f09001e02904b8d3e6dea71593a18f@%3Cdev.flink.apache.org%3E
FLIP-34: Terminate/Suspend Job with Savepoint1.9https://lists.apache.org/thread.html/b8d2f3209e7ca7467af6037383ade6c14c35276f7acb2bbbc9a50c0f@%3Cdev.flink.apache.org%3E

Discarded FLIPs

FLIPComment

FLIP-28: Long-term goal of making flink-table Scala-free

This FLIP is part of the bigger vision described in FLIP-32.
FLIP-9: Trigger DSLDecided to not work on that, at least at the moment.
  • No labels