[Flink] Arctic table supports real-time dimension table join #94

Closed
YesOrNo828 opened this issue Aug 1, 2022 · 2 comments · Fixed by #127

YesOrNo828 commented Aug 1, 2022

Background:
There is demand for real-time data widening, that is, enriching a stream with dimension-table columns. Hive currently supports lookup joins, but that solution is not production-ready: the Hive table has to be loaded into memory, so large tables are prone to OOM errors. In addition, neither Iceberg nor Hudi supports lookup joins.

Here is a summary of the proposal:
Flink provides an event-time temporal join. The right table is used as a versioned table, and its data can be kept in RocksDB instead of in memory.

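-- Create the left table, using LOCALTIMESTAMP as the event time.
create table source (
  ...,
  arctic_process_time AS LOCALTIMESTAMP,
  WATERMARK FOR arctic_process_time AS arctic_process_time
) with (...);

-- Create the Arctic dimension table with the dimension-table option enabled.
create table arctic_dim (...) with ('connector'='arctic', 'dim-table.enabled'='true');

-- Event-time temporal join: each source row is matched against the arctic_dim version valid at its arctic_process_time.
select * from source as O left join arctic_dim FOR SYSTEM_TIME AS OF O.arctic_process_time as P on O.id = P.id;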

When 'dim-table.enabled' is set to 'true', the Arctic source automatically creates a custom watermark strategy.
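
To make the "versioned table" idea concrete, here is a minimal sketch in plain Python of the lookup an event-time temporal join performs: for each left row, pick the latest right-side version whose timestamp is at or before the left row's event time. This is not Flink or Arctic code. The names VersionedTable, upsert, as_of and temporal_left_join are hypothetical, an in-memory dict stands in for the RocksDB-backed state, and watermarks and state cleanup are ignored.

```python
from bisect import bisect_right
from collections import defaultdict


class VersionedTable:
    """Hypothetical stand-in for the versioned (right-side) table.

    Keeps every version of each key, sorted by the time it became valid.
    In the proposal this state would be held in RocksDB, not in a dict.
    """

    def __init__(self):
        self._times = defaultdict(list)  # key -> sorted version timestamps
        self._rows = defaultdict(list)   # key -> rows, parallel to _times

    def upsert(self, key, valid_from, row):
        # Insert the new version at its sorted position for this key.
        idx = bisect_right(self._times[key], valid_from)
        self._times[key].insert(idx, valid_from)
        self._rows[key].insert(idx, row)

    def as_of(self, key, ts):
        # Latest version with valid_from <= ts, or None if there is none.
        times = self._times.get(key, [])
        idx = bisect_right(times, ts)
        return self._rows[key][idx - 1] if idx else None


def temporal_left_join(left_rows, dim):
    """Left join: unmatched left rows are kept with None on the right."""
    return [(row, dim.as_of(row["id"], row["ts"])) for row in left_rows]


if __name__ == "__main__":
    dim = VersionedTable()
    dim.upsert(1, 10, {"name": "a"})
    dim.upsert(1, 20, {"name": "b"})
    orders = [{"id": 1, "ts": 5}, {"id": 1, "ts": 15}, {"id": 2, "ts": 30}]
    for left, right in temporal_left_join(orders, dim):
        print(left, right)
```

In the real join, Flink holds left rows back until the watermark passes their event time, which is why the watermark strategy on both tables matters; the sketch skips that step and assumes all versions are already loaded.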

YesOrNo828 added the module:mixed-flink (Flink module for Mixed Format) label Aug 1, 2022
YesOrNo828 added this to the Release 0.3.1 milestone Aug 1, 2022
zstraw self-assigned this Aug 12, 2022
@YesOrNo828

Hello @zstraw, could you share your design document here?


zstraw commented Aug 17, 2022
