本次更新内容
- 支持 RBAC 角色权限控制,对所有引擎、所有语言API均有效;文档
- 自动清理旧的 compaction 数据,支持分区级生命周期(TTL);文档
- 升级 Flink 版本到 1.17,并支持批模式下行级别更新和删除;
- 优化整库同步 Flink 作业,吞吐提升 80%: #307 ;
- 支持 Presto 读取;文档
- 支持原生 Python 读取,提供 PyTorch、HuggingFace 的集成。文档
更新日志
- update maven versions
- Support cdc column value change in compaction (#336)
- [Docs ] Refine flink sql and python docs (#337)
- Support view、batch update、batch delete in flink (#332)
- [Bug]filter should not pushdown before merge on read (#310)
- [Bug] turn off native meta query and set prefetch_size=1 temporarily (#333)
- add workspace and rbac docs (#331)
- add doc machine-learning-support.md (#330)
- [Docs] Add presto connector deployment docs (#329)
- [Docs] Usage on auto table clean (#326)
- update docs and readme for release 2.4
- update docs and readme for release 2.4 (#328)
- [Python] Examples using Python API for AI model training (#327)
- [Python][Dataset] Update Python dataset api for LakeSoul (#325)
- list namespace should return empty array (#323)
- support query metadata with null string (#324)
- [RBAC] Set hdfs dir owner (#321)
- bump version to 2.4.0 (#319)
- merge native-io and native-metadata modules (#318)
- ignore exception when hadoop env missing (#317)
- add scala in common to address build in idea intellij (#316)
- Presto Connector Support (#314)
- [Flink] cdc supplement data delay check mechanism and fix logicallyDro…
- retry when native metadata client fails (#313)
- add arg exclude_partition at get_arrow_schema_by_table_name (#312)
- [NativeIO] add hdfs feature in lakesoul-io-c (#311)
- [Flink] Optimize CDC sink serde with Fury (#307)
- rollback flink cdc to 2.3.0 and supplement tables check in benchmark (#309)
- [Python][Dataset] PyArrow and PyTorch dataset api for LakeSoul (#308)
- [Python] C callback with data (#306)
- [Python][Native-Metadata] Python interface of lakesoul metadata (#305)
- clean old compaction data and redundant data (#304)
- upgrade flink cdc connector to 2.4 (#303)
- update docs (#298)
- fix jackson-core package in flink (#297)
- [Native-Metadata] Rust implementation of DAO layer (#294)
- fix apache license (#293)
- Add Built-in RBAC support (#292)
- update arrow version (#290)
- [Python][NativeIO] Add C interface definition (#291)
- Upgrade Flink to 1.17 (#288)
- [Flink] implement filter pushdown and fix partition pushdown in flink (#287)
- [NativeIO] Upgrade datafusion to 27 (#282)
- add 2.3.0 release blog (#281)
- [Doc] Add Spark session catalog doc (#279)
- [Project] add/change spdx header to all files (#278)
- format table path with hdfs configuration (#277)
- [Metadata] refractor metadata entity with protobuf code-gen (#276)
- [RBAC] PG workspace implementation (#274)
- partition rollback/cleanup add timeZoneId (#275)
完整的更新日志: https://gitee.com/meta-soul/LakeSoul/compare/v2.3.1...v2.4.0