Description
Arctic currently runs only within the Hadoop ecosystem and cannot be used effectively with other storage setups such as Glue + S3. It should support more types of storage systems.
Use case/motivation
Arctic should be able to manage Iceberg tables in S3 + Glue scenarios, optimize those tables for better performance, and provide features such as orphan file cleanup.
Describe the solution
The ArcticFileIO interface should be refactored to reduce its strong dependency on Hadoop.
The Optimizer logic needs to be retested to ensure that it supports non-Hadoop systems.
The logic for cleaning orphan files needs to be adapted to storage that is not a file system. In extreme cases where the storage system does not support list operations, this feature may need to be degraded or disabled entirely.
Because cloud object storage is usually billed by the number of API calls, the optimizer checker logic needs to be adjusted so that frequent checks do not generate unnecessary costs.
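To make the interface refactoring and the orphan-file fallback more concrete, here is a minimal Java sketch. It is not the actual ArcticFileIO API: `StorageFileIO`, `InMemoryObjectStore`, and `OrphanCleanupSketch` are hypothetical names introduced only for illustration, and the in-memory store stands in for a real backend such as S3. The idea is that listing becomes an optional capability, so cleanup can be skipped when the backend cannot list.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Optional;
import java.util.Set;

// Hypothetical storage-agnostic interface; NOT the actual ArcticFileIO API.
interface StorageFileIO {
    void delete(String path);

    // Returns the paths under a prefix, or Optional.empty() when the
    // backend does not support list operations at all.
    Optional<List<String>> list(String prefix);
}

// In-memory stand-in for an object store, used only for this illustration.
class InMemoryObjectStore implements StorageFileIO {
    private final Set<String> objects = new HashSet<>();
    private final boolean supportsList;

    InMemoryObjectStore(boolean supportsList) {
        this.supportsList = supportsList;
    }

    void put(String path) {
        objects.add(path);
    }

    boolean contains(String path) {
        return objects.contains(path);
    }

    @Override
    public void delete(String path) {
        objects.remove(path);
    }

    @Override
    public Optional<List<String>> list(String prefix) {
        if (!supportsList) {
            return Optional.empty();
        }
        // Copy matching paths so callers can delete while iterating the result.
        List<String> result = new ArrayList<>();
        for (String path : objects) {
            if (path.startsWith(prefix)) {
                result.add(path);
            }
        }
        return Optional.of(result);
    }
}

class OrphanCleanupSketch {
    // Deletes every file under the prefix that table metadata does not reference.
    // Returns the number of deleted files, or -1 when cleanup was skipped
    // because the backend cannot list.
    static int cleanOrphans(StorageFileIO io, String prefix, Set<String> referenced) {
        Optional<List<String>> listed = io.list(prefix);
        if (!listed.isPresent()) {
            return -1; // degraded mode: feature unavailable on this backend
        }
        int deleted = 0;
        for (String path : listed.get()) {
            if (!referenced.contains(path)) {
                io.delete(path);
                deleted++;
            }
        }
        return deleted;
    }
}
```

A real implementation would probably also skip recently written files so that in-flight commits are not deleted, and would batch list and delete calls to keep the number of billed API requests low.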