-
Notifications
You must be signed in to change notification settings - Fork 289
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
disable the config enable-partition-separator #11979
Comments
Problems introduced by this pr: #8617 |
Because different partitions may be on different cdc nodes, it is hard to coordinate between them. So we must remove the config |
When the sink type is storage, for partition table, the default file path for CDC on remote storage is {scheme}://{prefix}/{schema}/{table}/{table-version-separator}/{partition-separator}/{date-separator}/CDC{num}.{extension}. And the {partition-separator} segment represents the physical ID of the partition. However, when the configuration option If different partitions are handled by separate CDC nodes, maintaining a globally unique {num} counter to avoid filename conflicts becomes extremely complex. This issue arises when such conflicts lead to file overwrites and data loss. To address this problem, it is recommended to remove the |
@lidezhu given that we may have users that have configured the solution this way already, we should go through the deprecation and removal process for the setting (as we should do with other configuration changes). To do that we would first have to issue warnings when we detect this setting being used, before we make a change to remove the setting in a later release. This would give an opportunity for users to correct their configurations prior to a subsequent release beginning to raise invalid parameter errors when they try and use the feature. |
OK, when this setting is detected and a warning is issued, should we proceed with the configuration value provided by the user, or should we disregard their input and forcibly set the configuration to true? @benmeadowcroft |
@benmeadowcroft Do you have a specific suggestion about the versions and the related actions? For example:
|
Yes, I think that the following would work:
|
What did you do?
What did you expect to see?
All rows are written to storage services;
What did you see instead?
Some rows are lost;
Versions of the cluster
Upstream TiDB cluster version (execute
SELECT tidb_version();
in a MySQL client):(paste TiDB cluster version here)
Upstream TiKV version (execute
tikv-server --version
):(paste TiKV version here)
TiCDC version (execute
cdc version
):(paste TiCDC version here)
The text was updated successfully, but these errors were encountered: