Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fine-Tune training for new and rare Chinese characters in Traditional Chinese #1361

Open
6692a opened this issue Jan 8, 2025 · 0 comments
Open

Comments

@6692a
Copy link

6692a commented Jan 8, 2025

Hello
Can I ask a question?
How to fine-tune training for new and rare Chinese characters in Traditional Chinese?
For rare words that are not established in the vocabulary, the official teaching is currently used, and the parameters of config.yaml are
FT: True, new_prediction: True

A small number of training sets/validation sets are generated, first generated with only one font, but with different background colors and tilts. Finally, the pre-trained model is used for fine-tuning training, but the results are almost always worse than the original pre-trained model.
In other words, The original official Traditional Chinese model worked well, but after the training steps, the text content that could have been recognized normally had errors

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant