support telechat2 #35415
Signed-off-by: xiangw2 <[email protected]>
Signed-off-by: shunxing12345 <[email protected]>
Hi team, 👋 Just following up on this PR. I understand you may have a lot on your plate, but I’d love to hear your thoughts or feedback on this contribution whenever you have time. Please let me know if there’s anything I can do to assist or improve the PR! Thanks for your time and all the great work you do maintaining this repository! 🙏
Hey @shunxing12345 👋 Most of us are on holiday this week for the new year. The rhythm will pick up early next week and the team will review your PR as soon as possible. Thanks a lot for your patience and your contribution 🤗
@Cyrilvallez Thank you for your support! I noticed that some classes or methods differ only in terms of their parameters, while the overall logic remains consistent. To avoid duplicating code, could you suggest a best practice for handling such cases? For example:
I truly appreciate your insights and guidance on this matter. Thank you for your time and support!
Thanks for the docs!
…sformers into fix_conflicts
@stevhliu Thank you for your review! I have updated the docs as requested. Please let me know if there's anything else that needs adjustment. 😊
Hi @Cyrilvallez, thank you so much for your feedback and detailed guidance! 😊 I’ve addressed the issue and added the modular file as suggested. Please let me know if there’s anything else I should refine or improve further. Thanks again for your support!
Hey! Basically, the easiest approach (if you do not already have many model checkpoints on the Hub) is to match your config argument names and model layer names to the existing ones in Llama. It is extremely important that everyone always sees the same names, so that they know immediately what they are instead of having to figure it out for every model/config. For the config this is straightforward; the model checkpoints, however, will require a conversion script, such as https://github.com/huggingface/transformers/blob/main/src/transformers/models/glm/convert_glm_weights_to_hf.py, to change the layer names and, in your case, split the k/v layers in two!
Doing so will greatly simplify the modular file, as you will be able to inherit everything directly! (I did not review the whole file, just the beginning, so that you can understand my point.)
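The conversion step described above (renaming layers to Llama-style names and splitting a fused k/v projection in two) could look roughly like the sketch below. It is not the GLM script and not this PR's code. The source key names (`transformer.h.N.self_attention.*`) are hypothetical, and the fused weight is assumed to stack all key rows followed by all value rows; a per-head interleaved layout would need a reshape before splitting. Plain nested lists stand in for tensors so the example runs without extra dependencies; the same `len` and slicing operations apply to tensors split along their first dimension.

```python
import re

# Hypothetical source-name patterns mapped to Llama-style names.
# The source names are illustrative assumptions, not the PR's real keys.
RENAME_RULES = [
    (r"^transformer\.h\.(\d+)\.self_attention\.query\.",
     r"model.layers.\1.self_attn.q_proj."),
    (r"^transformer\.h\.(\d+)\.self_attention\.dense\.",
     r"model.layers.\1.self_attn.o_proj."),
]

# Hypothetical name of the fused key/value projection.
KV_PATTERN = re.compile(
    r"^transformer\.h\.(\d+)\.self_attention\.key_value\.(weight|bias)$"
)


def split_rows_in_two(tensor):
    """Split along the first dimension into two equal halves.

    Assumes key rows come first and value rows second.
    """
    rows = len(tensor)
    if rows % 2 != 0:
        raise ValueError(f"cannot split {rows} rows into two equal halves")
    half = rows // 2
    return tensor[:half], tensor[half:]


def convert_state_dict(old_state_dict):
    """Return a new dict with renamed keys and split k/v entries."""
    new_state_dict = {}
    for name, tensor in old_state_dict.items():
        kv_match = KV_PATTERN.match(name)
        if kv_match:
            layer, kind = kv_match.groups()
            key_part, value_part = split_rows_in_two(tensor)
            prefix = f"model.layers.{layer}.self_attn"
            new_state_dict[f"{prefix}.k_proj.{kind}"] = key_part
            new_state_dict[f"{prefix}.v_proj.{kind}"] = value_part
            continue

        # Apply the first matching rename rule; keep the name otherwise.
        new_name = name
        for pattern, replacement in RENAME_RULES:
            new_name, count = re.subn(pattern, replacement, new_name)
            if count:
                break
        new_state_dict[new_name] = tensor
    return new_state_dict


# Toy checkpoint with one layer; nested lists stand in for tensors.
old = {
    "transformer.h.0.self_attention.query.weight": [[1, 2], [3, 4]],
    "transformer.h.0.self_attention.key_value.weight": [
        [5, 6], [7, 8], [9, 10], [11, 12],
    ],
    "transformer.h.0.self_attention.dense.weight": [[13, 14], [15, 16]],
}
converted = convert_state_dict(old)
```

Once the checkpoint keys match the Llama names, the modeling code can reuse the existing layer definitions instead of redefining them, which is the simplification the comment above points to.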
Hi @Cyrilvallez, thanks for your guidance and suggestions! I’ve addressed the issues and completed the necessary updates as per your feedback. Please let me know if there’s anything else I should refine. 😊 Best regards!
What does this PR do?
Description
This PR introduces TeleChat2, a new model designed for [specific use case, e.g., multilingual conversational AI or general-purpose language modeling]. TeleChat2 supports model sizes of [e.g., 3B, 7B, and 35B] and has been pre-trained and fine-tuned on large datasets across English and Chinese.
Motivation
Adding TeleChat2 to the Transformers repository allows the community to benefit from its capabilities, leveraging state-of-the-art performance on [specific benchmarks or tasks]. This inclusion aligns with the repository's goal of supporting a diverse range of models and architectures.
Key Changes
TeleChat2Config for model configuration.
TeleChat2Model, TeleChat2ForCausalLM, and corresponding tokenizers.
AutoModel and other relevant files.
Documentation and Tests
Before submitting
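Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.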
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.