-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
References and discussions #1
Comments
What I've tried so far:
Things to try:
The main issue I am facing is the number of models, and their difference in flop regimes. It is difficult to generalize the findings for more than 2 or 3 models at a time. So we might need to start from scratch with every model. Please share your thoughts on the same. |
|
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
@sayakpaul
Following some important links for reference:
If possible let's keep the discussions and updates regarding training here. As discussed I'll create an issue for each experiment we want to run.
The text was updated successfully, but these errors were encountered: