Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scaling laws data #42

Open
borgr opened this issue Feb 29, 2024 · 1 comment
Open

Scaling laws data #42

borgr opened this issue Feb 29, 2024 · 1 comment

Comments

@borgr
Copy link

borgr commented Feb 29, 2024

I am researching scaling laws across models and architectures among other things and was wondering if you could share the logs\training losses\val eval of the models you have ran for the scaling law experiments in DeepSeek LLM. If you have other similar losses or results it would also be interesting. It might not be super well curated, anything can be helpful.
Thanks

@borgr
Copy link
Author

borgr commented Feb 29, 2024

Also the model losses from the figure, are they available somewhere?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant