-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for DeepSeek language models in STORM Wiki pipeline #84
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @rmcc3 , thank you so much for the prompt response! The quality of the example outputs is pretty good - we are very happy to support DeepSeek models in our project. Could you make the following change so that I can merge this PR?
- Resolve the merge conflict. (We recently have breaking change in order to support installing our project via
pip
.) - Add a few lines (see my comment) to
DeepSeekModel
to ensure the call history is tracked. Currently, llm_call_history.jsonl is empty in your shared output. After adding these lines, it shall include call history in the session.
Seems updating to head caused this to close. Working on the change now. |
Here is the call history (had to upload as |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thank you so much!!
Add support for DeepSeek language models in STORM Wiki pipeline
Description
This pull request adds support for using DeepSeek language models in the STORM Wiki pipeline, providing an alternative to the existing OpenAI models. The integration allows users to easily switch between OpenAI and DeepSeek models, enhancing the flexibility and capabilities of the STORM Wiki system.
Key Changes
DeepSeekModel
class insrc/lm.py
, which is compatible with the existingdspy.OpenAI
interface.examples/run_storm_wiki_deepseek.py
to demonstrate how to use DeepSeek models with the STORM Wiki pipeline.STORMWikiRunner
class to be model-agnostic, ensuring compatibility with both OpenAI and DeepSeek models.How to Use
Users can now run the STORM Wiki pipeline with DeepSeek models by using the new
run_storm_wiki_deepseek.py
script. The script allows configuration of various DeepSeek-specific parameters such as model choice, temperature, and top_p sampling. The API key should be set withDEEPSEEK_API_KEY
. The DeepSeek base can be set withDEEPSEEK_API_BASE
, if needed.Testing
Future Considerations