Retries via LiteLLM RetryPolicy #1866
Conversation
Signed-off-by: dbczumar <[email protected]>
cc @okhat - feel free to test this out in the meantime while I work with @krrishdholakia on BerriAI/litellm#6916 |
```diff
@@ -36,7 +37,7 @@ def __init__(
     max_tokens: int = 1000,
     cache: bool = True,
     callbacks: Optional[List[BaseCallback]] = None,
-    num_retries: int = 3,
+    num_retries: int = 8,
```
Empirically, this provides roughly one minute of retries, which is typically necessary to overcome rate-limit errors (providers like Azure OpenAI and Databricks enforce RPM rate limits).
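To see where "roughly one minute" comes from, here is a back-of-the-envelope sketch. The exact schedule depends on LiteLLM's backoff implementation; this assumes a hypothetical exponential backoff with a 0.25 s base delay that doubles on each attempt:

```python
def total_backoff(num_retries: int, base: float = 0.25) -> float:
    """Sum of exponential backoff delays across all retries.

    Assumes delay doubles each attempt: base, 2*base, 4*base, ...
    (Illustrative only; LiteLLM's real schedule may differ.)
    """
    return sum(base * (2 ** i) for i in range(num_retries))

# 8 retries: 0.25 + 0.5 + 1 + 2 + 4 + 8 + 16 + 32 = 63.75 seconds
print(total_backoff(8))
```

Under these assumed parameters, 8 retries spend about 64 seconds waiting, which is enough to ride out a one-minute RPM rate-limit window.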
```diff
@@ -102,14 +103,13 @@ def __call__(self, prompt=None, messages=None, **kwargs):
         outputs = [
             {
                 "text": c.message.content if hasattr(c, "message") else c["text"],
-                "logprobs": c.logprobs if hasattr(c, "logprobs") else c["logprobs"]
+                "logprobs": c.logprobs if hasattr(c, "logprobs") else c["logprobs"],
```
Just the linter being itself...
```diff
@@ -7,6 +7,7 @@ model_list:
     model: "dspy-test-provider/dspy-test-model"

 litellm_settings:
+  num_retries: 0
```
Disable retries on the server side to ensure that server retries don't stack atop client retries, which can cause test failures due to a mismatch between the expected and actual number of retries.
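The stacking problem is multiplicative, not additive: each client-side attempt can itself be retried by the proxy. A minimal sketch of the worst-case arithmetic (illustrative helper, not part of the codebase):

```python
def worst_case_requests(client_retries: int, server_retries: int) -> int:
    """Worst-case upstream requests for a single logical call.

    Each of the (1 + client_retries) client attempts can trigger up to
    (1 + server_retries) server-side attempts, so the counts multiply.
    """
    return (client_retries + 1) * (server_retries + 1)

# With num_retries=8 on the client and 3 on the proxy, one call can
# produce 36 upstream requests; setting server retries to 0 keeps it at 9.
print(worst_case_requests(8, 3))
print(worst_case_requests(8, 0))
```

This is why the test proxy config pins `num_retries: 0` server-side: with it, the observed request count matches exactly what the client-side policy dictates.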
```diff
@@ -38,7 +38,7 @@ dependencies = [
     "pydantic~=2.0",
     "jinja2",
     "magicattr~=0.1.6",
-    "litellm",
+    "litellm==1.55.3",
```
LiteLLM version 1.55.3 is the only version that correctly supports passing `retry_policy` to `completion()` while respecting the number of retries specified by the policy.
```
cloudpickle
jinja2
```
Linter reordering
This depends on BerriAI/litellm#6916 being merged and released (may take some coordination)