Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Recompute KV cache for Phi3 when switching from short to long factor #1161

Merged
merged 7 commits into from
Jan 8, 2025

Conversation

ajindal1
Copy link
Collaborator

Recompute KV cache for Phi3 when switching from short to long factor.

Verified that this PR fixes the issue for:

  1. Phi3.5 mini
  2. Phi3 mini 128K
  3. Phi3 small
  4. Phi3 medium

src/generators.cpp Outdated Show resolved Hide resolved
src/generators.cpp Outdated Show resolved Hide resolved
src/generators.cpp Outdated Show resolved Hide resolved
src/generators.cpp Outdated Show resolved Hide resolved
src/generators.cpp Show resolved Hide resolved
src/generators.cpp Show resolved Hide resolved
src/generators.cpp Outdated Show resolved Hide resolved
src/generators.cpp Outdated Show resolved Hide resolved
src/generators.cpp Outdated Show resolved Hide resolved
@ajindal1 ajindal1 merged commit 41c2543 into main Jan 8, 2025
14 checks passed
@ajindal1 ajindal1 deleted the abjindal/phi3_reset_compute_cache branch January 8, 2025 21:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants