[ModelRunner] Fix stop and bad words list contiguous for offsets #1815

Marks101 · 2024-06-20T09:32:44Z

In our regression tests of the ModelRunner we noticed that in the current main branch (Jun 18, 2024) the stop_words_list feature does not work properly for batch_size > 1. The issue seems to be that the token arrays are not laid out contiguously in memory because of the transpose in this line:

TensorRT-LLM/tensorrt_llm/runtime/generation.py

Line 104 in 2a115da

return np.array([flat_ids, offsets], dtype="int32").transpose((1, 0, 2))

Because the transposed array is a non-contiguous view, the array offsets that are created from it are invalid.
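The effect can be reproduced with NumPy alone. The sketch below uses made-up token IDs and offsets (the values and the two-entry batch are assumptions for illustration, not data from the test suite) and builds the array the same way as the line quoted above. It then uses `np.ascontiguousarray` as one possible way to restore a contiguous layout; this is not necessarily the exact change made in the commit.

```python
import numpy as np

# Hypothetical stand-in data: 2 batch entries, each padded to length 4.
flat_ids = [[11, 12, 13, 0], [21, 22, 0, 0]]
offsets = [[2, 3, -1, -1], [2, -1, -1, -1]]

# Same construction as the quoted line: stack to (2, batch, len),
# then swap the first two axes to get (batch, 2, len).
stacked = np.array([flat_ids, offsets], dtype="int32")
transposed = stacked.transpose((1, 0, 2))

# transpose() only changes strides; the underlying buffer keeps the old order.
print(transposed.flags["C_CONTIGUOUS"])  # False for this batch of 2

# Order of the elements as they actually sit in memory.
memory_order = stacked.ravel().tolist()

# Copy into a C-contiguous buffer so memory order matches the logical order.
fixed = np.ascontiguousarray(transposed)
print(fixed.flags["C_CONTIGUOUS"])  # True

logical_order = fixed.ravel().tolist()
print(memory_order == logical_order)  # False: the two layouts differ
```

Any consumer that reads the raw buffer, for example after handing the data to a kernel by pointer, sees `memory_order` rather than `logical_order` unless the array is made contiguous first.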

In examples/run.py this feature was deactivated for a long time, but it seems that the contiguous() call was originally implemented here:

TensorRT-LLM/examples/run.py

Line 403 in b777bd6

    # stop_words_list = torch.Tensor(stop_words_list).to(torch.int32).to("cuda").contiguous()

Thanks for taking a look at this.

MartinMarciniszyn · 2024-06-24T09:07:31Z

@Funatiq , could you please merge this into the main branch?

nv-guomingz · 2024-06-24T12:30:43Z

> @Funatiq , could you please merge this into the main branch?

@byshiue already merged this PR into the internal code base this morning.

nv-guomingz · 2024-07-04T15:21:20Z

@Marks101 thanks for your contribution to TRT-LLM. This MR has now been merged upstream.

Python generation: make stop and bad words list contiguous

837e536

byshiue self-assigned this Jun 23, 2024

byshiue added the triaged Issue has been triaged by maintainers label Jun 23, 2024

MartinMarciniszyn assigned Funatiq Jun 24, 2024

nv-guomingz added the Merged label Jun 25, 2024

kaiyux mentioned this pull request Jul 4, 2024

Update TensorRT-LLM #1891

Merged

nv-guomingz closed this Jul 4, 2024

kaiyux mentioned this pull request Jul 17, 2024

TensorRT-LLM v0.11 Update #1969

Merged
