Description:
Steps to reproduce this issue
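# The LLM import is not shown in the original report; it is assumed here.
from tensorrt_llm import LLM
import tensorrt_llm.bindings.executor as trtllm

# Enable KV cache block reuse together with the xgrammar guided decoding backend.
kv_cache_config = trtllm.KvCacheConfig(enable_block_reuse=True)
llm = LLM(
    model="/trt_engines/a100/llama/tp1/compiled-model/",
    tokenizer="/trt_engines/a100/llama/tp1/tokenizer",
    guided_decoding_backend='xgrammar',
    kv_cache_config=kv_cache_config,
)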
Expected behavior
No crash
Actual behavior
Segmentation fault crash.
Error
[baf3afa81c47:1647813] Signal: Segmentation fault (11)
[baf3afa81c47:1647813] Signal code: Address not mapped (1)
[baf3afa81c47:1647813] Failing at address: (nil)
[baf3afa81c47:1647813] [ 0] /usr/lib/x86_64-linux-gnu/libc.so.6(+0x42520)[0x7f8bcffdf520]
[baf3afa81c47:1647813] [ 1] /tensorrt-0.16/lib/python3.10/site-packages/tensorrt_llm/libs/libtensorrt_llm.so(_ZN8xgrammar14GrammarMatcher20FillNextTokenBitmaskEP8DLTensori+0x0)[0x7f89d845f220]
[baf3afa81c47:1647813] [ 2] /tensorrt-0.16/lib/python3.10/site-packages/tensorrt_llm/libs/libtensorrt_llm.so(_ZN12tensorrt_llm13batch_manager13GuidedDecoder5buildERKNS0_17ScheduledRequestsE+0x296)[0x7f89d8391fb6]
[baf3afa81c47:1647813] [ 3] /tensorrt-0.16/lib/python3.10/site-packages/tensorrt_llm/libs/libtensorrt_llm.so(_ZN12tensorrt_llm13batch_manager27TrtGptModelInflightBatching12forwardAsyncERKSt4listISt10shared_ptrINS0_10LlmRequestEESaIS5_EE+0x6bf)[0x7f89d83f929f]
[baf3afa81c47:1647813] [ 4] /tensorrt-0.16/lib/python3.10/site-packages/tensorrt_llm/libs/libtensorrt_llm.so(_ZN12tensorrt_llm8executor8Executor4Impl12forwardAsyncERSt4listISt10shared_ptrINS_13batch_manager10LlmRequestEESaIS7_EE+0x1e6)[0x7f89d848e746]
[baf3afa81c47:1647813] [ 5] /tensorrt-0.16/lib/python3.10/site-packages/tensorrt_llm/libs/libtensorrt_llm.so(_ZN12tensorrt_llm8executor8Executor4Impl13executionLoopEv+0x501)[0x7f89d8495331]
[baf3afa81c47:1647813] [ 6] /tensorrt-0.16/lib/python3.10/site-packages/torch/lib/libtorch.so(+0x145c0)[0x7f8bc80ad5c0]
[baf3afa81c47:1647813] [ 7] /usr/lib/x86_64-linux-gnu/libc.so.6(+0x94ac3)[0x7f8bd0031ac3]
[baf3afa81c47:1647813] [ 8] /usr/lib/x86_64-linux-gnu/libc.so.6(clone+0x44)[0x7f8bd00c2bf4]
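For anyone triaging the trace above, a small sketch like the following can pull the mangled symbol and offset out of each frame so they can be passed to a demangler. The regular expression and the `parse_frames` helper are illustrative assumptions, not part of TensorRT-LLM; they only assume the `[ N] library(symbol+offset)[address]` frame layout seen in this log.

```python
import re

# Matches frames of the form: [ N] /path/to/lib.so(SYMBOL+0xOFFSET)[0xADDRESS]
# SYMBOL may be empty, as in the libc and libtorch frames of the trace.
FRAME_RE = re.compile(
    r"\[\s*(?P<index>\d+)\]\s+(?P<lib>\S+?)"
    r"\((?P<symbol>[^()+]*)\+(?P<offset>0x[0-9a-f]+)\)"
    r"\[(?P<addr>0x[0-9a-f]+)\]"
)


def parse_frames(trace: str):
    """Return (index, library, symbol, offset) tuples for each frame found."""
    return [
        (int(m["index"]), m["lib"], m["symbol"], m["offset"])
        for m in FRAME_RE.finditer(trace)
    ]


# Frames 1 and 2 copied from the trace in this report.
sample = (
    "[baf3afa81c47:1647813] [ 1] /tensorrt-0.16/lib/python3.10/site-packages/"
    "tensorrt_llm/libs/libtensorrt_llm.so"
    "(_ZN8xgrammar14GrammarMatcher20FillNextTokenBitmaskEP8DLTensori+0x0)"
    "[0x7f89d845f220] "
    "[baf3afa81c47:1647813] [ 2] /tensorrt-0.16/lib/python3.10/site-packages/"
    "tensorrt_llm/libs/libtensorrt_llm.so"
    "(_ZN12tensorrt_llm13batch_manager13GuidedDecoder5buildERKNS0_17ScheduledRequestsE+0x296)"
    "[0x7f89d8391fb6]"
)

frames = parse_frames(sample)
for index, lib, symbol, offset in frames:
    # Print only the library file name to keep the output short.
    print(index, lib.rsplit("/", 1)[-1], symbol or "<no symbol>", offset)
```

The extracted symbols can then be piped through a demangler such as `c++filt`; frame 1, for example, demangles to `xgrammar::GrammarMatcher::FillNextTokenBitmask(DLTensor*, int)`.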
Hi @syuoni, would you please take a look at this xgrammar-related issue?
Thanks for reporting this issue. We also found this issue internally, and it's currently being fixed.