Skip to content

Actions: NVIDIA/TensorRT-LLM

auto-assign

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
101 workflow runs
101 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

EAGLE model seems to be deployed but raises an error on inference
auto-assign #104: Issue #2673 labeled by nuxlear
January 9, 2025 02:01 2s
January 9, 2025 02:01 2s
Prompt formatting for different version of InternVL2
auto-assign #103: Issue #2672 labeled by nzarif
January 8, 2025 20:19 3s
January 8, 2025 20:19 3s
Runtime error in windows, help me
auto-assign #102: Issue #2540 labeled by nv-guomingz
January 8, 2025 11:37 45s
January 8, 2025 11:37 45s
Runtime error in windows, help me
auto-assign #101: Issue #2540 labeled by nv-guomingz
January 8, 2025 11:37 42s
January 8, 2025 11:37 42s
[Performance] KV cache reuse is slower when batch size > 1
auto-assign #100: Issue #2631 labeled by nv-guomingz
January 8, 2025 09:09 47s
January 8, 2025 09:09 47s
PyTorch Nightly support in Dockerfile
auto-assign #99: Issue #2657 labeled by nv-guomingz
January 8, 2025 09:09 51s
January 8, 2025 09:09 51s
trtllm-serve without any output Qwne2.5-7b
auto-assign #98: Issue #2667 labeled by nv-guomingz
January 8, 2025 07:38 44s
January 8, 2025 07:38 44s
trtllm-serve without any output Qwne2.5-7b
auto-assign #97: Issue #2667 labeled by Justin-12138
January 8, 2025 03:45 2s
January 8, 2025 03:45 2s
Failed to build engine with lookahead_decoding
auto-assign #96: Issue #2641 labeled by nv-guomingz
January 7, 2025 14:49 44s
January 7, 2025 14:49 44s
fp8 quantization for CohereForCausalLM
auto-assign #95: Issue #2666 labeled by nv-guomingz
January 7, 2025 14:37 47s
January 7, 2025 14:37 47s
QTIP Quantization Support?
auto-assign #93: Issue #2663 labeled by nv-guomingz
January 7, 2025 14:32 50s
January 7, 2025 14:32 50s
Doc llm-api reference page empty?
auto-assign #92: Issue #2665 labeled by nv-guomingz
January 7, 2025 14:30 47s
January 7, 2025 14:30 47s
January 7, 2025 14:29 52s
Doc llm-api reference page empty?
auto-assign #90: Issue #2665 labeled by BugFreeee
January 7, 2025 02:37 2s
January 7, 2025 02:37 2s
Error with LoRA Weights Data Type in Quantized TensorRT-LLM Model Execution
auto-assign #89: Issue #2628 labeled by nv-guomingz
January 6, 2025 14:41 46s
January 6, 2025 14:41 46s
Cpp runner outputs wrong results when using lora + tensor parallelism
auto-assign #88: Issue #2634 labeled by nv-guomingz
January 6, 2025 14:40 45s
January 6, 2025 14:40 45s
setuptools conflict
auto-assign #87: Issue #2655 labeled by nv-guomingz
January 6, 2025 14:37 44s
January 6, 2025 14:37 44s
setuptools conflict
auto-assign #86: Issue #2655 labeled by nv-guomingz
January 6, 2025 14:37 3s
January 6, 2025 14:37 3s
No module named 'tensorrt_llm.bindings'` error message
auto-assign #85: Issue #2656 labeled by nv-guomingz
January 6, 2025 13:38 2s
January 6, 2025 13:38 2s
Which version of InternVL does TensorRT-llm 1.5 support ?
auto-assign #83: Issue #2578 labeled by nv-guomingz
January 6, 2025 05:34 42s
January 6, 2025 05:34 42s
gemma 2 convert_checkpoint takes gpu ram more than needed
auto-assign #82: Issue #2647 labeled by nv-guomingz
January 6, 2025 03:05 46s
January 6, 2025 03:05 46s
Qwen2 VL cannot be convert to checkpoint on TensorRT-LLM
auto-assign #81: Issue #2658 labeled by nv-guomingz
January 6, 2025 03:02 47s
January 6, 2025 03:02 47s