auto-assign

Actions

auto-assign

Actions

Loading...
Loading

auto-assign.yml

101 workflow runs

EAGLE model seems to be deployed but raises an error on inference auto-assign #104: Issue #2673 labeled by nuxlear

January 9, 2025 02:01

Prompt formatting for different version of InternVL2 auto-assign #103: Issue #2672 labeled by nzarif

January 8, 2025 20:19

Runtime error in windows, help me auto-assign #102: Issue #2540 labeled by nv-guomingz

January 8, 2025 11:37

45s

January 8, 2025 11:37

45s

Runtime error in windows, help me auto-assign #101: Issue #2540 labeled by nv-guomingz

January 8, 2025 11:37

42s

January 8, 2025 11:37

42s

[Performance] KV cache reuse is slower when batch size > 1 auto-assign #100: Issue #2631 labeled by nv-guomingz

January 8, 2025 09:09

47s

January 8, 2025 09:09

47s

PyTorch Nightly support in Dockerfile auto-assign #99: Issue #2657 labeled by nv-guomingz

January 8, 2025 09:09

51s

January 8, 2025 09:09

51s

trtllm-serve without any output Qwne2.5-7b auto-assign #98: Issue #2667 labeled by nv-guomingz

January 8, 2025 07:38

44s

January 8, 2025 07:38

44s

trtllm-serve without any output Qwne2.5-7b auto-assign #97: Issue #2667 labeled by Justin-12138

January 8, 2025 03:45

Failed to build engine with lookahead_decoding auto-assign #96: Issue #2641 labeled by nv-guomingz

January 7, 2025 14:49

44s

January 7, 2025 14:49

44s

fp8 quantization for CohereForCausalLM auto-assign #95: Issue #2666 labeled by nv-guomingz

January 7, 2025 14:37

47s

January 7, 2025 14:37

47s

torch.cuda.DeferredCudaCallError: CUDA call failed lazily at initialization with error: 'NoneType' object is not iterable auto-assign #94: Issue #2652 labeled by nv-guomingz

January 7, 2025 14:36

QTIP Quantization Support? auto-assign #93: Issue #2663 labeled by nv-guomingz

January 7, 2025 14:32

50s

January 7, 2025 14:32

50s

Doc llm-api reference page empty? auto-assign #92: Issue #2665 labeled by nv-guomingz

January 7, 2025 14:30

47s

January 7, 2025 14:30

47s

What are supported low-bit (int8/fp8/int4) data types in MLP and Attention layers? auto-assign #91: Issue #2664 labeled by nv-guomingz

January 7, 2025 14:29

52s

January 7, 2025 14:29

52s

Doc llm-api reference page empty? auto-assign #90: Issue #2665 labeled by BugFreeee

January 7, 2025 02:37

Error with LoRA Weights Data Type in Quantized TensorRT-LLM Model Execution auto-assign #89: Issue #2628 labeled by nv-guomingz

January 6, 2025 14:41

46s

January 6, 2025 14:41

46s

Cpp runner outputs wrong results when using lora + tensor parallelism auto-assign #88: Issue #2634 labeled by nv-guomingz

January 6, 2025 14:40

45s

January 6, 2025 14:40

45s

setuptools conflict auto-assign #87: Issue #2655 labeled by nv-guomingz

January 6, 2025 14:37

44s

January 6, 2025 14:37

44s

setuptools conflict auto-assign #86: Issue #2655 labeled by nv-guomingz

January 6, 2025 14:37

No module named 'tensorrt_llm.bindings'` error message auto-assign #85: Issue #2656 labeled by nv-guomingz

January 6, 2025 13:38

Segmentation fault crash: Tensorrt-LLM crash when using guided decoding xgrammar and kv cache reuse auto-assign #84: Issue #2660 labeled by Somasundaram-Palaniappan

January 6, 2025 08:22

Which version of InternVL does TensorRT-llm 1.5 support ? auto-assign #83: Issue #2578 labeled by nv-guomingz

January 6, 2025 05:34

42s

January 6, 2025 05:34

42s

gemma 2 convert_checkpoint takes gpu ram more than needed auto-assign #82: Issue #2647 labeled by nv-guomingz

January 6, 2025 03:05

46s

January 6, 2025 03:05

46s

Qwen2 VL cannot be convert to checkpoint on TensorRT-LLM auto-assign #81: Issue #2658 labeled by nv-guomingz

January 6, 2025 03:02

47s

January 6, 2025 03:02

47s

[QST] why the implementation of f16xs8 mixed gemm is different between TRT-LLM and native cutlass mixed gemm example? auto-assign #80: Issue #2659 labeled by nv-guomingz

January 6, 2025 02:59

48s

January 6, 2025 02:59

48s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

auto-assign

Actions

Loading...
Loading

Create status badge

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: NVIDIA/TensorRT-LLM

Actions

auto-assign auto-assign Actions Loading... Loading Sorry, something went wrong.

auto-assign

auto-assign

Actions

Loading...
Loading