
convert_checkpoint qwen1.5 error #1675

Closed
diandianliu opened this issue May 25, 2024 · 2 comments
Labels: bug, Merged, triaged

@diandianliu

Hi, I am facing an error when trying to run convert_checkpoint.py for Qwen1.5.

**model:**
https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat
**error:**
python convert_checkpoint.py --qwen_type qwen2 --model_dir /workspace/triton/models/qwen/Qwen1.5-0.5B-Chat/ --output_dir /workspace/triton/models/qwen/trt_ckpt_Qwen1.5-0.5B-Chat_fp16_1gpu
[TensorRT-LLM] TensorRT-LLM version: 0.11.0.dev2024052100
0.11.0.dev2024052100
Traceback (most recent call last):
  File "/workspace/triton/tensorrtllm_backend/tensorrt_llm/examples/qwen/convert_checkpoint.py", line 369, in <module>
    main()
  File "/workspace/triton/tensorrtllm_backend/tensorrt_llm/examples/qwen/convert_checkpoint.py", line 361, in main
    convert_and_save_hf(args)
  File "/workspace/triton/tensorrtllm_backend/tensorrt_llm/examples/qwen/convert_checkpoint.py", line 323, in convert_and_save_hf
    execute(args.workers, [convert_and_save_rank] * world_size, args)
  File "/workspace/triton/tensorrtllm_backend/tensorrt_llm/examples/qwen/convert_checkpoint.py", line 329, in execute
    f(args, rank)
  File "/workspace/triton/tensorrtllm_backend/tensorrt_llm/examples/qwen/convert_checkpoint.py", line 309, in convert_and_save_rank
    qwen = from_hugging_face(
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/qwen/convert.py", line 1087, in from_hugging_face
    weights = load_weights_from_hf(config=config,
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/qwen/convert.py", line 1193, in load_weights_from_hf
    weights = convert_hf_qwen(
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/qwen/convert.py", line 931, in convert_hf_qwen
    lm_head_weights = get_weight(model_params, 'lm_head', dtype)
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/qwen/convert.py", line 455, in get_weight
    if config[prefix + '.weight'].dtype != dtype:
KeyError: 'lm_head.weight'
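For context, `lm_head.weight` is absent from this checkpoint because Qwen1.5-0.5B-Chat ties the LM head to the input embeddings, so the shared tensor is stored only once. A quick way to confirm that from Python (a minimal sketch using the Hugging Face transformers API; whether the converter builds its weight dict exactly like this is an assumption):

```python
from transformers import AutoConfig, AutoModelForCausalLM

# When tie_word_embeddings is True, the LM head reuses the input
# embedding matrix instead of carrying its own weight tensor.
cfg = AutoConfig.from_pretrained("Qwen/Qwen1.5-0.5B-Chat")
print(cfg.tie_word_embeddings)  # True for this model

# named_parameters() deduplicates shared tensors, so only the embedding
# key survives; a converter walking such a dict hits the KeyError above.
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen1.5-0.5B-Chat")
params = dict(model.named_parameters())
print("lm_head.weight" in params)             # False
print("model.embed_tokens.weight" in params)  # True
```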

byshiue (Collaborator) commented May 29, 2024

Thank you for the report. This is a bug in the Qwen conversion when tie_word_embeddings is True. We will fix it in the next update.
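For anyone hitting this before the fix ships, the usual workaround for tied embeddings is to fall back to the embedding matrix when `lm_head.weight` is absent. A hedged sketch of what such a patch around `convert.py` line 931 could look like (`hf_config` and the fallback key are assumptions drawn from the traceback and the Qwen2 HF layout, not the actual merged fix):

```python
# Hypothetical sketch, not the merged fix: when the HF config ties the
# word embeddings, reuse the embedding matrix as the LM head weight
# instead of looking up the missing 'lm_head.weight' key.
if getattr(hf_config, 'tie_word_embeddings', False):
    # Qwen2ForCausalLM stores the shared tensor under model.embed_tokens
    lm_head_weights = get_weight(model_params, 'model.embed_tokens', dtype)
else:
    lm_head_weights = get_weight(model_params, 'lm_head', dtype)
```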

byshiue self-assigned this May 29, 2024
byshiue added the bug and triaged labels May 29, 2024
nv-guomingz (Collaborator) commented

Hi @diandianliu, thanks for your contribution. We've merged it into the code base and will add you to the contributor list.
