bug: SSD is inaccessible while a model loads for the first time (TensorRT conversion?) #4303

Open
1 of 3 tasks
kalle07 opened this issue Dec 19, 2024 · 8 comments
Assignees
Labels
category: hardware type: bug Something isn't working

Comments

@kalle07

kalle07 commented Dec 19, 2024

Jan version

0.5.11

Describe the Bug

When I load a model for the first time, almost no CPU or GPU is used and the load takes ~120 sec, with my SSD at 100% read/write.
I've tested 10 models and they run fine in GPT4All (also ~120 sec the first time), but there I can still access my SSD while the model is loading.

When I load the model with Jan it also takes ~120 sec (first time), but in the meantime I can't access my SSD!
By the way, what is happening for that long? I can copy 10 GB on the same SSD in 20 sec.
I think the model is being converted to TensorRT (which I have installed). At the very least, you could show a "please wait until the conversion finishes" message.
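
To put the numbers above side by side, here is a minimal sketch of the arithmetic. It only uses the figures quoted in this report (10 GB copied in 20 sec, ~120 sec load time); the function names are illustrative, and it assumes decimal units (1 GB = 1000 MB). It does not say what Jan actually does during the load.

```python
def throughput_mb_per_s(size_gb: float, seconds: float) -> float:
    """Average throughput in MB/s for moving size_gb in the given time."""
    return size_gb * 1000 / seconds


def data_moved_gb(throughput_mb_s: float, seconds: float) -> float:
    """How many GB would be moved at a sustained throughput over the given time."""
    return throughput_mb_s * seconds / 1000


# Figures quoted in the report: a 10 GB copy takes 20 sec on this SSD.
copy_rate = throughput_mb_per_s(10, 20)

# If the disk really ran at that rate for the whole ~120 sec load,
# this is how much data would have been read or written.
implied_gb = data_moved_gb(copy_rate, 120)

print(f"copy rate: {copy_rate:.0f} MB/s, implied data over 120 sec: {implied_gb:.0f} GB")
```

If the model file is much smaller than the implied figure, the disk is probably not doing a single sequential read of the file during those 120 sec, which is what makes the long load surprising.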

Steps to Reproduce

No response

Screenshots / Logs

No response

What is your OS?

  • MacOS
  • Windows
  • Linux
@kalle07 kalle07 added the type: bug Something isn't working label Dec 19, 2024
@github-project-automation github-project-automation bot moved this to Investigating in Jan & Cortex Dec 19, 2024
@louis-jan louis-jan self-assigned this Dec 20, 2024
@louis-jan
Contributor

Hi @kalle07. What model are you using?
Could you please upload the log files? We will then investigate.

@kalle07
Author

kalle07 commented Dec 20, 2024

Which log, and where can I find it?

Even here there is no info about where the path is:

(screenshot)

I mean, I would like to help you, but ^^

@louis-jan
Contributor

louis-jan commented Dec 20, 2024

Hi @kalle07, thanks for the help. You can click the System Monitor (bottom-right corner) to see the Open App Log button.
CleanShot 2024-12-21 at 00 24 29

@kalle07
Author

kalle07 commented Dec 20, 2024

The last one needs 3 min to load.

And all in all, the models talk far too much; the default model settings are not that good ^^

cortex.log

@louis-jan
Contributor

louis-jan commented Dec 23, 2024

Hi @kalle07 Could you please try Llama 3.2 1B or 3B to see if the models run fine? Also, would you mind sharing your computer specs here? Like:

  • CPU
  • GPU
  • RAM
  • SSD model

@kalle07
Copy link
Author

kalle07 commented Dec 23, 2024

That seems OK ...

log-txt.txt

But
https://huggingface.co/bartowski/Llama-3.1-SauerkrautLM-8b-Instruct-GGUF/tree/main
SauerkrautLM-Llama-3.1-8b-Instruct.Q5_K_M.gguf
needs 3 min to load.
log2.txt

But if I exit Jan and start it again, the same model loads fast (~5 sec).

Intel 8-core, 4 GHz
rtx4660
64 GB RAM
SSD (standard 500 MB/s, not a fast NVMe)
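
One way to separate raw disk speed from whatever else happens during the first load is to time a plain sequential read of the model file. The sketch below is only an illustration, not something Jan or Cortex provides: the function names and chunk size are made up, and the file path is a placeholder you would replace yourself. Note that a second run may be served from the operating system's file cache, which could also be why the reload after restarting Jan takes only ~5 sec; that is a guess, not something confirmed in this thread.

```python
import time


def time_sequential_read(path, chunk_size=8 * 1024 * 1024):
    """Read the whole file in chunks; return (total_bytes, elapsed_seconds)."""
    total = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:  # empty bytes object means end of file
                break
            total += len(chunk)
    elapsed = time.perf_counter() - start
    return total, elapsed


def format_report(total_bytes, elapsed):
    """Format size, time, and throughput using decimal megabytes."""
    mb = total_bytes / 1_000_000
    rate = mb / elapsed if elapsed > 0 else float("inf")
    return f"{mb:.1f} MB in {elapsed:.2f} s ({rate:.1f} MB/s)"


# Usage (placeholder path, replace with the real location of the .gguf file):
# print(format_report(*time_sequential_read("path/to/model.gguf")))
```

If this read finishes in a time close to file size divided by 500 MB/s while the first load in Jan still takes 3 min, the extra time is likely spent on something other than reading the file once.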

@louis-jan
Contributor

louis-jan commented Dec 30, 2024

@kalle07 Is it possible that other processes are also using the SSD? There is also inference running in the background. Does the Activity Monitor report app usage at 100%?

@kalle07
Author

kalle07 commented Dec 30, 2024

No, it's repeatable ... and there is NO GPU or CPU usage during these 3 min.

Projects
Status: Investigating
Development

No branches or pull requests

4 participants