bug: Gemma 3 Models are using CPU

### Jan version

0.6.0

### Describe the Bug

After the update, I tried loading the Gemma 3 models and noticed much slower token generation. They run on the CPU instead of the GPU. Other models load and run fine. This happens with both the 4b and 12b versions; I didn't try the others. GPU Layers is set to 100.

### Steps to Reproduce

Download the Gemma 3 models, load one, and ask it to generate a test message.

### Screenshots / Logs

![Image](https://github.com/user-attachments/assets/921d5d85-ab0f-4573-aa0f-a2b12dee2479)

### What is your OS?

- [ ] MacOS
- [x] Windows
- [ ] Linux

bug: Gemma 3 Models are using CPU #5376
