-
Notifications
You must be signed in to change notification settings - Fork 1
Closed
Labels
featureNew feature or requestNew feature or request
Description
ggml-org/llama.cpp#1126 introduced some new ones. Right now, we use repetition penalty. It does a decent job of avoiding repeated content for a while, but it's certainly not perfect. For example, a large window penalizes a lot of punctuation and causes run-on sentences. We can already change this by excluding tokens from the penalized list, but it's a balancing act that I'm not very good at.
First order of business is to get repeat_penalty working with the new API.
Metadata
Metadata
Assignees
Labels
featureNew feature or requestNew feature or request