Closed
Labels
good first issue, help wanted, model
Description
Support is almost complete. There is a dangling issue with the pre-tokenizer: #7036
A useful discussion related to that is here: #7144
Outdated below
Creating this issue for more visibility.
The main problem is around tokenization support, since the models use some variation of the BPE pre-processing regex. There are also some issues with the conversion scripts.
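To illustrate why the pre-processing regex matters, here is a minimal sketch comparing two pre-tokenizer patterns on the same input. Both patterns are assumptions made for illustration: the first approximates the GPT-2-style split using only the standard `re` module (which lacks `\p{L}` / `\p{N}`), and the second is a hypothetical variant that chunks digits in runs of at most three. Neither is claimed to be the exact regex of any particular model.

```python
import re

# Stand-ins for Unicode property classes, since stdlib `re` has no \p{L} / \p{N}.
LETTER = r"[^\W\d_]"        # approximates \p{L}
DIGIT = r"\d"               # approximates \p{N}
OTHER = r"(?:[^\s\w]|_)"    # anything that is not whitespace, letter, or digit

# Approximation of a GPT-2-style pre-tokenizer pattern.
GPT2_STYLE = re.compile(
    rf"'s|'t|'re|'ve|'m|'ll|'d| ?{LETTER}+| ?{DIGIT}+| ?{OTHER}+|\s+(?!\S)|\s+"
)

# Hypothetical variant: digits are split into runs of at most 3,
# and a leading space is no longer attached to a number.
DIGIT_CHUNK_STYLE = re.compile(
    rf"'s|'t|'re|'ve|'m|'ll|'d| ?{LETTER}+|{DIGIT}{{1,3}}| ?{OTHER}+|\s+(?!\S)|\s+"
)


def pre_tokenize(pattern, text):
    """Split text into the chunks that BPE merges would later be applied to."""
    return pattern.findall(text)


text = "It's 12345 tokens!"
gpt2_chunks = pre_tokenize(GPT2_STYLE, text)
variant_chunks = pre_tokenize(DIGIT_CHUNK_STYLE, text)

print(gpt2_chunks)     # ['It', "'s", ' 12345', ' tokens', '!']
print(variant_chunks)  # ['It', "'s", ' ', '123', '45', ' tokens', '!']
```

Because BPE merges never cross chunk boundaries, the two patterns yield different token sequences for the same text even with an identical vocabulary, which is why each model's regex variation has to be matched.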
Anyway, looking for contributions to help with this.
Previous unfinished work:
Possible implementation plan: #5464 (comment)