Add support for batched decoding api #795

abetlen · 2023-10-05T20:39:48Z

llama.cpp recently moved to a new api which supports batching (both for single sequences with multiple outputs and multiple seperate streams) and streaming support. This new api based on llama_decode supercedes the now deprecated llama_eval api. This means that the current api should be migrated anyways regardless of the new features but we'll see how easy it is to implement along the way.

flexorRegev · 2023-10-18T16:17:45Z

is there a way to help with this one?

zpzheng · 2023-10-26T07:12:01Z

Is this feature live yet？Why can't I support batch tasks locally？

It should behave like llama.cpp, where most out of the box usages treat special characters accordingly

Add low-level batching notebook

95f3c15

antoine-lizee and others added 9 commits November 1, 2023 21:29

fix: tokenization of special characters: (#850)

47ca05a

It should behave like llama.cpp, where most out of the box usages treat special characters accordingly

Merge branch 'main' of github.com:abetlen/llama_cpp_python into main

a9fe204

Update CHANGELOG

addc2f6

Cleanup

3e180d7

Fix runner label

f0d1a1b

Merge branch 'main' into add-support-for-llama-batch

7ff8508

Update notebook

753dfbc

Use llama_decode and batch api

5617825

Support logits_all parameter

9f32e8e

abetlen force-pushed the main branch from f902f59 to fa83cc5 Compare November 2, 2023 18:28

Merge branch 'main' into add-support-for-llama-batch

86db11e

abetlen marked this pull request as ready for review November 3, 2023 00:12

abetlen merged commit ab028cb into main Nov 3, 2023

tk-master mentioned this pull request Nov 8, 2023

Broken generate after Add support for batched decoding #888

Closed

abetlen deleted the add-support-for-llama-batch branch November 14, 2023 20:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for batched decoding api #795

Add support for batched decoding api #795

Uh oh!

abetlen commented Oct 5, 2023 •

edited

Loading

Uh oh!

flexorRegev commented Oct 18, 2023

Uh oh!

zpzheng commented Oct 26, 2023

Uh oh!

Uh oh!

Add support for batched decoding api #795

Add support for batched decoding api #795

Uh oh!

Conversation

abetlen commented Oct 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

flexorRegev commented Oct 18, 2023

Uh oh!

zpzheng commented Oct 26, 2023

Uh oh!

Uh oh!

abetlen commented Oct 5, 2023 •

edited

Loading