
Calculated tokens much higher than actual #6

Closed
Qarj opened this issue May 12, 2023 · 10 comments · Fixed by #8

Comments

Qarj commented May 12, 2023

Thanks for this. I've noticed a weird issue, though, with both this library and the official code from OpenAI that I found a while back, before GPT-4 came out.

What's happening is that the token count calculated by this tool is much higher than what the OpenAI API reports in the completion. For example, a prompt I just submitted to GPT-4 was calculated as 7810 tokens by this library, but the completion from OpenAI said my prompt had 5423 tokens. Have you noticed something similar? The prompt I'm submitting is primarily Node.js code.
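Roughly how I'm comparing the two counts (a minimal sketch: it assumes the library exposes an `encode` export and calls the chat completions endpoint directly, so the names here are illustrative):

```ts
// Sketch: compare the library's local token count with what the API reports.
// Assumes an `encode(text: string): number[]` export from the library.
import { encode } from 'gpt-tokenizer'

async function compareTokenCounts(prompt: string) {
  const localCount = encode(prompt).length
  console.log('library token count:', localCount)

  const response = await fetch('https://api.openai.com/v1/chat/completions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
    },
    body: JSON.stringify({
      model: 'gpt-4',
      messages: [{ role: 'user', content: prompt }],
    }),
  })
  const completion = await response.json()
  // usage.prompt_tokens is the count OpenAI actually billed for the prompt
  console.log('API prompt_tokens:', completion.usage.prompt_tokens)
}
```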

Qarj (Author) commented May 14, 2023

As a workaround: I've noticed that when you request too many tokens, you get a 400 error back very quickly, for example:

This model's maximum context length is 8192 tokens. However, you requested 13674 tokens (7469 in the messages, 6205 in the completion). Please reduce the length of the messages or completion.

So I parse the messages token count out of that error and resubmit with a max_tokens calculated as follows: 8192 - 7469 - 1.
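Roughly what the workaround looks like (a sketch; the regex and helper names are illustrative, and the error-message wording isn't a guaranteed contract):

```ts
// Sketch of the workaround: read the prompt token count out of the 400 error
// and retry with an adjusted max_tokens.
const CONTEXT_LIMIT = 8192 // gpt-4

function maxTokensFromError(errorMessage: string): number | undefined {
  // e.g. "... you requested 13674 tokens (7469 in the messages, 6205 in the completion) ..."
  const match = errorMessage.match(/\((\d+) in the messages/)
  if (!match) return undefined
  const messageTokens = Number(match[1])
  return CONTEXT_LIMIT - messageTokens - 1
}

// usage (illustrative):
// const maxTokens = maxTokensFromError(apiError.message)
// if (maxTokens !== undefined) retry({ ...params, max_tokens: maxTokens })
```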

niieani (Owner) commented May 21, 2023

Hi @Qarj! Thanks for flagging this problem.
As @ricardomatias noticed in #5, the tokenizer is using the r50k_base encoding, which isn't the one GPT-4 uses; hence the discrepancy in token counts. I'm working on v2, which will let you choose which encoding to use, so it will tokenize correctly for GPT-4 specifically.
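A rough sketch of what choosing the encoding could look like once v2 lands (the import path below is an illustrative assumption, not the released API; check the v2 README for the real entry points):

```ts
// Illustrative only: a hypothetical v2-style model-specific import.
// GPT-4 and GPT-3.5 use the cl100k_base encoding rather than r50k_base.
import { encode as encodeForGpt4 } from 'gpt-tokenizer/model/gpt-4'

const tokens = encodeForGpt4('some Node.js source code here')
console.log(tokens.length) // should now line up with what the API reports for GPT-4
```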

Qarj (Author) commented May 22, 2023

Thanks very much for addressing this! I will definitely use this feature in v2 when it is out.

niieani added a commit that referenced this issue May 23, 2023
BREAKING CHANGE: default encoder is now GPT3.5 / GPT4

fixes #5
fixes #6
@github-actions

🎉 This issue has been resolved in version 2.0.0-beta.1 🎉

The release is available on:

Your semantic-release bot 📦🚀

Qarj (Author) commented May 23, 2023

Thanks very much for this! Am using it already :)

Qarj (Author) commented May 23, 2023

It seems much closer to the actual token count now: in a test I did, the prompt was calculated as 998 tokens by the library but 1003 tokens by OpenAI. I suspect that if we allow a 50-token margin, our completion token requests should always be within the limit.
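Something like this is what I have in mind for the margin (a sketch; the numbers are illustrative):

```ts
// Sketch: leave headroom for the small difference between the library's
// estimate and OpenAI's actual count when choosing max_tokens.
import { encode } from 'gpt-tokenizer'

const CONTEXT_LIMIT = 8192 // gpt-4
const SAFETY_MARGIN = 50   // covers the small discrepancy with room to spare

function maxCompletionTokens(prompt: string): number {
  const estimatedPromptTokens = encode(prompt).length
  return CONTEXT_LIMIT - estimatedPromptTokens - SAFETY_MARGIN
}
```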

niieani (Owner) commented May 24, 2023

Interesting. I wonder if OpenAI adds 5 extra tokens to each request? The algorithm should be exactly the same as OpenAI's.

Thanks for investigating.

@github-actions

🎉 This issue has been resolved in version 2.0.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

niieani (Owner) commented Jun 3, 2023

@Qarj I've added the new encodeChat function, which should return correct token counts for chats!
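Usage looks roughly like this (a sketch; check the README for the exact signature):

```ts
// Sketch: counting tokens for a whole chat with the new encodeChat export.
import { encodeChat } from 'gpt-tokenizer'

const chat = [
  { role: 'system', content: 'You are a helpful assistant.' },
  { role: 'user', content: 'How many tokens is this conversation?' },
]

const tokens = encodeChat(chat, 'gpt-4')
console.log('chat token count:', tokens.length)
```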

Qarj (Author) commented Jun 3, 2023

Thanks very much for this! :)
