Scorpil@discuss.tchncs.de to ChatGPT@lemmy.worldEnglish · 1 year agoUnderstanding Generative AI: Part One - Tokenizerscorpil.comexternal-linkmessage-square2fedilinkarrow-up115arrow-down14
arrow-up111arrow-down1external-linkUnderstanding Generative AI: Part One - Tokenizerscorpil.comScorpil@discuss.tchncs.de to ChatGPT@lemmy.worldEnglish · 1 year agomessage-square2fedilink
minus-squareStereoTrespasser@lemmy.worldlinkfedilinkarrow-up2·1 year agoThis is really helpful, thanks. I followed up until the part about token IDs. Token text, even in binary, is of variable length, making it hard to work with, but token IDs are just numbers. How and why is token text converted to an ID? Is the ID for a specific word always the same?
minus-squaremozzribo@leminal.spacelinkfedilinkarrow-up1·1 year agoWhat are tokens and how to count them?
This is really helpful, thanks. I followed up until the part about token IDs.
How and why is token text converted to an ID? Is the ID for a specific word always the same?
What are tokens and how to count them?