Mystery company accidentally blew $500 million on Claude in a single month — failed to put usage limit on licenses for employees

Sahwa@reddthat.com · 22 days ago

Mystery company accidentally blew $500 million on Claude in a single month — failed to put usage limit on licenses for employees

perviouslyiner@lemmy.world · 21 days ago

does it give the full history to the LLM each time?

Last time I tried implementing something like this, it suggested to have a rolling window of history so that it takes into account your last X messages but not the entire conversation.

(I guess this is what ollama calls “context length”?)

Sabata@ani.social · edit-2 21 days ago

You send the entire history for that conversation every time and likely more if its getting info from tools. If its not in the context the model dose not see it unless you have a memory system that dose something like feeding in summaries of past conversations that also takes up tokens and context. Rolling drops old messages to not reach context limits but you can lose important info or get odd results. If the history gets bigger than the context things break or slow way down.

perviouslyiner@lemmy.world · 21 days ago

presumably this is why Claude periodically writes its conclusions so far into a text file that it can read later instead of having to remember everything. Sounds like an interesting approach.

percent@infosec.pub · 21 days ago

Most agent harnesses do something called “compaction.” For example, here’s how Pi does compaction

BlackLaZoR@lemmy.world · 21 days ago

does it give the full history to the LLM each time?

It’s limited to the context size supported by given model. You can give the model 100k tokens of history but if it’s configured for less, it will just truncate it before processing (usually by removing oldest tokens first)

boonhet@sopuli.xyz · 20 days ago

Good news for tokenmaxxers: frontier models now have 1M context

Easy way to blow through ridiculous amounts of tokens.

Mystery company accidentally blew $500 million on Claude in a single month — failed to put usage limit on licenses for employees

Mystery company accidentally blew $500 million on Claude in a single month — failed to put usage limit on licenses for employees

Mystery company accidentally blew $500 million on Claude AI in a single month — failed to put usage limit on licenses for employees