• 26 Posts
  • 330 Comments
Joined 6 years ago
Cake day: August 24th, 2019




  • New comment for clarity, crush is still crushing through the problem (hah) but it has found some cool stuff already. Their homepage calls json files which contain the models they offer, and we see there is at least GPT4.1 and Deepseek. No word yet on how they use these models exactly but since GPT is closed-source they have to be contacting the service and forwarding the requests, can’t run it yourself.

    Will update as it works through the tasks.

    edit: so we can only look at the JSON and JS files their homepage calls (don’t really feel like making an account with them) and there is some interesting info. Seems to simply be RAG from the way the service parses the data (there’s a file with the steps it will show to the user on the interface).

    No fine-tuning or running their own model and BECAUSE OPENAI IS PROPRIETARY, everything you send through the service ends up with OAI.



  • lmao

    edit: okay I’m trying to dig more into how they provide this service and the terms of service are pretty funny.

    Left Insight LLC does not return any results online (I might try to scan deeper later). They probably set it up to own the chatbot.

    There’s a subscription because of course there is. Greatest victory for the workers and socialism but also pay us to unlock this victory.

    Lengthy intellectual property bit. Can’t use the “chatbot” to compete against wsws was a good one.

    This doesn’t hold up in US court where they operate from. They can make licences but nobody owns the output of an LLM. Also more funny copyright bit. The trotskyist revolution will not be televised because the workers will not have paid the TV fees.

    I also wanted to look at their html to see if I could find more information there such as interface used or maybe even a comment but it’s a huge one-line block that I can’t lint. I threw deepseek at it, will report when it’s done (oh no that counts as reverse-engineering they will terminate my non-existent account with them!)




  • We’ve looked at integrating TTS natively onto our pages previously but it’s not great. Firefox had an API to use the rudimentary pre-installed voice packages but apparently they are deprecating it and it’s unclear whether it’ll work for everyone.

    And of course since we change the content sometimes, having a static TTS file means we need to recreate and reupload it every time we make an edit. Very costly if we start looking at AI TTS voices, unless we are able to start running some on our server locally.

    You could totally have a choice of voices such as Lenin, Stalin, anyone for whom we have voice recordings basically lol. And yes neural networks are especially good at TTS and even different languages, it’s one of the first things they perfected - it’s pretty surreal listening to yourself speak perfect Chinese or Danish in your own voice and shows that we are not so different after all from each other.


  • good idea in theory, sucks that trots were the first to get to it lol. I also wonder what they use under the hood, if it’s just a system prompt being injected into a commercial model or they fine-tuned a model or what. Dozens of different ways one could make this LLM with varying amounts of effort and resources needed for varying results. I doubt they trained an entire model from scratch because you still need more than the 250,000 WSWS articles they put in it (which is also just wild to think about, that they have 250k articles over 25 years. Do Trots only ever write stuff? Apparently so.)

    We have given out ProleWiki content (as txt files) to a dev to make it available for RAG on github, which is basically a compendium of data the LLM looks at before answering. We have no reason to paywall or restrict the use of our content, so it was only natural to start making these - with the understanding that they make the data available to everyone of course, so that it’s ready to use for anyone who wants to try their hand with it (over 5000 PW pages + library books we rehost). They’re currently working through it to format it for LLM use since you don’t just give it the raw text files, and previously they’d also worked on porting the MIA in the same way to a github repo (https://github.com/percy-raskova/marxists.org-rag-db). Oh they also said it was easier to work with the heavily-formatted mediawiki data over the plain html non marked-up data from MIA hah. Even our library books are formatted with clear reference tags and heading/section tags. They also contain metadata in a clearly formatted infobox (on purpose, not every wiki does this) which also allows the llm to natively understand the metadata it’s looking at.

    What could come out of it though I’m not sure. That is to say I’m not sure if the best thing you can do with a socialist llm is the novelty of “wow marx chatbot” like wsws seems to be doing. I could definitely see it being queried on much higher-level questions like presenting it with your current material conditions and asking what is the best party form for it based on theory, submitting your organizational ideas to it and having it challenge them with theory. But for that it needs to have a vast corpus of texts available, and definitely MIA + ProleWiki’s current library is far from enough. MIA has a lot of works but they don’t republish if it’s not fair use or from living authors. ProleWiki republishes everything, but we don’t have a lot of works compared to all that exists.

    edit: damn they’re just yapping in that press release. get to the point lol.
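
To make the "compendium of data the LLM looks at before answering" idea concrete, here is a toy sketch of the retrieval step in RAG. It is not how WSWS or the github project does it (real setups usually rank passages with embeddings, not word overlap), and the passages, function names and prompt wording are all made up for illustration.

```python
import re
from collections import Counter

# Placeholder passages standing in for a formatted corpus (hypothetical data).
CORPUS = [
    "Surplus value is the value produced by workers beyond what they are paid in wages.",
    "Virtual environments isolate Python packages per project.",
    "The Paris Commune of 1871 lasted a little over two months.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query, passage):
    """Toy relevance score: how many query words also occur in the passage."""
    q = Counter(tokenize(query))
    p = Counter(tokenize(passage))
    return sum(min(q[word], p[word]) for word in q)


def retrieve(query, passages, k=2):
    """Return up to k passages with a non-zero score, best first."""
    ranked = sorted(passages, key=lambda p: score(query, p), reverse=True)
    return [p for p in ranked[:k] if score(query, p) > 0]


def build_prompt(query, passages, k=2):
    """Paste the retrieved passages in front of the question before it goes to the model."""
    context = "\n\n".join(retrieve(query, passages, k))
    return (
        "Use only the context below to answer.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )
```

The model itself is untouched: the only thing that changes is the text you send it, which is why clean formatting and metadata in the source files matter so much.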


  • Python’s a huge mess, I mean: virtual environments started because just having the wrong libraries together on the system could break either one of them. On newer Linux distros they discourage you from installing packages with pip system-wide because the distros rely so much on python you could break something. So instead you do everything through venvs, which is a cool mess to sort out when you want something to work system-wide and it has to go through the venv.

    Or some libraries exist as apt packages but only like 10 of them it seems so it’s just easier to set up a venv anyway.

    And upgrades to python not being incremental meaning you need like 3 different versions of it installed because this one program needs 3.11 not 3.12, remembering which one to call, remembering which one needs to be the default system wide and making sure it stays that way…
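
For reference, the venv dance described above can be done from Python itself with the standard-library `venv` module. This is just a minimal sketch: the helper names and the target folder are my own, and it skips pip on purpose so it works even where the distro packages `ensurepip` separately.

```python
import sys
import venv
from pathlib import Path


def in_virtualenv():
    """True when the running interpreter belongs to a venv, not the system Python."""
    return sys.prefix != sys.base_prefix


def make_venv(env_dir):
    """Create a bare venv at env_dir and return the path to its pyvenv.cfg marker."""
    env_dir = Path(env_dir)
    # with_pip=False keeps this fast and avoids depending on ensurepip.
    venv.create(env_dir, with_pip=False)
    return env_dir / "pyvenv.cfg"
```

The venv is tied to whichever interpreter creates it, so the "this program needs 3.11 not 3.12" problem comes down to creating the venv with that version (e.g. `python3.11 -m venv .venv`) and then only ever calling the `python` inside it.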



  • CriticalResist8@lemmygrad.ml (M) to Memes@lemmygrad.ml: ai oopsie · 12 days ago (edited)

    I think this is the way yeah. For extra protection you can also do physical backups of the project (copy pastes) at various points, because even if the LLM doesn’t know you have gitted your project, it may still run the command. The newer deepseek is much more biased towards doing this, I wrote “commit your findings to a file” and it wanted to git it. There’s always the possibility it can squash all commits or erase them (much like someone can write rm -rf in any terminal!) but this is why we invented prod/dev redundancy and RAID backups lol. You don’t necessarily have to be this paranoid when using agentic AI but it’s an extra security and some peace of mind.

    I also checked and crush is completely able to write and run bash commands (incl. rm) on files not in the folder you opened it on. Definitely something to look into, I’ll check if there’s a way to containerize it better and make a post for [email protected]. Yog and I brainstormed the idea of making another linux user just for crush, then putting your main account in that user group along with the crush user, but not the crush user in your main account’s group. That way it only has perms to act on the files belonging to crush/crush, though it can still try to run any bash command it wants. And you would also have access to crush’s files with your main account so it’s more convenient. But I don’t know much yet about how linux users work, I’ll have to look into it and will make a post about it if I find something.

    I think crush also has config files you can edit to blacklist or auto deny some commands.
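
The copy-paste backups mentioned above are easy to script so you actually do them. A minimal sketch, assuming you just want a full timestamped copy of the project folder (`.git` included) somewhere outside the folder the agent is working in; the function name and folder layout are made up for illustration.

```python
import shutil
from datetime import datetime
from pathlib import Path


def snapshot(project_dir, backup_root):
    """Copy project_dir into a new timestamped folder under backup_root.

    backup_root should live outside project_dir, otherwise the copy
    would try to include its own backups.
    """
    project_dir = Path(project_dir)
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S-%f")
    dest = Path(backup_root) / f"{project_dir.name}-{stamp}"
    # copytree creates dest (and missing parents) and fails if dest already exists.
    shutil.copytree(project_dir, dest)
    return dest
```

Run it before handing the folder to the agent, e.g. `snapshot("myproject", "/somewhere/else/backups")`. It is only a plain copy, so it won't save you if the agent can also write to the backup location.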


  • CriticalResist8@lemmygrad.ml (M) to Memes@lemmygrad.ml: ai oopsie · 13 days ago

    The way these tools are being marketed by tech companies is completely wrong and prone to making disasters like this. It’s a tool; it’s like selling a fruit-only knife then leading customers into thinking it can only cut fruit and nothing else (until inevitably someone cuts themselves on it). I agree google has some responsibility there if this happened (his story seems a bit fishy tbh but that’s not really the point) and this is also why OSes bake some protective measures in such as user permissions. It’s also why everyone has been telling everyone to make backups for years even though nobody does it lol. 10 years ago steam introduced a bug that could wipe linux drives.

    I see from his video that anti-gravity obfuscates the chain-of-thought and the outputs - it’s a proprietary model so they don’t want to share that, but it makes troubleshooting impossible. He also had it set on ‘turbo’ mode which bypasses requesting permissions to run commands - there should be heavy discouragement to users doing that, including making them actually edit config files imo, it shouldn’t just be a nice-sounding toggle because then people think “turbo means it goes fast of course I want it to go fast”.

    They want to market agents as a do-everything app but it’s still software under the hood. And I don’t trust google to ship any good product anyway, but obviously that’s not how google markets itself. And of course you’re stuck with expensive google models if you use anti-gravity.

    People are also right that this should run in a container with no way to escape it, and even crush (the one I use) is not great about this - though it should be possible to containerize it yourself. Coming from a company like google this kind of stuff should come out of the box with the software and be set up for you. This is also one of the many reasons I switched away from Windows, the moment they announced integrated agentic AI I knew you would never be able to fully remove it.

    I can believe what happened is possible – if anything it serves as a PSA not to trust software blindly. When I was a kid the most hilarious thing you could do on the internet is tell someone to delete system32 so. From one of OP’s comments it seems the problem was the space in a folder name that windows parsed incorrectly because of the OS’s rmdir command? No way to tell for sure since gemini obfuscates the output, and of course that’s just what OP thinks the problem was.

    Someone tried to reproduce with more locked down perms and the output (pic) was just as concerning from anti-gravity. It said its “instructions” prevented it from running the command, when it should say “the agent prevents the command from being run” (and deepseek does say this in crush). I.e. this should be hard-coded but it seems to be passed to the LLM instead.

    And as much as it sucks, you live and learn. People have been accidentally wiping their drives for decades at this point, I’ve probably done it too before when I was younger. If anything software was better about preventing this sort of thing in the 2010s, the 2000s were wild lol they gave you access to buttons that could reformat everything without even a confirmation button or an explanation of what the button was for.
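
To show what I mean above by the block being hard-coded instead of passed to the LLM as "instructions": the check should sit in the agent's own code, between the model's tool call and the shell. This is a toy sketch, not how crush or anti-gravity actually work; the deny list, function names and refusal message are all hypothetical.

```python
import shlex

# Hypothetical deny list for illustration; a real agent would load its own policy.
DENIED_PROGRAMS = {"rm", "rmdir", "mkfs", "dd"}


def is_allowed(command):
    """Return False if the command's program name is on the deny list."""
    try:
        argv = shlex.split(command)
    except ValueError:
        return False  # unbalanced quotes: refuse rather than guess
    if not argv:
        return False
    program = argv[0].rsplit("/", 1)[-1]  # treat /bin/rm the same as rm
    return program not in DENIED_PROGRAMS


def run_tool_call(command, execute):
    """Check the command in code first; the model's output never decides this."""
    if not is_allowed(command):
        return "the agent prevents the command from being run"
    return execute(command)
```

A check this naive is trivially bypassed (`sh -c "rm ..."`, pipes, `;`), which is exactly why it's a complement to containers and user permissions and not a replacement for them.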





  • setting up an API connection is definitely a bit more involved, but it allows people to use it for their specific needs that the API devs may not have thought of. For example to translate all of prolewiki english to french, I set up an API access to mistral to use their servers/models to do the actual translation. Basically I send a chunk of text through the API, it does its magic on mistral’s side of things, and then their servers return the translated chunk of text. My script saves that returned text to a document, without ever caring what is going on on mistral’s side.

    There are other programs for agent coding though I don’t have experience with them, but they might make it a bit easier for people to adopt the tools. I know Claude (Anthropic) has one that apparently works with models other than their own, and is a bit more graphical, i.e. you can use it with the mouse while crush is keyboard-only.

    But try out crush, it works on any computer and I promise it’s not as scary as it looks haha (if that’s what’s holding you back). Once you’re connected it works every time you use it from then on.
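
The shape of the translation script described above is roughly this. It's a sketch and not my actual script: `translate_chunk` is a stand-in for whatever function makes the API call (I'm deliberately not writing out mistral's API here), and the chunk size and function names are made up.

```python
from pathlib import Path


def split_into_chunks(text, max_chars=2000):
    """Group paragraphs into chunks of at most max_chars where possible.

    A single paragraph longer than max_chars is kept whole as its own chunk.
    """
    chunks, current = [], ""
    for paragraph in text.split("\n\n"):
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = paragraph
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def translate_document(text, translate_chunk, out_path, max_chars=2000):
    """Send each chunk through translate_chunk and save the joined result."""
    translated = [translate_chunk(chunk) for chunk in split_into_chunks(text, max_chars)]
    Path(out_path).write_text("\n\n".join(translated), encoding="utf-8")
    return len(translated)
```

The script only cares about text in, text out, so you can swap `translate_chunk` for any provider (or a local model) without touching the rest.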