• benjirenji@slrpnk.net
    link
    fedilink
    English
    arrow-up
    9
    ·
    12 hours ago

    LLMs are the only thing that is hyped. The other models and applications have existed already back when ChatGPT first hit the public and they have not had any special break through that would explain exponential growth in investment or a need for compute power. Language models had that with the transformer structure, everything else just develops iteratively.

    The bubble we see now is because of language models and we can try and conflate it with other deep models and call it all AI, but it doesn’t change the fact that the generative models are the only ones requiring these resources and are looking for a problem to solve.