Note that this setup runs a 671B model in Q4 quantization at 3-4 TPS; running Q8 would need something beefier. To run a 671B model in the original Q8 at 6-8 TPS, you’d need a dual-socket EPYC server motherboard with 768GB of RAM.
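For a rough sense of why Q8 needs the bigger box, here is a back-of-the-envelope sketch of the weight footprint at each quantization level. It assumes exactly 4 or 8 bits per weight and counts weights only; real quantized files carry some extra overhead for scales and metadata, and the KV cache and activations need memory on top. The function name `weight_memory_gb` is just illustrative.

```python
def weight_memory_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (decimal, 1 GB = 1e9 bytes).

    Counts weights only. Quantization scales, KV cache, and activations
    are deliberately ignored, so treat the result as a lower bound.
    """
    total_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# 671B parameters at 4 and 8 bits per weight (assumed exact bit widths)
q4_gb = weight_memory_gb(671, 4)
q8_gb = weight_memory_gb(671, 8)

print(f"Q4 weights: ~{q4_gb:.1f} GB")
print(f"Q8 weights: ~{q8_gb:.1f} GB")
print(f"Q8 fits in 768 GB of RAM: {q8_gb < 768}")
```

By this estimate the Q8 weights alone are roughly twice the size of the Q4 weights, which is consistent with the jump to a 768GB server for the Q8 setup.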

  • ☆ Yσɠƚԋσʂ ☆@lemmygrad.ml (OP)
    2 months ago

    I really like the take from Wang Jian, who founded Alibaba Cloud. His view is that AGI is a meaningless term. In practice, it’s a gradient: model capabilities keep improving along many different dimensions, and the models keep becoming more useful.

    In terms of the whole singularity thing, it’s certainly not out of the realm of possibility. For example, stuff like this is already happening where the discovery of better models is becoming automated. The question is where things start to plateau.

    Overall, I’m fairly optimistic as well. I think it’s almost certain that China will drive most of the progress because they have the industries to apply this tech. We already see automation in factories, robots increasingly being used for manual labour, self-driving trucks, etc. It’s entirely likely that a lot of hard jobs will be automated within a decade or so.

    At the same time, I do expect this tech to have negative consequences in capitalist societies, where it will displace labour and drive unemployment. I’d argue that deepening inequality will necessarily lead to further radicalization of workers, and will help convince people that capitalism is not sustainable.