• DragonBallZinn [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    67
    ·
    edit-2
    3 days ago

    cheaply and less powerful

    So AI could be more sustainable this whole time? Interesting…

    Who wants to bet their AI will be used for more useful things too?

    • somename [she/her]@hexbear.net
      link
      fedilink
      English
      arrow-up
      65
      ·
      3 days ago

      AI, as in machine learning, has always had uses. Like, real useful additions to human capability. It’s very useful in pattern recognition and statistical analysis of various things.

      Our more modern shit connotation comes from capitalists trying replace labor with generative AI, which is a small subset of potential machine learning uses, but by its nature sends massive amounts of shlock at us.

    • DefinitelyNotAPhone [he/him]@hexbear.net
      link
      fedilink
      English
      arrow-up
      36
      ·
      3 days ago

      Having dealt with ML engineers in depth before, American tech companies tend to throw blank checks their way which, combined with them not tending to have backgrounds in optimization or infrastructure, means they spin up 8 billion GPU instances in the cloud and use 10% of them ever because engineers are lazy.

      They could, without any exaggeration, reduce their energy consumption by a factor of ten with about two weeks of honest engineering work. Yes this bothers the fuck out of me.

      • NudeNewt@lemm.ee
        link
        fedilink
        English
        arrow-up
        14
        ·
        3 days ago

        That’s my biggest gripe with mainstream closed source AI, they can optimize some of their most powerful MLAs to run on a potato but… they don’t. And they’ll never open source becaue it’d be forked by people who are genuinely passionate about improvement.

        AKA they’d be run outta business in no time.

    • Sulvor [he/him, undecided]@hexbear.net
      link
      fedilink
      English
      arrow-up
      25
      ·
      3 days ago

      Is it actually “sustainable” though? I’d like to see some wattage numbers.

      I said this in the other thread but I’ll say it again: our sophisticated artificial intelligence, their planet burning chatbot.

    • hello_hello [comrade/them]@hexbear.net
      link
      fedilink
      English
      arrow-up
      29
      ·
      edit-2
      3 days ago

      Who wants to bet their AI will be used for more useful things too?

      Unlike in the West where if Nvidia tanks then the entire US economy goes down with it.

    • JustSo [she/her, any]@hexbear.net
      link
      fedilink
      English
      arrow-up
      12
      ·
      3 days ago

      So AI could be more sustainable this whole time? Interesting…

      Yeah a lot of the breakthrough research has happened via cheaper / resource limited methods now that the cat is out of the bag. We live in interesting times.

    • Hexboare [they/them]@hexbear.net
      link
      fedilink
      English
      arrow-up
      7
      ·
      3 days ago

      So AI could be more sustainable this whole time? Interesting…

      It’s not that unsustainable unless you believe the predicts that your microwave will be building it’s own LLM model every month by 2030

      The electricity and water usage isn’t very high in general or compared to data centre usage more broadly (and is basically nothing compared to crypto)

      • dinklesplein [any, he/him]@hexbear.net
        link
        fedilink
        English
        arrow-up
        8
        ·
        3 days ago

        source: ML guy i know so this could be entirely unsubstantiated but apparently the main environmental burden of LLM infrastructure comes from training new models not serving inference from already deployed models.

        • tellmeaboutit@lemmygrad.ml
          link
          fedilink
          English
          arrow-up
          1
          ·
          2 days ago

          That might change now that companies are creating “reasoning” models like DeepSeek R1. They aren’t really all that different architecturally but they produce longer outputs which just requires more compute.