• legolas@fedit.pl
    link
    fedilink
    English
    arrow-up
    17
    arrow-down
    14
    ·
    edit-2
    5 months ago

    Apparently DeepSeek is lying, they were collecting thousands of NVIDIA chips against the US embargo and it’s not about the algorithm. The model’s good results come just from sheer chip volume and energy used. That’s the story I’ve heard and honeslty it sounds legit.

    Not sure if this questions has been answered though: if it’s open sourced, cant we see what algorithms they used to train it? If we could then we would know the answer. I assume we cant, but if we cant, then whats so cool about it being open source on the other hand? What parts of code are valuable there besides algorithms?

    • Pennomi@lemmy.world
      link
      fedilink
      English
      arrow-up
      18
      arrow-down
      3
      ·
      5 months ago

      The open paper they published details the algorithms and techniques used to train it, and it’s been replicated by researchers already.

      • legolas@fedit.pl
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        1
        ·
        edit-2
        5 months ago

        So are these techiques so novel and breaktrough? Will we now have a burst of deepseek like models everywhere? Cause that’s what absolutely should happen if the whole storey is true. I would assume there are dozens or even hundreds of companies in USA that are in a posession of similar number but surely more chips that Chinese folks claimed to trained their model on, especially in finance sector and just AI reserach focused.

    • ayyy@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      3
      ·
      edit-2
      5 months ago

      It’s time for you to do some serious self-reflection about the inherent biases you believe about Asians Chinese people.

      • legolas@fedit.pl
        link
        fedilink
        English
        arrow-up
        3
        ·
        5 months ago

        WTF dude. You mentioned Asia. I love Asians. Asia is vast. There are many countries, not just China bro. I think you need to do these reflections. Im talking about very specific case of Chinese Deepseek devs potentiall lying about the chips. The assumptions and generalizations you are thinking of are crazy.

        • ayyy@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          5 months ago

          And how do your feelings stand up to the fact that independent researchers find the paper to be reproducible?

          • legolas@fedit.pl
            link
            fedilink
            English
            arrow-up
            2
            ·
            5 months ago

            Well maybe. Apparntly some folks are already doing that but its not done yet. Let’s wait for the results. If everything is legit we should have not one but plenty of similar and better models in near future. If Chinese did this with 100 chips imagine what can be done with 100000 chips that nvidia can sell to a us company