Nvidia falls 14% in premarket trading as China's DeepSeek triggers global tech sell-off

schizoidman@lemm.ee · 7 months ago


surph_ninja@lemmy.world · 7 months ago

It’s one thing to be ignorant. It’s quite another to be confidently so in the face of overwhelming evidence that you’re wrong. Impressive.

fuck_u_spez_in_particular@lemmy.world · 7 months ago

confidently so in the face of overwhelming evidence

That I’d really like to see. And I mean more than the marketing bullshit that AI companies are doing…

For the record, I was one of the first to jump on the AI hype train (as a programmer and computer scientist with a machine-learning background), following the development of GPT-1 through GPT-4, excited about having to write less boilerplate code, getting help with rough ideas, etc. GPT-4 was almost at the point of being a help (similarly o1 or Anthropic’s models). But I seldom use AI currently (and I’m observing the same with colleagues and other people I know) because it actually slows me down or gives me wrong ideas; I end up arguing with it, only to see it yet again saturate at a local minimum (i.e. it doesn’t get better, no matter what input I try), so that I have to do it myself anyway… (which I should’ve done in the first place…).

The same is true for the image-generation side (first with GANs, now with diffusion-based models).

I can go into more detail about transformer/attention-based models and their current plateau phase (i.e. more hardware doesn’t actually make things significantly better; it gets exponentially more expensive to make things slightly better) if you really want…

I hope that we make a breakthrough of course, that a model actually learns to reason, but I fear that will take time, and it might even mean that we need a different type of hardware.

surph_ninja@lemmy.world · 7 months ago

Any other AI company, and most of that would be legitimate criticism of the overhype used to generate more funding. But how does any of that apply to DeepSeek, and the code & paper they released?

fuck_u_spez_in_particular@lemmy.world · 7 months ago

DeepSeek

Yeah, it’ll certainly be exciting to see where this goes, i.e. whether it really develops into a useful tool. Though I’m slightly cautious nonetheless. It’s not doing something significantly different (i.e. it’s still an LLM), it’s just a lot cheaper/more efficient to train, and open for everyone (which is great).

surph_ninja@lemmy.world · 7 months ago

What’s this “if” nonsense? I loaded up a light model of it, and already have put it to work.

fuck_u_spez_in_particular@lemmy.world · 7 months ago

Have you actually read my text wall?

Even o1 (which AFAIK is roughly on par with R1-671B) wasn’t really helpful for me. I often (actually all the time) need correct answers to complex problems, and LLMs just aren’t capable of delivering that.

I still need to try out whether it’s possible to train it on my/our codebase, so that it’s at least usable as something like GitHub Copilot (which I also don’t use, because it just isn’t reliable enough and too often generates bugs). Also, I’m a fast typist: by the time the answer is there and I’ve parsed/read/understood the code, I’ve already written a better version.

surph_ninja@lemmy.world · 7 months ago

Ahh. It’s overconfident neckbeard stuff then.

fuck_u_spez_in_particular@lemmy.world · 7 months ago

You’re just trolling, aren’t you? Have you used AI for a longer time while coding and then tried without it for some time? I currently don’t miss it… Keep in mind that you still have to check whether all the code is correct etc. Writing code isn’t the thing that usually takes that much time for me… It’s debugging, and finding architecturally sound and good solutions for the problem. And AI is definitely not good at that (even if you’re not that experienced).

surph_ninja@lemmy.world · 7 months ago

Yes, I have tested that use case multiple times. It performs well enough.

A calculator also isn’t much help, if the person operating it fucks up. Maybe the problem in your scenario isn’t the AI.

