It looks like the west has been caught with its pants down, China is developing what seems to be far more efficient AI tech, perhaps while big tech are motivated by money and IP theft, China has just got on with developing better ideas.
This also seems to be a major win for Open Source models, which is a good thing. Could also be a good thing for the EU who want to develop their own AI solutions to break from US big tech.
I was thinking about the training data, of which you need massive amounts to train. And as far as I know, pretty much all companies have worked on a scraping basis, rather than paying for (or even asking for).
Snark aside, thanks for clarifying which kind of ip theft was meant, because this is not the kind of ip theft that is normally associated with training models.
I’m personally against copyrights as a concept and absolutely don’t care about this aspect, especially when it comes to open models. The way I look at is that the model is unlocking this content and making this knowledge available to humanity.
@yogthos
It looks like the west has been caught with its pants down, China is developing what seems to be far more efficient AI tech, perhaps while big tech are motivated by money and IP theft, China has just got on with developing better ideas.
This also seems to be a major win for Open Source models, which is a good thing. Could also be a good thing for the EU who want to develop their own AI solutions to break from US big tech.
Genuinely curious, what makes you think that deepseek has been built without ip theft?
I guess it depends on what kind of ipntheft you mean.
I was thinking about the training data, of which you need massive amounts to train. And as far as I know, pretty much all companies have worked on a scraping basis, rather than paying for (or even asking for).
What kind of ip theft were you thinking of?
I was referring to both scraping to create the models and using the models to create infringing content.
Ah yes, they must be stealing IP from the future when they publish novel papers on things nobody’s done before!
Snark aside, thanks for clarifying which kind of ip theft was meant, because this is not the kind of ip theft that is normally associated with training models.
I’m personally against copyrights as a concept and absolutely don’t care about this aspect, especially when it comes to open models. The way I look at is that the model is unlocking this content and making this knowledge available to humanity.
That’s nice, dear.