• just another dev@lemmy.my-box.dev
        link
        fedilink
        English
        arrow-up
        1
        ·
        5 months ago

        I was thinking about the training data, of which you need massive amounts to train. And as far as I know, pretty much all companies have worked on a scraping basis, rather than paying for (or even asking for).

        What kind of ip theft were you thinking of?

        • hitmyspot@aussie.zone
          link
          fedilink
          arrow-up
          1
          ·
          5 months ago

          I was referring to both scraping to create the models and using the models to create infringing content.

      • just another dev@lemmy.my-box.dev
        link
        fedilink
        English
        arrow-up
        2
        ·
        5 months ago

        Snark aside, thanks for clarifying which kind of ip theft was meant, because this is not the kind of ip theft that is normally associated with training models.

        • ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
          link
          fedilink
          arrow-up
          3
          arrow-down
          1
          ·
          5 months ago

          I’m personally against copyrights as a concept and absolutely don’t care about this aspect, especially when it comes to open models. The way I look at is that the model is unlocking this content and making this knowledge available to humanity.