• The Octonaut@mander.xyz
    link
    fedilink
    arrow-up
    2
    ·
    7 months ago

    I don’t think you or that Medium writer understand what “open source” means. Being able to run a local stripped down version for free puts it on par with Llama, a Meta product. Privacy-first indeed. Unless you can train your own from scratch, it’s not open source.

    Here’s the OSI’s helpful definition for your reference https://opensource.org/ai/open-source-ai-definition

    • ☆ Yσɠƚԋσʂ ☆@lemmy.mlOP
      link
      fedilink
      arrow-up
      0
      ·
      7 months ago

      You can run the full version if you have the hardware, the weights are published, and importantly the research behind it is published as well. Go troll somewhere else.

      • The Octonaut@mander.xyz
        link
        fedilink
        arrow-up
        0
        ·
        7 months ago

        All that is true of Meta’s products too. It doesn’t make them open source.

        Do you disagree with the OSI?

          • The Octonaut@mander.xyz
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            7 months ago

            The data part. ie the very first part of the OSI’s definition.

            It’s not available from their articles https://arxiv.org/html/2501.12948v1 https://arxiv.org/html/2401.02954v1

            Nor on their github https://github.com/deepseek-ai/DeepSeek-LLM

            Note that the OSI only ask for transparency of what the dataset was - a name and the fee paid will do - not that full access to it to be free and Free.

            It’s worth mentioning too that they’ve used the MIT license for the “code” included with the model (a few YAML files to feed it to software) but they have created their own unrecognised non-free license for the model itself. Why they having this misleading label on their github page would only be speculation.

            Without making the dataset available then nobody can accurately recreate, modify or learn from the model they’ve released. This is the only sane definition of open source available for an LLM model since it is not in itself code with a “source”.