Office space meme:

“If y’all could stop calling an LLM ‘open source’ just because they published the weights… that would be great.”

  • Ajen · 2 days ago

    I don’t know what, if any, CS background you have, but that is way off. The training dataset is used to generate the weights, i.e. the trained model. In the context of building a trained LLM, the input is the dataset and the output is the trained model, i.e. the weights.
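
    A rough sketch of that dataset-in, weights-out relationship (PyTorch assumed; the model, filenames, and numbers are illustrative, not DeepSeek’s actual pipeline):

    ```python
    # Minimal sketch: dataset in, weights out (PyTorch assumed; all names illustrative).
    import torch
    from torch import nn
    from torch.utils.data import DataLoader, TensorDataset

    # The training dataset: the *input* to the training process.
    dataset = TensorDataset(torch.randn(256, 16), torch.randint(0, 2, (256,)))
    loader = DataLoader(dataset, batch_size=32, shuffle=True)

    # A tiny stand-in for an LLM.
    model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 2))
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
    loss_fn = nn.CrossEntropyLoss()

    # Training turns (dataset, untrained model) into trained weights.
    for epoch in range(3):
        for x, y in loader:
            optimizer.zero_grad()
            loss = loss_fn(model(x), y)
            loss.backward()
            optimizer.step()

    # The weights: the *output* of training. Publishing only this file,
    # without the dataset or training code, is releasing the weights alone.
    torch.save(model.state_dict(), "weights.pt")
    ```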

    It’s more appropriate to call DeepSeek “open-weight” rather than open-source.