• 12 Posts
  • 202 Comments
Joined 1 year ago
cake
Cake day: June 8th, 2023

help-circle


















  • SkySyrupMtoLocalLLaMATraining a model without a GPU
    link
    fedilink
    English
    arrow-up
    6
    ·
    edit-2
    11 months ago

    Sure! You’ll probably want to look at train-text-from-scratch in the llama.cpp project, it runs on pure CPU. The (admittedly little docs) should help, otherwise ChatGPT is a good help if you show it the code. NanoGPT is fine too.

    For dataset, maybe you could train on French Wikipedia, or scrape from a French story site or fan fiction or whatever. Wikipedia is probably easiest, since they provide downloadable offline versions that are only a couple gigs.















Moderates