article: https://x.ai

trained a prototype LLM (Grok-0) with 33 billion parameters. This early model approaches LLaMA 2 (70B) capabilities on standard LM benchmarks but uses only half of its training resources. In the last two months, we have made significant improvements in reasoning and coding capabilities leading up to Grok-1, a state-of-the-art language model that is significantly more powerful, achieving 63.2% on the HumanEval coding task and 73% on MMLU.

  • @noneabove1182M
    link
    English
    2
    edit-2
    8 months ago

    While the drama around X and musk cannot be understated, it’s still great to see more players in the open model world (assuming this gets properly opened)

    One thing that’ll hold it back (for people like us at least) is developer support so I’m quite curious to see how this plays out with things like GPTQ and llama.cpp