morrowind@lemm.eeEnglish · 1 day agoStarVector - a foundation model for generating svgsplus-squarehuggingface.coexternal-linkmessage-square3fedilinkarrow-up118arrow-down10
arrow-up118arrow-down1external-linkStarVector - a foundation model for generating svgsplus-squarehuggingface.comorrowind@lemm.eeEnglish · 1 day agomessage-square3fedilink
thickertoofan@lemm.eeEnglish · 4 days agoSpatialLM, a 1B model capable of spatial identification, using 3d point cloud data. The video demo is amazing.plus-squaremanycore-research.github.ioexternal-linkmessage-square6fedilinkarrow-up126arrow-down10
arrow-up126arrow-down1external-linkSpatialLM, a 1B model capable of spatial identification, using 3d point cloud data. The video demo is amazing.plus-squaremanycore-research.github.iothickertoofan@lemm.eeEnglish · 4 days agomessage-square6fedilink
thickertoofan@lemm.eeEnglish · 4 days agoSoon you will be able to run LLMs natively in dockerplus-squarewww.docker.comexternal-linkmessage-square1fedilinkarrow-up124arrow-down15
arrow-up119arrow-down1external-linkSoon you will be able to run LLMs natively in dockerplus-squarewww.docker.comthickertoofan@lemm.eeEnglish · 4 days agomessage-square1fedilink
thickertoofan@lemm.eeEnglish · 6 days agoMicrosoft KBLAMplus-squarewww.microsoft.comexternal-linkmessage-square5fedilinkarrow-up111arrow-down10
arrow-up111arrow-down1external-linkMicrosoft KBLAMplus-squarewww.microsoft.comthickertoofan@lemm.eeEnglish · 6 days agomessage-square5fedilink
NinjaMoves@feddit.nuEnglish · 8 days agoMistral small 3.1 releasedplus-squaremistral.aiexternal-linkmessage-square5fedilinkarrow-up128arrow-down11
arrow-up127arrow-down1external-linkMistral small 3.1 releasedplus-squaremistral.aiNinjaMoves@feddit.nuEnglish · 8 days agomessage-square5fedilink
Autonomous User@lemmy.worldEnglish · edit-25 days agoOllama not using AMD GPU on Arch Linux [Fixed] plus-squaremessage-squaremessage-square0fedilinkarrow-up18arrow-down12
arrow-up16arrow-down1message-squareOllama not using AMD GPU on Arch Linux [Fixed] plus-squareAutonomous User@lemmy.worldEnglish · edit-25 days agomessage-square0fedilink
Autonomous User@lemmy.worldEnglish · 7 days agoLlama 3.1 Community License is not a free software licenseplus-squarewww.fsf.orgexternal-linkmessage-square3fedilinkarrow-up117arrow-down11
arrow-up116arrow-down1external-linkLlama 3.1 Community License is not a free software licenseplus-squarewww.fsf.orgAutonomous User@lemmy.worldEnglish · 7 days agomessage-square3fedilink
Autonomous User@lemmy.worldEnglish · edit-25 days agoHow to install Ollama on Arch Linux (Windows and Open WebUI guides coming soon)plus-squaremessage-squaremessage-square0fedilinkarrow-up15arrow-down12
arrow-up13arrow-down1message-squareHow to install Ollama on Arch Linux (Windows and Open WebUI guides coming soon)plus-squareAutonomous User@lemmy.worldEnglish · edit-25 days agomessage-square0fedilink
morrowind@lemm.eeEnglish · 7 days agoEXAONE Deep ━ Setting a New Standard for Reasoning AI - LG AI Research Newsplus-squarewww.lgresearch.aiexternal-linkmessage-square4fedilinkarrow-up15arrow-down11
arrow-up14arrow-down1external-linkEXAONE Deep ━ Setting a New Standard for Reasoning AI - LG AI Research Newsplus-squarewww.lgresearch.aimorrowind@lemm.eeEnglish · 7 days agomessage-square4fedilink
SmokeyDope@lemmy.worldMEnglish · 8 days agoReturning back to where it started with llama 3 8B. DeepHermes is a great for 8gb VRAM cardsplus-squaremessage-squaremessage-square3fedilinkarrow-up120arrow-down10
arrow-up120arrow-down1message-squareReturning back to where it started with llama 3 8B. DeepHermes is a great for 8gb VRAM cardsplus-squareSmokeyDope@lemmy.worldMEnglish · 8 days agomessage-square3fedilink
morrowind@lemm.eeEnglish · 9 days agoSketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketchingplus-squarearxiv.orgexternal-linkmessage-square2fedilinkarrow-up112arrow-down10
arrow-up112arrow-down1external-linkSketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketchingplus-squarearxiv.orgmorrowind@lemm.eeEnglish · 9 days agomessage-square2fedilink
hendrik@palaver.p3x.deEnglish · edit-29 days agoRecommendations for a lightweight Python LLM framework for a webapp?plus-squaremessage-squaremessage-square5fedilinkarrow-up18arrow-down11
arrow-up17arrow-down1message-squareRecommendations for a lightweight Python LLM framework for a webapp?plus-squarehendrik@palaver.p3x.deEnglish · edit-29 days agomessage-square5fedilink
thickertoofan@lemm.eeEnglish · 10 days agoLoaded benchmark for 1-3-4-7b models?plus-squaremessage-squaremessage-square4fedilinkarrow-up17arrow-down10
arrow-up17arrow-down1message-squareLoaded benchmark for 1-3-4-7b models?plus-squarethickertoofan@lemm.eeEnglish · 10 days agomessage-square4fedilink
SmokeyDope@lemmy.worldMEnglish · edit-210 days agoCan Your LLM pass the Sun Theft Vibe Check (STVC) benchmark?plus-squaremessage-squaremessage-square2fedilinkarrow-up112arrow-down10
arrow-up112arrow-down1message-squareCan Your LLM pass the Sun Theft Vibe Check (STVC) benchmark?plus-squareSmokeyDope@lemmy.worldMEnglish · edit-210 days agomessage-square2fedilink
SmokeyDope@lemmy.worldMEnglish · edit-210 days agoDeepHermes Preview features swappable standard output to R1 distill CoT reasoning. Its kind of blowing my mind.plus-squarelemmy.worldimagemessage-square5fedilinkarrow-up18arrow-down10
arrow-up18arrow-down1imageDeepHermes Preview features swappable standard output to R1 distill CoT reasoning. Its kind of blowing my mind.plus-squarelemmy.worldSmokeyDope@lemmy.worldMEnglish · edit-210 days agomessage-square5fedilink
thickertoofan@lemm.eeEnglish · 12 days agoGemma 3 1B and 3B result on a "needle in a haystack" like test ran locallyplus-squaremessage-squaremessage-square1fedilinkarrow-up112arrow-down10
arrow-up112arrow-down1message-squareGemma 3 1B and 3B result on a "needle in a haystack" like test ran locallyplus-squarethickertoofan@lemm.eeEnglish · 12 days agomessage-square1fedilink
Lantier@jlai.luEnglish · edit-213 days agoNew release: Gemma 3 family of modelsplus-squarehuggingface.coexternal-linkmessage-square3fedilinkarrow-up120arrow-down10
arrow-up120arrow-down1external-linkNew release: Gemma 3 family of modelsplus-squarehuggingface.coLantier@jlai.luEnglish · edit-213 days agomessage-square3fedilink
Björn Tantau@swg-empire.deEnglish · edit-213 days agoIs there a German 7B Vision Model?plus-squaremessage-squaremessage-square3fedilinkarrow-up16arrow-down10
arrow-up16arrow-down1message-squareIs there a German 7B Vision Model?plus-squareBjörn Tantau@swg-empire.deEnglish · edit-213 days agomessage-square3fedilink
morrowind@lemm.eeEnglish · 13 days agoSorting-Free GPU Kernels for LLM Samplingplus-squareflashinfer.aiexternal-linkmessage-square0fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkSorting-Free GPU Kernels for LLM Samplingplus-squareflashinfer.aimorrowind@lemm.eeEnglish · 13 days agomessage-square0fedilink
morrowind@lemm.eeEnglish · 14 days agoReka Flash, open source 21B model comparable to QWQ 32Bi.postimg.ccimagemessage-square2fedilinkarrow-up119arrow-down10
arrow-up119arrow-down1imageReka Flash, open source 21B model comparable to QWQ 32Bi.postimg.ccmorrowind@lemm.eeEnglish · 14 days agomessage-square2fedilink