New release: Gemma 3 family of models (huggingface.co)
Posted by Lantier@jlai.lu to LocalLLaMA@sh.itjust.works · English · edited 6 days ago · 3 comments · 20 upvotes, 0 downvotes
brucethemoose@lemmy.world · English · 20 hours ago
I tested these out and found they are really bad at longer context… at least in settings that can sanely fit on most GPUs. Seems the Gemma family is mostly for short-context work, still.