• Devorlon@lemmy.zip
      2 days ago

      I’ve been researching this for uni and you’re not too far off. There are a bunch of benchmarks out there; LLMs are run against a set of questions and given a score based on their responses.

      The questions can be multiple choice or open ended. If they’re open ended, the response is marked by another LLM.
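
      Roughly what that scoring loop looks like (a minimal sketch; names like ask_model and ask_judge_model are placeholders, not any real benchmark's API):

```python
def ask_model(prompt: str) -> str:
    """Placeholder: send `prompt` to the model being evaluated."""
    raise NotImplementedError

def ask_judge_model(prompt: str) -> str:
    """Placeholder: send `prompt` to the LLM acting as the marker."""
    raise NotImplementedError

def score_question(question: dict) -> float:
    answer = ask_model(question["prompt"])
    if question["type"] == "multiple_choice":
        # Multiple choice: exact match against the known correct option.
        return 1.0 if answer.strip().upper() == question["correct_option"] else 0.0
    # Open ended: a second LLM marks the answer against a reference.
    verdict = ask_judge_model(
        f"Question: {question['prompt']}\n"
        f"Reference answer: {question['reference']}\n"
        f"Candidate answer: {answer}\n"
        "Reply PASS or FAIL."
    )
    return 1.0 if "PASS" in verdict.upper() else 0.0

def run_benchmark(questions: list[dict]) -> float:
    # Benchmark score = average over all questions.
    return sum(score_question(q) for q in questions) / len(questions)
```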

      There are a couple of initiatives to create benchmarks with known answers that are updated frequently, so they don’t need to be marked by another LLM, but where the questions aren’t in the tested LLM’s training dataset. This matters because a lot of the apparent advancement of LLMs on these benchmarks is just the creators including the test questions and answers in the training data.
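
      The core idea of those fresh-question benchmarks, sketched below under the assumption that each question carries a publication date (the field names are made up, not any specific benchmark's schema):

```python
from datetime import date

def uncontaminated(questions: list[dict], training_cutoff: date) -> list[dict]:
    """Keep only questions published after the model's training-data cutoff,
    so the questions and answers can't already be in its training set."""
    return [q for q in questions if q["published"] > training_cutoff]

questions = [
    {"prompt": "Old question", "published": date(2023, 1, 15)},
    {"prompt": "Newly written question", "published": date(2025, 1, 10)},
]

# A model with a 2024-06-01 training cutoff is only scored on the new question.
print(uncontaminated(questions, date(2024, 6, 1)))
```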