What Benchmark Should I Use

Hosted on MSN

Qwen3.5-9B tops every AI benchmark right now, but that's not how you should pick a model

I should be clear: I'm not saying Qwen3.5-9B is bad. I'm saying that benchmarks, as they exist right now, are a terrible way to decide what model to use. And the hype around this particular set of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Qwen3.5-9B tops every AI benchmark right now, but that's not how you should pick a model

Trending now