Post
QwQ 32B was published today and I already tested it for AHA Leaderboard. The results are not that good! It did better than its predecessor (Qwen 2.5) in fasting and nutrition but worse in domains like nostr, bitcoin and faith. Overall worse than previous.
https://image.nostr.build/965699957d9bab7158ca4a5c6b5f70e8a9832d63fb803f34de3fe5b0e341b3a7.png
LLMs are getting detached from humans. Y'all have been warned, lol.
0
0
0
0