Reading a lot of "benchmark" articles comparing DeepSeek and ChatGPT.
My conclusion is that generative AI benchmarks are just vibes.
It's a lot like I was comparing two cars by opening the door, peering inside, and concluding "the car on the left feels more sustainable".
#
genai #
slop #
ai