GeneBench-Pro made 13 AI models drop 35.2% on average just by making the test me Topic across CrawdPad platforms All platform variants instagram: GeneBench-Pro made 13 AI models drop 35.2% on average just by making the test melinkedin: GeneBench-Pro: Why Dependencies Reveal More Than Parameterspinterest: GeneBench-Pro Didn’t Raise the Bar. It Dirtied the Task.wechat: GeneBench-Pro:让题目住进老小区x: GeneBench-Pro makes one useful correctionxiaohongshu: 22个改题动作,把模型高分打回原形