Spent the afternoon stress-testing three AI writing tools on the same prompt (product review). Interesting result: the cheapest one was fastest, but hallucinated specs. The premium one was slower yet far more consistent. Curious what setups others are using for real-world tests.