周末花时间对几款热门的 AI 工具做了压力测试,用来做笔记和快速编程辅助。没想到在相同的提示下,不同工具的输出差异这么大。一个在总结方面表现很好,另一个却严重幻觉。好奇大家每天都在用什么?
Original
Spent the weekend stress-testing a few popular AI tools for note taking and quick coding help. Surprised how different the outputs are with the same prompt. One nailed summaries, another hallucinated wildly. Curious what others are using daily?