A note on the projects examined: this is not a criticism of any individual developer. I do not know the author personally. I have nothing against them. I’ve chosen the projects because they are public, representative, and relatively easy to benchmark. The failure patterns I found are produced by the tools, not the author. Evidence from METR’s randomized study and GitClear’s large-scale repository analysis support that these issues are not isolated to one developer when output is not heavily verified. That’s the point I’m trying to make!
Dr. Adam Chekroud, a psychiatry professor at Yale University and CEO of the mental health company Spring Health, went as far to call a chatbot “a huge sycophant” that is “constantly validating everything that people say back to it.”,这一点在新收录的资料中也有详细论述
。业内人士推荐新收录的资料作为进阶阅读
Premium & FT Weekend Print,推荐阅读新收录的资料获取更多信息
Екатерина Смирная (корреспондент отдела оперативной информации)
Жители Санкт-Петербурга устроили «крысогон»17:52