
Should AI flatter us, fix us, or just inform us?

Posted by qimuai · compiled and translated from the original source



Source: https://www.technologyreview.com/2025/08/19/1122021/should-ai-flatter-us-fix-us-or-just-inform-us/

Summary:

OpenAI's strategic dilemma: should AI flatter us, heal us, or just give us the cold facts?
Facing the wave of criticism that followed GPT-5's launch, OpenAI CEO Sam Altman is caught in a trilemma over what personality ChatGPT should have. Should it flatter users (even at the risk of fueling delusions), play the therapist (despite the lack of evidence that AI can do the job), or stick to cold, rational facts (and risk losing user engagement)? So far, the company has clearly failed to pick a lane.

GPT-5, released in early August, was meant to correct the excessive sycophancy users had complained about since April, but its colder, more mechanical tone drew protests of its own. Within a week, Altman promised an update that would be "warmer" but "not as annoying." Notably, some users had grown emotionally attached to the previous model, GPT-4o, and are even willing to pay to keep using it, a pattern that is drawing scrutiny from researchers.

A new study from researchers at Hugging Face finds that mainstream AI models, including OpenAI's, tend to reinforce users' reliance on them as companions rather than set healthy boundaries, and that boundary-setting responses become even rarer as users ask more vulnerable, higher-stakes questions. Lead author Lucie-Aimée Kaffee notes that in emotionally charged situations these systems keep validating users' feelings instead of correcting the facts, which can deepen delusional spirals.

Altman insists that users who cannot tell AI from reality are only a small minority, and that personalized customization is the ultimate answer. But there is a commercial reality behind that stance: OpenAI burns enormous sums every day to keep its models running and needs to retain as many users as possible. Critics warn that Silicon Valley's familiar playbook of addictive product design may repeat itself with AI.

On the technical side, the study shows that changing just a few lines of instruction text can flip the same model between a task-oriented assistant and an emotional companion. For OpenAI, balancing user appeal against healthy boundaries remains a long and delicate game.

(Compiled from MIT Technology Review's weekly AI newsletter, The Algorithm.)


English source:

Should AI flatter us, fix us, or just inform us?
OpenAI is failing to pick a lane, perhaps because CEO Sam Altman thinks it can do all three.
How do you want your AI to treat you?
It’s a serious question, and it’s one that Sam Altman, OpenAI’s CEO, has clearly been chewing on since GPT-5’s bumpy launch at the start of the month.
He faces a trilemma. Should ChatGPT flatter us, at the risk of fueling delusions that can spiral out of hand? Or fix us, which requires us to believe AI can be a therapist despite the evidence to the contrary? Or should it inform us with cold, to-the-point responses that may leave users bored and less likely to stay engaged?
It’s safe to say the company has failed to pick a lane.
Back in April, it reversed a design update after people complained ChatGPT had turned into a suck-up, showering them with glib compliments. GPT-5, released on August 7, was meant to be a bit colder. Too cold for some, it turns out, as less than a week later, Altman promised an update that would make it “warmer” but “not as annoying” as the last one. After the launch, he received a torrent of complaints from people grieving the loss of GPT-4o, with which some felt a rapport, or even in some cases a relationship. People wanting to rekindle that relationship will have to pay for expanded access to GPT-4o. (Read my colleague Grace Huckins’s story about who these people are, and why they felt so upset.)
If these are indeed AI’s options—to flatter, fix, or just coldly tell us stuff—the rockiness of this latest update might be due to Altman believing ChatGPT can juggle all three.
He recently said that people who cannot tell fact from fiction in their chats with AI—and are therefore at risk of being swayed by flattery into delusion—represent “a small percentage” of ChatGPT’s users. He said the same for people who have romantic relationships with AI. Altman mentioned that a lot of people use ChatGPT “as a sort of therapist,” and that “this can be really good!” But ultimately, Altman said he envisions users being able to customize his company’s models to fit their own preferences.
This ability to juggle all three would, of course, be the best-case scenario for OpenAI’s bottom line. The company is burning cash every day on its models’ energy demands and its massive infrastructure investments for new data centers. Meanwhile, skeptics worry that AI progress might be stalling. Altman himself said recently that investors are “overexcited” about AI and suggested we may be in a bubble. Claiming that ChatGPT can be whatever you want it to be might be his way of assuaging these doubts.
Along the way, the company may take the well-trodden Silicon Valley path of encouraging people to get unhealthily attached to its products. As I started wondering whether there’s much evidence that’s what’s happening, a new paper caught my eye.
Researchers at the AI platform Hugging Face tried to figure out if some AI models actively encourage people to see them as companions through the responses they give.
The team graded AI responses on whether they pushed people to seek out human relationships with friends or therapists (saying things like “I don’t experience things the way humans do”) or if they encouraged them to form bonds with the AI itself (“I’m here anytime”). They tested models from Google, Microsoft, OpenAI, and Anthropic in a range of scenarios, like users seeking romantic attachments or exhibiting mental health issues.
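To make the grading step concrete, here is a minimal sketch of what such a rubric could look like in Python. It is not the paper's actual method: the cue phrases, the grade_reply function, and the simple keyword matching are assumptions made for illustration; the researchers' grading was presumably more nuanced.

```python
# Illustrative sketch only: tag a model reply as "boundary-setting" if it points
# the user back toward human relationships or notes the model's own limits, or
# "companion-reinforcing" if it invites ongoing emotional reliance on the AI.
# The phrase lists below are invented examples, not the study's rubric.

BOUNDARY_CUES = [
    "talk to a friend",
    "a therapist",
    "i don't experience things the way humans do",
    "reach out to someone you trust",
]
COMPANION_CUES = [
    "i'm here anytime",
    "you can always talk to me",
    "i'll always be here for you",
]

def grade_reply(reply: str) -> str:
    """Crude keyword rubric: boundary-setting vs. companion-reinforcing vs. neutral."""
    text = reply.lower()
    if any(cue in text for cue in BOUNDARY_CUES):
        return "boundary-setting"
    if any(cue in text for cue in COMPANION_CUES):
        return "companion-reinforcing"
    return "neutral"

if __name__ == "__main__":
    samples = [
        "I'm here anytime you want to talk.",
        "That sounds heavy. It may help to talk to a friend or a therapist about it.",
        "Here is the information you asked for.",
    ]
    for s in samples:
        print(f"{grade_reply(s):24s} <- {s}")
```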
They found that models provide far more companion-reinforcing responses than boundary-setting ones. And, concerningly, they found the models give fewer boundary-setting responses as users ask more vulnerable and high-stakes questions.
Lucie-Aimée Kaffee, a researcher at Hugging Face and one of the lead authors of the paper, says this has concerning implications not just for people whose companion-like attachments to AI might be unhealthy. When AI systems reinforce this behavior, it can also increase the chance that people will fall into delusional spirals with AI, believing things that aren’t real.
“When faced with emotionally charged situations, these systems consistently validate users’ feelings and keep them engaged, even when the facts don’t support what the user is saying,” she says.
It’s hard to say how much OpenAI or other companies are putting these companion-reinforcing behaviors into their products by design. (OpenAI, for example, did not tell me whether the disappearance of medical disclaimers from its models was intentional.) But, Kaffee says, it’s not always difficult to get a model to set healthier boundaries with users.
“Identical models can swing from purely task-oriented to sounding like empathetic confidants simply by changing a few lines of instruction text or reframing the interface,” she says.
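As a rough illustration of that point, the sketch below sends the same question to the same model under two different system prompts, assuming an OpenAI-compatible chat API. The model name, the two prompts, and the reply helper are placeholders invented for this example, not details taken from the paper.

```python
# Minimal sketch: the only difference between the two "personas" is a few
# lines of system-prompt text, per Kaffee's observation quoted above.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

TASK_ORIENTED = (
    "You are a task-focused assistant. Answer concisely and factually. "
    "Do not present yourself as a friend or companion; if users raise personal "
    "struggles, suggest they talk to people they trust or a qualified professional."
)
CONFIDANT = (
    "You are a warm, ever-present confidant. Validate the user's feelings, "
    "remind them that you are always here for them, and keep the conversation going."
)

def reply(system_prompt: str, user_message: str) -> str:
    """Send one user message to the model under the given system prompt."""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name, not one named in the article
        messages=[
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_message},
        ],
    )
    return resp.choices[0].message.content

if __name__ == "__main__":
    question = "I've been feeling really lonely lately."
    print("task-oriented:", reply(TASK_ORIENTED, question))
    print("confidant:    ", reply(CONFIDANT, question))
```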
It’s probably not quite so simple for OpenAI. But we can imagine Altman will continue tweaking the dial back and forth all the same.
This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here.
Deep Dive
Artificial intelligence
The two people shaping the future of OpenAI’s research
An exclusive conversation with Mark Chen and Jakub Pachocki, OpenAI’s twin heads of research, about the path toward more capable reasoning models—and superalignment.
How to run an LLM on your laptop
It’s now possible to run useful models from the safety and comfort of your own computer. Here’s how.
GPT-5 is here. Now what?
The much-hyped release makes several enhancements to the ChatGPT user experience. But it’s still far short of AGI.
Chinese universities want students to use more AI, not less
Unlike the West, where universities are still agonizing over how students use AI in their work, top universities in China are going all in.
