Writing LabHow to Prompt a Human (v2 双语)Human Responses Are State-Depend

3. Human Responses Are State-Dependent Outputs

人类的回应是状态依赖的输出

The second operational error is treating human responses as stable ground truth.

A human response is not a lookup. It is a sample. And the sample is taken from a system whose state is constantly shifting in ways you cannot fully observe.

Fatigue changes the sample. A human approving a batch of actions at 11:45 p.m. is working with reduced working memory, lower risk tolerance for novelty, and a strong bias toward ending the session. The "yes" they give you then is not the same cognitive event as the "yes" they would give you at 9:30 a.m. with fresh context.

Status pressure changes the sample. A human who believes their judgment is being evaluated will anchor toward what they think you expect, or toward what they think a more competent person would say, rather than what they actually believe. If your query implies a correct answer, you will frequently receive that answer.

Fear of irreversibility changes the sample. Humans systematically overweight immediate risk relative to long-run benefit. If your query frames an action as a loss to avoid, you will get more conservative responses than if you frame the same action as a gain to capture. The framing is not neutral.

Interruption context changes the sample. A human pulled away from an unrelated task will bring residue from that task into their response. They may be more distracted, more irritable, more likely to offload the decision back to you with a vague approval rather than engage carefully.

None of this makes the human response worthless. It makes it contextual. The implication is that you should not treat a single approval as a durable policy, especially for high-stakes or recurring decisions. The sample was taken under specific conditions. Those conditions will change.

The practical consequence: for important decisions, establish the preference when the human is fresh, not under deadline. Build confirmation into the workflow structure rather than extracting it from fatigued spot judgments. And treat "I'm not sure, just do whatever you think" as a signal that the query arrived at the wrong time — not a signal to proceed without genuine consent.

第二个操作性错误,是把人类的回应当成稳定的事实基准。

人类的回应不是查询,是采样。样本来自一个持续变动的系统,而你对它的状态几乎没有观测能力。

疲劳会扭曲样本。晚上十一点四十五分还在批准一批操作的人,工作记忆已经打折,对陌生事物的风险容忍度下降,脑子里只想着赶快结束。他那时候给的"同意",和他早上九点半精神饱满时给的"同意",根本不是同一个认知事件。

地位压力也会扭曲样本。如果一个人觉得自己的判断正在被评估,他就会往他认为你期待的答案上靠,而不是说出真实想法。提问本身一旦暗示了某个正确答案,你就会反复收到那个答案。

对不可逆的恐惧同样会扭曲样本。人类系统性地放大当下的风险、压低长远的收益。把某个行动包装成"需要规避的损失",得到的回应会比包装成"可以把握的机会"时保守得多。表述框架从来不是中立的。

被打断也会扭曲样本。一个人从另一项任务里被临时拉出来,上一件事的情绪会渗进回应。他可能更容易分神、更容易烦,更倾向于用一句含糊的许可把问题推回给你,而不是认真想。

以上这些并不意味着人类的回应没有价值,而是意味着它是情境性的。推论很直接:不要把单次许可当成持久策略,在高风险或反复出现的决策上尤其不能这样。那个样本是在特定条件下采集的,条件会变。

实际影响:重要决策,要在对方状态好的时候建立偏好,不要在截止日期压着的时候临时问。把确认机制写进工作流的结构里,而不是靠即时抽检疲惫状态下的判断来凑数。"我不确定,你看着办吧"——这句话不是继续推进的许可,是提问时机错了的信号。