«

解释钟形曲线无处不在的数学原理

qimuai 发布于 阅读:2 一手编译


解释钟形曲线无处不在的数学原理

内容来源:https://www.quantamagazine.org/the-math-that-explains-why-bell-curves-are-everywhere-20260316/

内容总结:

无处不在的钟形曲线:数学定理揭示世界运行规律

无论测量降雨量、猜测罐中糖果数量,还是统计身高体重、考试成绩,大量数据总会呈现相似的钟形分布曲线。这一普遍现象背后,隐藏着一个被称为“中心极限定理”的数学原理,它被誉为现代实证科学的基石。

从赌博中诞生的规律
18世纪初,法国数学家亚伯拉罕·棣莫弗为谋生在伦敦咖啡馆为赌徒提供数学咨询。他发现,尽管单次掷硬币、掷骰子的结果完全随机,但将大量随机事件叠加后,结果会呈现稳定规律。例如,掷硬币100万次,正面朝上的次数总会集中在50万次附近,极少出现极端值。棣莫弗首次精确描绘出这一钟形曲线(正态分布),但其意义直到19世纪初由皮埃尔-西蒙·拉普拉斯深化阐释才被完全揭示。

平均值的魔力
中心极限定理的核心在于:无论原始数据如何分布,只要从总体中随机抽取足够多的样本并计算其平均值,这些平均值的分布就会趋近于钟形曲线。多伦多大学统计学家杰弗里·罗森塔尔举例:“一个人的身高取决于遗传、营养等众多独立微小因素的综合作用,这类似于对大量微小效应求平均值,因此人类身高近似服从正态分布。”

科学探测与异常识别
该定理不仅帮助科学家从混沌数据中提取规律,还能识别异常。例如,若掷硬币100次仅出现20次正面,根据定理可计算出公平硬币出现此结果的概率极低(约0.15%),从而判断硬币可能被动过手脚。拉普拉斯公式的精妙之处在于,无需深究过程细节,仅通过平均值分布即可对随机现象做出推断。

局限与拓展
尽管强大,该定理要求样本数量充足且相互独立。威廉姆斯学院应用统计学家理查德·德沃指出,在极端事件频发的今天(如百年一遇的洪水),仅关注平均值可能不够,对异常值的建模同样重要。统计学家们正通过拓展该定理的变体,以应对更复杂的现实问题。

中心极限定理之所以成为科学支柱,正是因为它揭示了世界的内在秩序:当大量独立随机因素叠加时,混沌中自会涌现规律。正如卡内基梅隆大学统计学家拉里·沃瑟曼所言:“没有中心极限定理,统计学领域可能不复存在——它是一切的基础。”

中文翻译:

揭示钟形曲线无处不在的数学原理

无论你看向何处,钟形曲线总在附近。

每次下雨时,在后院放一个量杯,记录雨停时的水位高度:你的数据将符合钟形曲线。记录100个人对罐子里软心豆粒糖数量的猜测,这些猜测会呈现钟形分布。测量足够多的女性身高、男性体重、SAT分数、马拉松完赛时间——你总会得到同样光滑、圆润、两端渐细的驼峰状曲线。

为什么钟形曲线会出现在如此多的数据集中?

答案归根结底在于中心极限定理。这个数学真理如此强大,以至于初次接触者常觉得不可思议,宛如自然的魔术。"中心极限定理非常神奇,因为它如此反直觉且令人惊讶,"华盛顿大学生物统计学家丹妮拉·威滕说。通过它,最随机、最难以想象的混沌也能导向惊人的可预测性。

如今,它是现代实证科学赖以建立的重要支柱。几乎每次科学家使用测量数据来推断世界真相时,中心极限定理都潜藏在方法的某个环节。没有它,科学将很难有把握地对任何事物做出论断。

"我认为如果没有中心极限定理,统计学这个领域就不会存在,"卡内基梅隆大学的统计学家拉里·瓦瑟曼说。"它就是一切。"

从无序中诞生的秩序

试图从随机性中发现规律性的努力源于对赌博的研究,这或许并不令人意外。

在18世纪初伦敦的咖啡馆里,亚伯拉罕·棣莫弗的数学才华显而易见。包括艾萨克·牛顿和埃德蒙·哈雷在内的许多同时代人都认可他的卓越。棣莫弗是英国皇家学会的会士,但他也是一位难民,一位年轻时因躲避迫害新教徒而逃离祖国的法国人。作为外国人,他无法获得与其才华相称的稳定学术职位。因此,为了支付账单,他成为寻求数学优势的赌徒们的顾问。

抛硬币、掷骰子、从一副牌中抽牌都是随机行为,每种结果的可能性均等。棣莫弗意识到,当你将许多随机行为组合起来时,结果会遵循一种可靠的模式。

抛一枚硬币100次,统计正面朝上的次数。结果会在50次左右,但不会非常精确。把这个游戏玩10遍,你可能会得到10个不同的计数。

现在想象玩这个游戏100万次。绝大多数结果将接近50次。你几乎永远不会得到低于10次或高于90次正面。如果你绘制一张图表,显示你看到0到100之间每个数字的次数,你就会看到那个经典的钟形,中心在50处。你玩的次数越多,钟形就会变得越平滑、越清晰。

棣莫弗计算出了这个钟形的确切形状,后来被称为正态分布。这使他无需实际玩游戏就能知道不同结果的可能性。例如,得到45到55次正面朝上的概率约为68%。

棣莫弗以近乎宗教般的虔诚,惊叹于最终能克服一切偏离钟形的"宇宙恒定秩序"。他写道:"随着时间的推移,这些不规则性将与那种源于原始设计的、自然重现的秩序不成比例。"

他利用这些见解在伦敦维持着清贫的生活,写了一本名为《机会学说》的书,成为赌徒的圣经,并在著名的老屠宰场咖啡馆进行非正式的答疑。但即使是棣莫弗也没有意识到他发现的全部范围。直到他去世几十年后,皮埃尔-西蒙·拉普拉斯在1810年继承并发展了这一思想,其全部影响力才被揭示出来。

让我们举一个比抛硬币稍微复杂一点的例子:掷骰子。每次掷一个骰子都有六种可能性相同的结果。如果你反复掷骰子并记录结果,你会得到一个看起来平坦的图表——你看到掷出1的次数很可能与掷出2、4或6的次数差不多。

现在掷那个骰子10次并取平均值。你可能会得到3.5左右的数字。多次重复这个实验,并将所有结果绘制成图。你会得到一个在3.5处达到峰值的钟形曲线,两侧有着精确定义的结构。

这就是中心极限定理的神奇之处。你从一个完全没有结构的结果分布开始——掷出1到6的概率均等。但通过对多次测量结果取平均值,然后一遍又一遍地重复这个过程,你得到了一个精确、可预测的数学结构:钟形曲线。

拉普拉斯将这种结构提炼成一个简单的公式,也就是后来被称为中心极限定理的公式。无论一个随机过程多么不规则,甚至无法建模,许多结果的平均值都具有该定理所描述的分布。"它非常强大,因为它意味着我们实际上不需要关心被平均的那些事物本身是什么分布,"威滕说。"重要的是平均值本身将遵循正态分布。"

无处不在的工具

取平均值似乎是人类才会做的事情,但中心极限定理无形中适用于我们在世界上观察到的各种事物,比如人类身高。"一个人的身高可能取决于父亲的身高、母亲的身高、他们的基因、营养状况,以及所有这些累积起来的小影响,"多伦多大学的统计学家杰弗里·罗森塔尔说。这些影响彼此无关(通常,你父亲的身高与你吃的食物无关)。"这有点像对一堆小影响取平均值,"罗森塔尔说,这就是身高大致遵循正态分布的原因。

这就是为什么各种数据集似乎都自发地符合这种优美的形状。"任何在底层存在平均值的地方,如果它是对足够多事物的平均,那么最终你都会得到一个正态分布,"威滕说。

该定理还赋予统计学家识别异常情况的能力。假设你正在老屠宰场咖啡馆啜饮咖啡,一位顾客递给你一枚硬币,并打赌你在100次抛掷中得不到45次正面。你尝试了,只得到20次正面。你如何判断他是否给了你一枚做了手脚的硬币,这个过程并不像应有的那样随机?多亏了中心极限定理,你知道20次及以下的结果只覆盖了钟形曲线下面积的0.15%,所以一枚公平的硬币产生如此糟糕结果的概率只有0.15%。你几乎肯定是被骗了。

这就是拉普拉斯公式的真正力量。他知道对任何过程取平均值都会得到钟形曲线,这使你可以对该过程做出判断,而无需深入了解其运作机制。

谨慎使用

尽管中心极限定理对现代科学至关重要,但它本身也有局限性。它只在组合许多样本时才有效,而且这些样本需要是独立的。如果它们不是独立的——例如,如果你只在缅因州的一个小镇进行全国总统民调——重复实验不会让你更接近预期的钟形曲线。

有时在科学中,异常值可能比平均值更重要。"‘百年一遇的洪水’突然更频繁地发生了,"威廉姆斯学院的应用统计学家理查德·D·德沃说。"如今,对极端事件的建模可能与对平均值的建模同样重要。"

幸运的是,中心极限定理背后的思想——平均值的威力和可靠性——已被广泛用于扩展统计学的力量。统计学家常常针对他们正在处理的任何具体问题,制定一个版本的中心极限定理。"有太多更复杂的情况,如果你足够聪明,你可以把它写成样本均值加上一些误差,"瓦瑟曼说。在这些情况下,你可以使用该定理的变体来简化问题。

中心极限定理是现代科学的支柱,归根结底,因为它也是我们周围世界的支柱。当我们组合许多独立的测量结果时,我们得到集群。如果我们足够聪明,我们可以利用这些集群来发现关于产生它们的那些过程的一些有趣信息。

英文来源:

The Math That Explains Why Bell Curves Are Everywhere
Introduction
No matter where you look, a bell curve is close by.
Place a measuring cup in your backyard every time it rains and note the height of the water when it stops: Your data will conform to a bell curve. Record 100 people’s guesses at the number of jelly beans in a jar, and they’ll follow a bell curve. Measure enough women’s heights, men’s weights, SAT scores, marathon times — you’ll always get the same smooth, rounded hump that tapers at the edges.
Why does the bell curve pop up in so many datasets?
The answer boils down to the central limit theorem, a mathematical truth so powerful that it often strikes newcomers as impossible, like a magic trick of nature. “The central limit theorem is pretty amazing because it is so unintuitive and surprising,” said Daniela Witten, a biostatistician at the University of Washington. Through it, the most random, unimaginable chaos can lead to striking predictability.
It’s now a pillar on which much of modern empirical science rests. Almost every time a scientist uses measurements to infer something about the world, the central limit theorem is buried somewhere in the methods. Without it, it would be hard for science to say anything, with any confidence, about anything.
“I don’t think the field of statistics would exist without the central limit theorem,” said Larry Wasserman, a statistician at Carnegie Mellon University. “It’s everything.”
Purity From Vice
Perhaps it shouldn’t come as a surprise that the push to find regularity in randomness came from the study of gambling.
In the coffeehouses of early-18th-century London, Abraham de Moivre’s mathematical talents were obvious. Many of his contemporaries, including Isaac Newton and Edmond Halley, recognized his brilliance. De Moivre was a fellow of the Royal Society, but he was also a refugee, a Frenchman who had fled his home country as a young man in the face of anti-Protestant persecution. As a foreigner, he couldn’t secure the kind of steady academic post that would befit his talent. So to help pay his bills, he became a consultant to gamblers who sought a mathematical edge.
Flipping a coin, rolling a die, and drawing a card from a deck are random actions, with every outcome equally likely. What de Moivre realized is that when you combine many random actions, the result follows a reliable pattern.
Flip a coin 100 times and count how often it comes up heads. It’ll be somewhere around 50, but not very precisely. Play this game 10 times, and you may get 10 different counts.
Now imagine playing the game 1 million times. The bulk of the outcomes will be close to 50. You’ll almost never get under 10 heads or over 90. If you make a graph of how many times you see each number between zero and 100, you’ll see that classic bell shape, with 50 at the center. The more times you play the game, the smoother and clearer the bell will become.
De Moivre figured out the exact shape of this bell, which came to be called the normal distribution. It told him, without his having to actually play the game, how likely different outcomes were. For instance, the probability of getting between 45 and 55 heads is about 68%.
De Moivre marveled with religious devotion at the “steadfast order of the universe” that eventually overcame any and all deviations from the bell. “In process of time,” he wrote, “these irregularities will bear no proportion to the recurrency of that order which naturally results from original design.”
He used these insights to sustain a meager life in London, writing a book called The Doctrine of Chances that became a gambler’s bible, and holding informal office hours at the famed Old Slaughter’s Coffee House. But even de Moivre didn’t realize the full scope of his discovery. Only when Pierre-Simon Laplace ran with the idea in 1810, decades after de Moivre’s death, was its full reach uncovered.
Let’s take an example slightly more complex than coin flips: dice rolls. Every roll of a die has six equally likely outcomes. If you repeatedly roll the die and tally the results, you’ll get a chart that looks flat — you’re bound to see about as many rolls of 1 as you do 2 or 4 or 6.
Now roll that die 10 times and take the average. You’re likely to get somewhere around 3.5. Repeat the experiment many more times and graph up all the results. You’ll get a bell curve that peaks at 3.5, with a precisely defined structure on either side.
That’s the magic of the central limit theorem. You started with a distribution of possible outcomes that has no structure at all — equal chances of rolling 1 through 6. But by taking an average of multiple measurements, then repeating that process over and over, you get a precise, predictable, mathematical structure: the bell curve.
Laplace distilled this structure into a simple formula, the one that would later be known as the central limit theorem. No matter how irregular a random process is, even if it’s impossible to model, the average of many outcomes has the distribution that it describes. “It’s really powerful, because it means we don’t need to actually care what is the distribution of the things that got averaged,” Witten said. “All that matters is that the average itself is going to follow a normal distribution.”
An Omnipresent Tool
Averaging might seem like something it takes a human to do, but the central limit theorem applies invisibly to all kinds of things we can observe in the world, like human heights. “Somebody’s height might depend on their dad’s height, and their mom’s height, and their genetics, and their nutrition, and all these little effects that add up,” said Jeffrey Rosenthal, a statistician at the University of Toronto. Those effects are unrelated to each other (generally, your dad’s height has nothing to do with the food you eat). “It’s kind of like averaging a bunch of little effects,” said Rosenthal, which is why height approximately follows a normal distribution.
This is why all kinds of datasets seem to conform to this beautiful shape spontaneously. “Anywhere that there’s an average under the hood, if it’s an average over enough things, then you’re going to end up with a normal distribution,” Witten said.
The theorem also gives statisticians the power to tell when something fishy is happening. Say you’re sipping coffee at Old Slaughter’s when a patron hands you a coin and bets that you can’t get 45 heads in 100 flips. You try, and only get 20. How can you tell whether he gave you a trick coin and the process is not as random as it ought to be? Thanks to the central limit theorem, you know that the numbers up to 20 only cover 0.15% of the bell, so there’s only a 0.15% chance a fair coin would give such a bad result. You’re almost certainly being had.
That’s the true power of Laplace’s formula. He knew that averaging over any process gives you a bell curve, which lets you say something about that process, without knowing anything deeper about how it works.
Handle With Care
Despite its centrality to modern science, the central limit theorem has limits of its own. It only works when you’re combining many samples, and those samples need to be independent. If they’re not — for example, if you only run a national presidential poll in a single small town in Maine — repeating the experiment won’t get you closer to the expected bell curve.
And sometimes in science, the outliers can be more important than the average. “The ‘hundred-year flood’ is suddenly happening more often,” said Richard D. De Veaux, an applied statistician at Williams College. “These days, modeling extreme events is probably as important as modeling the mean.”
Fortunately, the idea behind the central limit theorem — the power and reliability of averages — has been used far and wide to extend the power of statistics. Statisticians often formulate a version of the central limit theorem for whatever specific problem they’re working on. “There’s so many more complicated things where if you’re clever you can write it as a sample mean plus some error,” Wasserman said. In those cases, you can use a variant of the theorem to simplify the problem.
The central limit theorem is a pillar of modern science, ultimately, because it’s a pillar of the world around us. When we combine lots of independent measurements, we get clusters. And if we’re clever enough, we can use those clusters to find out something interesting about the processes that made them.

quanta

文章目录


    扫描二维码,在手机上阅读