<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://wiki.christophchamp.com/index.php?action=history&amp;feed=atom&amp;title=Maximum_likelihood</id>
		<title>Maximum likelihood - Revision history</title>
		<link rel="self" type="application/atom+xml" href="http://wiki.christophchamp.com/index.php?action=history&amp;feed=atom&amp;title=Maximum_likelihood"/>
		<link rel="alternate" type="text/html" href="http://wiki.christophchamp.com/index.php?title=Maximum_likelihood&amp;action=history"/>
		<updated>2026-04-30T00:51:45Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.26.2</generator>

	<entry>
		<id>http://wiki.christophchamp.com/index.php?title=Maximum_likelihood&amp;diff=2797&amp;oldid=prev</id>
		<title>Christoph at 04:49, 13 September 2006</title>
		<link rel="alternate" type="text/html" href="http://wiki.christophchamp.com/index.php?title=Maximum_likelihood&amp;diff=2797&amp;oldid=prev"/>
				<updated>2006-09-13T04:49:47Z</updated>
		
		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class='diff diff-contentalign-left'&gt;
				&lt;col class='diff-marker' /&gt;
				&lt;col class='diff-content' /&gt;
				&lt;col class='diff-marker' /&gt;
				&lt;col class='diff-content' /&gt;
				&lt;tr style='vertical-align: top;' lang='en'&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;Revision as of 04:49, 13 September 2006&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l53&quot; &gt;Line 53:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 53:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [http://en.wikipedia.org/wiki/Maximum_likelihood Wikipedia article on '''Maximum likelihood''']&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [http://en.wikipedia.org/wiki/Maximum_likelihood Wikipedia article on '''Maximum likelihood''']&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category:&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;Academic Research&lt;/del&gt;]]&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;{{Phylogenetics}}&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category:&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;Phylogenetics&lt;/ins&gt;]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category:Statistics]]&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category:Statistics]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Christoph</name></author>	</entry>

	<entry>
		<id>http://wiki.christophchamp.com/index.php?title=Maximum_likelihood&amp;diff=1533&amp;oldid=prev</id>
		<title>Christoph: Started article</title>
		<link rel="alternate" type="text/html" href="http://wiki.christophchamp.com/index.php?title=Maximum_likelihood&amp;diff=1533&amp;oldid=prev"/>
				<updated>2005-12-29T23:01:42Z</updated>
		
		<summary type="html">&lt;p&gt;Started article&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;'''Maximum likelihood estimation (MLE)''' is a popular [[statistics|statistical]] method used to make inferences about parameters of the underlying [[probability distribution]] of a given [[data set]].&lt;br /&gt;
&lt;br /&gt;
The method was pioneered by [[geneticist]] and [[statistician]] [[Ronald Fisher|Sir Ronald A. Fisher]] between 1912 and 1922 (see external resources below for more information on the history of MLE).&lt;br /&gt;
&lt;br /&gt;
== Prerequisites ==&lt;br /&gt;
&lt;br /&gt;
The following discussion assumes that the reader is familiar with basic notions in [[probability theory]] such as [[probability distribution]]s, [[probability density function]]s, [[random variable]]s and [[expected value|expectation]]. It also assumes s/he is familiar with standard basic techniques of maximising [[continuous function|continuous]] [[real number|real-valued]] [[function (mathematics)|function]]s, such as using [[differentiation]] to find a function's [[maxima and minima|maxima]].&lt;br /&gt;
&lt;br /&gt;
==The philosophy of MLE==&lt;br /&gt;
&lt;br /&gt;
Given a probability distribution &amp;lt;math&amp;gt;D&amp;lt;/math&amp;gt;, associated with either a known [[probability density function]] (continuous distribution) or a known [[probability mass function]] (discrete distribution), denoted as &amp;lt;math&amp;gt;f_D&amp;lt;/math&amp;gt;, and distributional parameter &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;, we may draw a sample &amp;lt;math&amp;gt;X_1, X_2, ..., X_n&amp;lt;/math&amp;gt; of &amp;lt;math&amp;gt;n&amp;lt;/math&amp;gt; values from this distribution and then using &amp;lt;math&amp;gt;f_D&amp;lt;/math&amp;gt; we may compute the probability associated with our observed data:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;\mathbb{P}(x_1,x_2,\dots,x_n) = f_D(x_1,\dots,x_n \mid \theta)&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
However, it may be that we don't know the value of the parameter &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt; despite knowing (or believing) that our data comes from the distribution &amp;lt;math&amp;gt;D&amp;lt;/math&amp;gt;. How should we estimate &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;? It is a sensible idea to draw a sample of &amp;lt;math&amp;gt;n&amp;lt;/math&amp;gt; values &amp;lt;math&amp;gt;X_1, X_2, ... X_n&amp;lt;/math&amp;gt; and use this data to help us make an estimate.&lt;br /&gt;
&lt;br /&gt;
Once we have our sample &amp;lt;math&amp;gt;X_1, X_2, ..., X_n&amp;lt;/math&amp;gt;, we may seek an estimate of the value of &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt; from that sample. MLE seeks the most likely value of the parameter &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt; (i.e., we maximise the ''likelihood'' of the observed data set over all possible values of &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;). This is in contrast to seeking other estimators, such as an [[unbiased estimator]] of &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;, which may not necessarily yield the most likely value of &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt; but which will yield a value that (on average) will neither tend to over-estimate nor under-estimate the true value of &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
To implement the MLE method mathematically, we define the &amp;lt;i&amp;gt;likelihood&amp;lt;/i&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;\mbox{lik}(\theta) = f_D(x_1,\dots,x_n \mid \theta)&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and maximise this [[function (mathematics)|function]] over all possible values of the parameter &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;. The value &amp;lt;math&amp;gt;\hat{\theta}&amp;lt;/math&amp;gt; which maximises the likelihood is known as the '''maximum likelihood estimator''' (MLE) for &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
=== Notes ===&lt;br /&gt;
*The likelihood is a function of &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt; for fixed values of &amp;lt;math&amp;gt;x_1,x_2,\ldots,x_n&amp;lt;/math&amp;gt;.&lt;br /&gt;
*The maximum likelihood estimator may not be unique, or indeed may not even exist.&lt;br /&gt;
&lt;br /&gt;
== Properties ==&lt;br /&gt;
&lt;br /&gt;
=== Functional invariance ===&lt;br /&gt;
If &amp;lt;math&amp;gt;\widehat{\theta}&amp;lt;/math&amp;gt; is the maximum likelihood estimator (MLE) for &amp;lt;math&amp;gt;\theta&amp;lt;/math&amp;gt;, then the MLE for &amp;lt;math&amp;gt;\alpha = g(\theta)&amp;lt;/math&amp;gt; is &amp;lt;math&amp;gt;\widehat{\alpha} = g(\widehat{\theta})&amp;lt;/math&amp;gt;.  The function ''g'' need not be one-to-one. For detail, please refer to the proof of Theorem 7.2.10 of ''Statistical Inference'' by George Casella and Roger L. Berger.&lt;br /&gt;
&lt;br /&gt;
=== Asymptotic behaviour ===&lt;br /&gt;
Maximum likelihood estimators achieve minimum variance (as given by the [[Cramer-Rao lower bound]]) in the limit as the sample size tends to infinity. When the MLE is unbiased, we may equivalently say that it has minimum [[mean squared error]] in the limit.&lt;br /&gt;
&lt;br /&gt;
For independent observations, the maximum likelihood estimator often follows an asymptotic [[normal distribution]].&lt;br /&gt;
&lt;br /&gt;
=== Bias ===&lt;br /&gt;
The [[unbiased estimator|bias]] of maximum-likelihood estimators can be substantial. Consider a case where ''n'' tickets numbered from 1 to ''n'' are placed in a box and one is selected at random (''see [[uniform distribution]]'').  If ''n'' is unknown, then the maximum-likelihood estimator of ''n'' is the value on the drawn ticket, even though the expectation is only &amp;lt;math&amp;gt;(n+1)/2&amp;lt;/math&amp;gt;. In estimating the highest number ''n'', we can only be certain that it is greater than or equal to the drawn ticket number.&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
* The [[mean squared error]] is a measure of how 'good' an estimator of a distributional parameter is (be it the maximum likelihood estimator or some other estimator).&lt;br /&gt;
&lt;br /&gt;
* The article on the [[Rao-Blackwell theorem]] for a discussion on finding the best possible unbiased estimator (in the sense of having minimal [[mean squared error]]) by a process called Rao-Blackwellisation. The MLE is often a good starting place for the process.&lt;br /&gt;
&lt;br /&gt;
* The reader may be intrigued to learn that the MLE (if it exists) will always be a function of a [[sufficient statistic]] for the parameter in question.&lt;br /&gt;
&lt;br /&gt;
== External resources ==&lt;br /&gt;
* [http://projecteuclid.org/Dienst/UI/1.0/Summarize/euclid.ss/1030037906 A paper detailing the history of maximum likelihood, written by John Aldrich]&lt;br /&gt;
&lt;br /&gt;
== External links ==&lt;br /&gt;
* [http://en.wikipedia.org/wiki/Maximum_likelihood Wikipedia article on '''Maximum likelihood''']&lt;br /&gt;
&lt;br /&gt;
[[Category:Academic Research]]&lt;br /&gt;
[[Category:Statistics]]&lt;/div&gt;</summary>
		<author><name>Christoph</name></author>	</entry>

	</feed>