Efficiency (statistics)

From Wikipedia, the free encyclopedia

Jump to: navigation, search

In statistics, efficiency is one measure of desirability of an estimator. The efficiency of an unbiased statistic T is defined as


e(T)
=
\frac{1/\mathcal{I}(\theta)}{\mathrm{var}(T)}

where \mathcal{I}(\theta) is the Fisher information of the sample. Thus e(T) is the minimum possible variance for an unbiased estimator divided by its actual variance. The Cramér-Rao bound can be used to prove that e(T) \le 1:


\mathrm{var} \left(\widehat{\theta}\right)
\geq
\frac
 {1}
 {\mathcal{I}(\theta)}

1\geq
\frac
 {1/\mathcal{I}(\theta)}
 {\mathrm{var} \left(\widehat{\theta}\right)}
\to
1 \geq e(T)

Contents

If an unbiased estimator of a parameter \theta \in \Theta attains e(T) = 1 for all values of the parameter, then the estimator is called efficient.

Equivalently, the estimator achieves equality on the Cramér-Rao inequality for all \theta \in \Theta.

An efficient estimator is also a minimum variance unbiased estimator. This is because an efficient estimator maintains equality on the Cramér-Rao inequality for all parameter values, which means it attains the minimum variance for all parameters (the definition of an MVU estimator).

An MVU estimator is not necessarily efficient, because "minimum" does not mean equality holds on the Cramér-Rao inequality.

For some estimators, they can attain efficiency asymptotically and are thus called asymptotically efficient estimators. This can be the case for some maximum likelihood estimators or for any estimators that attain equality of the Cramér-Rao bound asymptotically.

Consider a sample of size N drawn from a normal distribution of mean μ and unit variance (i.e., x[n] \sim \mathcal{N}(\mu, 1)).

The sample mean, \overline{x}, of the sample x[0], x[1], \ldots, x[N-1], defined as


\overline{x} = \frac{1}{N} \sum_{n=0}^{N-1} x[n]

has variance \frac{1}{N}. This is equal to the reciprocal of the Fisher information from the sample (this is clear from the definition) and thus, by the Cramér-Rao inequality, the sample mean is efficient in the sense that its efficiency is unity.

Now consider the sample median. This is an unbiased and consistent estimator for μ. For large N the sample median is approximately normally distributed with mean μ and variance \frac{\pi}{2N} (i.e., x[n] \sim \mathcal{N}\left(\mu, \frac{\pi}{2N}\right)). The efficiency is thus \frac{2}{\pi}, or about 64%. Note that this is the asymptotic efficiency — that is, the efficiency in the limit as sample size N tends to infinity. For finite values of N the efficiency is higher than this (for example, a sample size of 3 gives an efficiency of about 74%).

Many workers prefer the sample median as an estimator of the mean, holding that the loss in efficiency is more than compensated for by its enhanced robustness in terms of its insensitivity to outliers.

If T1 and T2 are estimators for the parameter θ, then T1 is said to dominate T2 if:

  1. its mean squared error (MSE) is smaller for at least some value of θ
  2. the MSE does not exceed that of T2 for any value of θ.

Formally, T1 dominates T2 if


\mathrm{E}
\left[
 (T_1 - \theta)^2
\right]
\leq
\mathrm{E}
\left[
 (T_2-\theta)^2
\right]

holds for all θ, with strict inequality holding somewhere.

The relative efficiency is defined as


e(T_1,T_2)
=
\frac
 {\mathrm{E} \left[ (T_2-\theta)^2 \right]}
 {\mathrm{E} \left[ (T_1-\theta)^2 \right]}

Although e is in general a function of θ, in many cases the dependence drops out; if this is so, e being greater than one would indicate that T1 is preferable, whatever the true value of θ.

Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.