Skewness

From Wikipedia, the free encyclopedia

(Redirected from Skewed distribution)
Jump to: navigation, search
Example of experimental data with non-zero skewness (gravitropic response of wheat coleoptiles, 1,790)
Example of experimental data with non-zero skewness (gravitropic response of wheat coleoptiles, 1,790)

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable.

Contents

Consider the distribution in the figure. The bars on the right side of the distribution taper differently than the bars on the left side. These tapering sides are called tails (or snakes), and they provide a visual means for determining which of the two kinds of skewness a distribution has:

  1. positive skew: The right tail is longer; the mass of the distribution is concentrated on the left of the figure. The distribution is said to be right-skewed.
  2. negative skew: The left tail is longer; the mass of the distribution is concentrated on the right of the figure. The distribution is said to be left-skewed.

Image:Skew.png

Skewness, the third standardized moment, is written as γ1 and defined as

\gamma_1 = \frac{\mu_3}{\sigma^3}, \!

where μ3 is the third moment about the mean and σ is the standard deviation. Equivalently, skewness can be defined as the ratio of the third cumulant κ3 and the third power of the square root of the second cumulant κ2:

\gamma_1 = \frac{\kappa_3}{\kappa_2^{3/2}}. \!

This is analogous to the definition of kurtosis, which is expressed as the fourth cumulant divided by the fourth power of the square root of the second cumulant.

For a sample of n values the sample skewness is

g_1 = \frac{m_3}{m_2^{3/2}} = \frac{\sqrt{n\,}\sum_{i=1}^n (x_i-\bar{x})^3}{\left(\sum_{i=1}^n (x_i-\bar{x})^2\right)^{3/2}}, \!

where xi is the ith value, \bar{x} is the sample mean, m3 is the sample third central moment, and m2 is the sample variance.

Given samples from a population, the equation for the sample skewness g1 above is a biased estimator of the population skewness. The usual estimator of skewness is

G_1 = \frac{k_3}{k_2^{3/2}}
= \frac{\sqrt{n\,(n-1)}}{n-2}\; g_1, \!

where k3 is the unique symmetric unbiased estimator of the third cumulant and k2 is the symmetric unbiased estimator of the second cumulant. Unfortunately G1 is, nevertheless, generally biased. Its expected value can even have the opposite sign from the true skewness.

The skewness of a random variable X is sometimes denoted Skew[X]. If Y is the sum of n independent random variables, all with the same distribution as X, then it can be shown that Skew[Y] = Skew[X] / √n.

Skewness has benefits in many areas. Many simplistic models assume normal distribution i.e. data is symmetric about the mean. The normal distribution has a skewness of zero. But in reality, data points are not perfectly symmetric. So, an understanding of the skewness of the dataset indicates whether deviations from the mean are going to be positive or negative.

Karl Pearson suggested two simpler calculations as a measure of skewness:

There is no guarantee that these will be the same sign as each other or as the ordinary definition of skewness.

Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.