Extrapolation

From Wikipedia, the free encyclopedia

(Redirected from Extrapolate)
Jump to: navigation, search

In mathematics, extrapolation is the process of constructing new data points outside a discrete set of known data points. It is similar to the process of interpolation, which constructs new points between known points, but its results are often less meaningful, and are subject to greater uncertainty.

Example illustration of the extrapolation problem, consisting of assigning a meaningful value at the blue box, at x = 7, given the red data points.
Example illustration of the extrapolation problem, consisting of assigning a meaningful value at the blue box, at x = 7, given the red data points.

Contents

A sound choice of which extrapolation method to apply relies on a priori knowledge of the process that created the existing data points. Crucial questions are for example if the data can be assumed to be continuous, smooth, possibly periodic etc.

Linear extrapolation means creating a tangent line at the end of the known data and extending it beyond that limit. Linear extrapolation will only provide good results when used to extend the graph of an approximately linear function or not too far beyond the known data.

If the two data points nearest the point x * to be extrapolated are (xk − 1,yk − 1) and (xk,yk), linear extrapolation gives the function (identical to linear interpolation if xk − 1 < x * < xk),

y(x_*) = y_{k-1} + \frac{x_* - x_{k-1}}{x_{k}-x_{k-1}}(y_{k} - y_{k-1}).

It is possible to include more than two points, and averaging the slope of the linear interpolant, by regression-like techniques, on the data points chosen to be included. This is similar to linear prediction.

A polynomial curve can be created through the entire known data or just near the end. The resulting curve can then be extended beyond the end of the known data. Polynomial extrapolation is typically done by means of Lagrange interpolation or using Newton's method of finite differences to create a Newton series that fits the data. The resulting polynomial may be used to extrapolate the data.

High order polynomial extrapolation must be used with due care. For the example data set and problem in the figure above, anything above order 1 (linear extrapolation) will possibly yield unusable values, an error estimate of the extrapolated value will grow with the degree of the polynomial extrapolation. This is related to Runge's phenomenon.

A conic section can be created using five points near the end of the known data. If the conic section created is an ellipse or circle, it will loop back and rejoin itself. A parabolic or hyperbolic curve will not rejoin itself, but may curve back relative to the X-axis. This type of extrapolation could be done with a conic sections template (on paper) or with a computer.

A method of extrapolation suitable for any distribution that has a tendency to be exponential but with accelerating or decelerating factors is french curve extrapolation[1]. This method has been used successfully in providing forecast projections of the growth of HIV/AIDS in the UK since 1987 and variant CJD in the UK for a number of years.

Typically, the quality of a particular method of extrapolation is limited by the assumptions about the function made by the method. If the method assumes the data are smooth, then a non-smooth function will be poorly extrapolated.

Even for proper assumptions about the function, the extrapolation can diverge strongly from the function. The classic example is truncated power series representations of sin(x) and related trigonometric functions. For instance, taking only data from near the x = 0, we may estimate that the function behaves as sin(x) ~ x. In the neighborhood of x = 0, this is an excellent estimate. Away from x = 0 however, the extrapolation moves arbitrarily away from the x-axis while sin(x) remains in the interval [−1,1]. I.e., the error increases without bound.

Taking more terms in the power series of sin(x) around x = 0 will produce better agreement over a larger interval near x = 0, but will produce extrapolations that eventually diverge away from the x-axis even faster than the linear approximation.

This divergence is a specific property of extrapolation methods and is only circumvented when the functional forms assumed by the extrapolation method (inadvertently or intentionally due to additional information) accurately represent the nature of the function being extrapolated. For particular problems, this additional information may be available, but in the general case, it is impossible to satisfy all possible function behaviors with a workably small set of potential behaviors.

The extent to which an extrapolation is accurate is known as the "prediction confidence interval", and is usually expressed as an upper and lower boundary within which the prediction is expected to be accurate 19 times out of 20 (a 95% confidence interval).

An extrapolation's reliability is indicated by its prediction confidence interval, which often diverges to impossible values. Extrapolating into this range will probably give misleading results.

For example, the death rate from a new disease may increase dramatically early on. If the graph of the death rate is then extrapolated linearly, it might appear that the entire human population will be dead from the disease in a short number of years. In reality, the death rate from a newly discovered disease may fall as the susceptible die off and the remainder alter their behavior to avoid contracting the disease. Those who remain may also have a natural immunity to the disease or an acquired immunity due to exposure. Medical treatments affecting the spread and death rate of the disease may be developed, as well. A simple linear extrapolation assumes that there is an infinite population, and if the trend is growing faster than the population it will predict that more will have died than have ever been alive.

Similarly, if the amount of water in a lake is decreasing over time, a linear extrapolation will predict that there will be a negative amount of water shortly after the water is gone. This is an absurd result which indicates that the extrapolation is being performed in the wrong domain.

Selection of an improper domain, such as an infinite domain when all possible values are finite, or a negative domain for nonnegative values, is the second most common extrapolation error after failure to include a prediction confidence interval. In some cases, it is better to extrapolate some sort of transformed variable (taking the logarithm or the logit transformation for example), so that the untransformed variable is guaranteed to stay in a particular range.

In complex analysis, a problem of extrapolation may be converted into an interpolation problem by the change of variable \hat{z} = 1/z. This transform exchanges the part of the complex plane inside the unit circle with the part of the complex plane outside of the unit circle. In particular, the compactification point at infinity is mapped to the origin and vice versa. Care must be taken with this transform however, since the original function may have had "features", for example poles and other singularities, at infinity that were not evident from the sampled data.

Another problem of extrapolation is loosely related to the problem of analytic continuation, where (typically) a power series representation of a function is expanded at one of its points of convergence to produce a power series with a larger radius of convergence. In effect, a set of data from a small region is used to extrapolate a function onto a larger region.

Again, analytic continuation can be thwarted by function features that were not evident from the initial data.

Also, one may use sequence transformations like Padé approximants and Levin-type sequence transformations as extrapolation methods that lead to a summation of power series that are divergent outside the original radius of convergence. In this case, one often obtains rational approximants.

  • Extrapolation Methods. Theory and Practice by C. Brezinski and M. Redivo Zaglia, North-Holland, 1991.

Look up extrapolation in Wiktionary, the free dictionary.
Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.