Zeta distribution

From Wikipedia, the free encyclopedia

Zeta
Probability mass function: plot of the zeta PMF on a log-log scale. (The function is defined only at integer values of k; the connecting lines do not indicate continuity.)
Cumulative distribution function: plot of the zeta CDF.
Parameters s\in(1,\infty)
Support k \in \{1,2,\ldots\}
Probability mass function (pmf) \frac{1/k^s}{\zeta(s)}
Cumulative distribution function (cdf) \frac{H_{k,s}}{\zeta(s)}
Mean \frac{\zeta(s-1)}{\zeta(s)}~\textrm{for}~s>2
Median
Mode 1\,
Variance \frac{\zeta(s)\zeta(s-2) - \zeta(s-1)^2}{\zeta(s)^2}~\textrm{for}~s>3
Skewness
Excess kurtosis
Entropy \sum_{k=1}^\infty\frac{1/k^s}{\zeta(s)}\log (k^s \zeta(s))
Moment-generating function (mgf) \frac{\operatorname{Li}_s(e^t)}{\zeta(s)}
Characteristic function \frac{\operatorname{Li}_s(e^{it})}{\zeta(s)}

In probability theory and statistics, the zeta distribution is a discrete probability distribution. If X is a zeta-distributed random variable with parameter s, then the probability that X takes the integer value k is given by the probability mass function

f_s(k)=k^{-s}/\zeta(s)\,

where ζ(s) is the Riemann zeta function (which is undefined for s = 1).
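
As a rough numerical illustration of this definition, the following Python sketch evaluates the pmf using only the standard library. The helper names zeta and zeta_pmf, and the cutoff n_terms, are assumptions made for this example and do not come from the article; the zeta function is approximated by a partial sum plus the leading Euler-Maclaurin tail terms, which should be adequate for real s > 1 but is not a general-purpose implementation.

```python
import math


def zeta(s, n_terms=1000):
    """Approximate the Riemann zeta function for real s > 1.

    Sums the first n_terms - 1 terms directly and estimates the
    remaining tail with the leading Euler-Maclaurin terms.
    """
    if s <= 1:
        raise ValueError("the series for zeta(s) requires s > 1")
    n = n_terms
    head = sum(k ** -s for k in range(1, n))
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s + s * n ** (-s - 1) / 12
    return head + tail


def zeta_pmf(k, s):
    """Probability that a zeta-distributed X with parameter s equals k."""
    if k < 1 or k != int(k):
        return 0.0  # the support is the positive integers
    return k ** -s / zeta(s)


if __name__ == "__main__":
    s = 2.0
    for k in range(1, 6):
        print(k, zeta_pmf(k, s))
```

For s = 2 the normalizing constant is ζ(2) = π²/6, so the value at k = 1 should come out close to 6/π².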

The multiplicities of distinct prime factors of X are independent random variables.
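
One way to see this, sketched here as a worked derivation rather than a full proof, is to write k by its prime factorization and apply the Euler product for ζ(s), which holds for s > 1. The symbol a_p (the multiplicity of the prime p in k) is notation introduced for this illustration.

k=\prod_p p^{a_p}, \qquad \frac{1}{\zeta(s)}=\prod_p\left(1-p^{-s}\right)

f_s(k)=\frac{k^{-s}}{\zeta(s)}=\prod_p\left(1-p^{-s}\right)p^{-a_p s}

The pmf therefore factors over the primes, and each factor is a geometric pmf in a_p: the multiplicity of p in X equals a with probability (1-p^{-s})p^{-as}, independently across primes.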

The zeta distribution is equivalent to the Zipf distribution for infinite N. Indeed the terms "Zipf distribution" and the "zeta distribution" are often used interchangeably.


The nth raw moment is defined as the expected value of X^n:

m_n = E(X^n) = \frac{1}{\zeta(s)}\sum_{k=1}^\infty \frac{1}{k^{s-n}}

The series on the right is just a series representation of the Riemann zeta function, but it only converges for values of s-n that are greater than unity. Thus:

m_n =\left\{ \begin{matrix} \zeta(s-n)/\zeta(s) & \textrm{for}~n < s-1 \\ \infty & \textrm{for}~n \ge s-1 \end{matrix} \right.

Note that the ratio of the zeta functions is well defined even for n > s-1, because the series representation of the zeta function can be analytically continued. This does not change the fact that the moments are specified by the series itself, and are therefore infinite for n \ge s-1.
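
The piecewise rule for the moments can be sketched in Python as follows. The function names (zeta, raw_moment, mean, variance) are hypothetical helpers chosen for this example, and the zeta approximation is the same partial-sum-plus-tail estimate assumed above, valid only for real arguments greater than 1. The code deliberately returns infinity for n ≥ s − 1 instead of evaluating the analytically continued zeta function.

```python
import math


def zeta(s, n_terms=1000):
    """Approximate the Riemann zeta function for real s > 1."""
    if s <= 1:
        raise ValueError("the series for zeta(s) requires s > 1")
    n = n_terms
    head = sum(k ** -s for k in range(1, n))
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s + s * n ** (-s - 1) / 12
    return head + tail


def raw_moment(n, s):
    """n-th raw moment E(X^n): zeta(s - n) / zeta(s) if n < s - 1, else infinite."""
    if n < s - 1:
        return zeta(s - n) / zeta(s)
    return math.inf  # the defining series diverges


def mean(s):
    """Mean of the distribution; finite only for s > 2."""
    return raw_moment(1, s)


def variance(s):
    """Variance m_2 - m_1^2; finite only for s > 3, reported as infinite otherwise."""
    m1, m2 = raw_moment(1, s), raw_moment(2, s)
    if math.isinf(m2):
        return math.inf
    return m2 - m1 ** 2


if __name__ == "__main__":
    for s in (2.5, 3.5, 4.0):
        print(s, mean(s), variance(s))
```

With these definitions the mean is finite only for s > 2 and the variance only for s > 3, matching the conditions stated in the summary table.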

The moment generating function is defined as:

M(t;s) = E(e^{tX}) = \frac{1}{\zeta(s)} \sum_{k=1}^\infty \frac{e^{tk}}{k^s}

The series is just the definition of the polylogarithm, valid for e^t < 1, so that:

M(t;s) = \frac{\operatorname{Li}_s(e^t)}{\zeta(s)} for t < 0
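
For t < 0 the defining series converges geometrically, so it can be summed directly. The sketch below does this in plain Python; zeta_mgf, tol and max_terms are names assumed for the illustration, and the zeta helper is the same approximate routine for real s > 1 used in this article's other sketches. It is not a polylogarithm implementation and makes no attempt to handle t ≥ 0.

```python
import math


def zeta(s, n_terms=1000):
    """Approximate the Riemann zeta function for real s > 1."""
    if s <= 1:
        raise ValueError("the series for zeta(s) requires s > 1")
    n = n_terms
    head = sum(k ** -s for k in range(1, n))
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s + s * n ** (-s - 1) / 12
    return head + tail


def zeta_mgf(t, s, tol=1e-16, max_terms=10 ** 6):
    """Moment generating function M(t; s) for t < 0, by direct summation."""
    if t >= 0:
        raise ValueError("this sketch sums the series only for t < 0")
    total = 0.0
    for k in range(1, max_terms + 1):
        term = math.exp(t * k) / k ** s
        total += term
        if term < tol:  # later terms are smaller still, so stop here
            break
    return total / zeta(s)


if __name__ == "__main__":
    print(zeta_mgf(-0.5, 3.0))
```

Because X ≥ 1, the result for t < 0 must lie between e^t/ζ(s) (the contribution of k = 1 alone) and e^t, which gives a quick sanity check on the output.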

The Taylor series expansion of this function will not necessarily yield the moments of the distribution. The Taylor series using the moments as they usually occur in the moment generating function yields:

\sum_{n=0}^\infty \frac{m_n t^n}{n!}

which obviously is not well defined for any finite value of s since the moments become infinite for large n. If we use the analytically continued terms instead of the moments themselves, we obtain from a series representation of the polylogarithm

\frac{1}{\zeta(s)}\sum_{n=0,n\ne s-1}^\infty \frac{\zeta(s-n)}{n!}\,t^n=\frac{\operatorname{Li}_s(e^t)-\Phi(s,t)}{\zeta(s)}

for | t | < 2π. Φ(s,t) is given by:

\Phi(s,t)=\Gamma(1-s)(-t)^{s-1}\, for s\ne 1,2,3\ldots
\Phi(s,t)=\frac{t^{s-1}}{(s-1)!}\left[H_{s-1}-\ln(-t)\right] for s=2,3,4\ldots
\Phi(s,t)=-\ln(-t)\, for s=1\,

where H_{s-1} is the (s-1)th harmonic number.

ζ(1) is infinite, since it is the harmonic series, so the case s = 1 is not meaningful. However, if A is any set of positive integers that has a density, i.e. if

\lim_{n\rightarrow\infty}\frac{N(A,n)}{n}

exists, where N(A, n) is the number of members of A less than or equal to n, then

\lim_{s\rightarrow 1+}P(X\in A)\,

is equal to that density.
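
As a simple worked example (the choice of set is an assumption made for illustration), take A to be the multiples of a fixed positive integer m. Its density and its probability under the zeta distribution can both be computed in closed form:

\frac{N(A,n)}{n}=\frac{\lfloor n/m\rfloor}{n}\rightarrow\frac{1}{m}~\textrm{as}~n\rightarrow\infty

P(X\in A)=\frac{1}{\zeta(s)}\sum_{j=1}^\infty\frac{1}{(jm)^s}=m^{-s}\rightarrow\frac{1}{m}~\textrm{as}~s\rightarrow 1+

so the two limits agree, as claimed.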

The latter limit can also exist in some cases in which A does not have a density. For example, if A is the set of all positive integers whose first digit is d, then A has no density, but nonetheless the second limit given above exists and is proportional to

log(d + 1) − log(d),

similar to Benford's law.
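
The proportionality constant can be recovered by a short check, assuming base-10 digits: the nine sets for d = 1, ..., 9 partition the positive integers, so the nine limits must sum to 1.

\sum_{d=1}^{9}\left[\log(d+1)-\log(d)\right]=\log 10

\lim_{s\rightarrow 1+}P(X\in A)=\frac{\log(d+1)-\log(d)}{\log 10}=\log_{10}\left(1+\frac{1}{d}\right)

This is the first-digit probability given by Benford's law.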


  • Some remarks on the Riemann zeta distribution by Allan Gut. What Gut calls the Riemann zeta distribution is actually the probability distribution of −log X, where X is a random variable with what this article calls the zeta distribution.