Statistical Estimation

Estimation is the process of determining a numerical value for one or more parameters of a population from a set of data samples. The Statistics package provides a framework for performing maximum likelihood estimation and other techniques used in statistical estimation.

1 Maximum Likelihood Estimation

Maximum likelihood estimation is built on the likelihood function: the probability (or probability density) of a set of recorded observations, viewed as a function of one or more unknown parameters. The parameter value at which this function attains its maximum is of particular importance, as it is the "most likely" value for that parameter given the data.
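To make the definition concrete, the following sketch writes out the likelihood for $n$ independent observations and a single unknown parameter $c$. The symbols $f$, $x_{i}$, $n$, $\mathcal{L}$, $\ell$ and $\hat{c}$ are notation introduced here for illustration; they are not names used by the Statistics package.

$\mathcal{L}(c) = \prod_{i=1}^{n} f(x_{i}; c)$

$\ell(c) = \ln \mathcal{L}(c) = \sum_{i=1}^{n} \ln f(x_{i}; c)$

$\hat{c} = \arg\max_{c} \mathcal{L}(c) = \arg\max_{c} \ell(c)$

Because the logarithm is increasing, maximizing $\ell$ gives the same estimate as maximizing $\mathcal{L}$, and the sum is usually easier to differentiate than the product.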

The following example demonstrates a method for finding the maximum likelihood estimate of the shape parameter c of a Pareto distribution whose scale parameter is known to be 5.

>	$restart$ $with (Statistics):$

Generate a sample from the Pareto distribution that will act as the provided data.

>	$S := Sample (Pareto (5, 7), 1000):$

Compute the likelihood function of a Pareto distribution with unknown shape parameter c, assuming a sample of size one.

>	$Likelihood (Pareto (5, c), A, samplesize = 1)$

$\begin{cases} 0 & A_{1} < 5 \\ c 5^{c} A_{1}^{(- c - 1)} & 5 \leq A_{1} \end{cases}$

(1.1)

Compute the log likelihood function, assuming a sample of size four.

>	$LogLikelihood (Pareto (5, c), A, samplesize = 4)$

$4 \ln (5) c + \ln (c A_{1}^{(- c - 1)}) + \ln (c A_{2}^{(- c - 1)}) + \ln (c A_{3}^{(- c - 1)}) + \ln (c A_{4}^{(- c - 1)})$

(1.2)

Compute the score (the derivative of the log likelihood function with respect to c), under the same assumptions.

>	$Score (Pareto (5, c), A, samplesize = 4)$

$4 \ln (5) + \frac{4}{c} - \ln (A_{1}) - \ln (A_{2}) - \ln (A_{3}) - \ln (A_{4})$

(1.3)
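As a check on (1.2) and (1.3), the same quantities can be derived by hand for a general sample size. This is a sketch that assumes every observation satisfies $A_{i} \geq 5$, so that the nonzero branch of (1.1) applies; the symbols $n$ and $\ell$ are notation introduced here.

$\ell(c) = \sum_{i=1}^{n} \ln (c 5^{c} A_{i}^{(- c - 1)}) = n \ln (c) + n c \ln (5) - (c + 1) \sum_{i=1}^{n} \ln (A_{i})$

$\frac{d \ell}{d c} = \frac{n}{c} + n \ln (5) - \sum_{i=1}^{n} \ln (A_{i})$

Setting $n = 4$ in the second line gives exactly the expression in (1.3).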

Compute the score again, this time using the generated sample data in place of the symbolic observations.

>	$F := Score (Pareto (5, c), S)$

$F := 1000 \ln (5) - 1744.554810 + \frac{1000}{c}$

(1.4)

Solve for c, such that the score is equal to zero.

>	$fsolve (F = 0)$

$7.400998824$

(1.5)
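Because the score in (1.4) depends on c only through the term $1000/c$, its root can also be written in closed form. The following worked step is a sketch using the digits printed in (1.4); the symbol $\hat{c}$ is notation introduced here.

$\hat{c} = \frac{1000}{1744.554810 - 1000 \ln (5)}$

$1000 \ln (5) \approx 1609.437912, \quad \hat{c} \approx \frac{1000}{135.116898} \approx 7.4010$

This agrees with the value returned by fsolve to the digits shown.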

Calculate the maximum likelihood estimate directly (this should give the same result, up to numerical tolerance).

>	$M := MaximumLikelihoodEstimate (Pareto (5, c), S)$

$M := 7.400998143$

(1.6)

Compute the likelihood ratio statistic.

>	$L := LikelihoodRatioStatistic (Pareto (5, c), S)$

$L := 2003.229898 - 2000. \ln (c) + 270.233800 c$

(1.7)
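The expression in (1.7) is consistent with the usual definition of the likelihood ratio statistic as twice the drop in log likelihood from its maximum. The following is a sketch under that assumption, not a statement of how the command is implemented; $\ell$ denotes the log likelihood of the sample and $\hat{c}$ the maximum likelihood estimate, both notation introduced here, and the digits come from (1.4).

$L(c) = 2 [\ell(\hat{c}) - \ell(c)]$

$L(c) = 2000 \ln (\hat{c}) - 2000 - 2000 \ln (c) + 2 (1744.554810 - 1000 \ln (5)) c$

With $\hat{c} \approx 7.401$, the constant part is approximately 2003.23 and the coefficient of c is approximately 270.23, in agreement with (1.7).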

Under the usual regularity conditions, the likelihood ratio statistic is asymptotically distributed as a ChiSquare random variable with 1 degree of freedom. Because the statistic is zero at the maximum likelihood estimate and grows as c moves away from it, the values of c at which it equals the 0.95 quantile of that distribution bound an approximate 95% confidence interval for c.

>	$θ := Quantile (ChiSquare (1), 0.95)$

$θ := 3.841456066$

(1.8)

>	$CI := fsolve (L = θ, c = - \infty .. M) .. fsolve (L = θ, c = M .. \infty)$

$CI := 6.951716117 .. 7.869234199$

(1.9)
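The interval in (1.9) can be read as the set of parameter values whose likelihood ratio statistic does not exceed the quantile in (1.8). The following sketch, based on the printed form of (1.7), shows why there is exactly one endpoint on each side of M.

$CI = \{ c > 0 : L(c) \leq θ \}, \quad θ \approx 3.8415$

$\frac{d L}{d c} = - \frac{2000}{c} + 270.233800, \quad \frac{d^{2} L}{d c^{2}} = \frac{2000}{c^{2}} > 0$

Since the second derivative is positive, L is convex in c, with its minimum at $c = 2000 / 270.233800 \approx 7.401$. The equation $L = θ$ therefore has one root below M and one above it, which is why fsolve is called once on each side.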

2 Method of Moments

The method of moments estimates the parameters of a probability distribution by matching the moments of a sample to the corresponding moments of the distribution. To apply the method of moments, follow these steps.

1. Calculate the first n moments of the fitting distribution, where n is the number of free indeterminates in the fitting distribution. For example, to perform a fit using the Normal distribution, which has parameters mu and sigma, calculate the first 2 moments.

2. Calculate the first n moments of the data sample.

3. Equate the moments from steps 1 and 2 and solve the resulting equations for the indeterminates.
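The three steps above can be sketched as a system of equations. Here $N$ is the number of data points, $x_{i}$ are the data values, and $m_{k}$ is the k-th sample moment about the origin; all of these symbols are notation introduced for illustration.

$E [X^{k}] = m_{k} = \frac{1}{N} \sum_{i=1}^{N} x_{i}^{k}, \quad k = 1, \ldots, n$

For the Normal distribution mentioned in step 1, the first two moments are $E [X] = \mu$ and $E [X^{2}] = \mu^{2} + \sigma^{2}$, so the system solves to

$\hat{\mu} = m_{1}, \quad \hat{\sigma}^{2} = m_{2} - m_{1}^{2}$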

Assuming that the equations created in step 3 can be solved, these steps provide an estimate of the values of the parameters of the fitting distribution. As an example, consider estimating the parameters $ν$ and $ω$ of a Beta distribution:

>	$restart$ $with (Statistics):$

Generate a sample from a Beta distribution to act as the provided data.

>	$S := Sample ('Beta' (5, 7), 10000):$

1. Calculate the first and second moments of the Beta distribution with free parameters $ν$ and $ω$ .

>	$DistMoments := [Moment ('Beta' (ν, ω), 1), Moment ('Beta' (ν, ω), 2)]$

$DistMoments := [\frac{ν}{ν + ω}, \frac{ν (ν + 1)}{(ν + ω) (ν + ω + 1)}]$

(2.1)

2. Calculate the first and second moments of the provided data.

>	$DataMoments := [Moment (S, 1), Moment (S, 2)]$

$DataMoments := [0.4164527455, 0.1921293371]$

(2.2)

3. Equate the moments and solve for $ν$ and $ω$ .

>	$solve (Equate (DistMoments, DataMoments))$

$\{ν = 4.996676373, ω = 7.001506919\}$

(2.3)
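For the Beta distribution the system in step 3 can also be solved by hand, which gives a way to check (2.3). This is a sketch; $m_{1}$ and $m_{2}$ stand for the two entries of DataMoments, and $s$ is a helper symbol introduced here for $ν + ω$.

$m_{2} = m_{1} \frac{m_{1} s + 1}{s + 1} \quad \Rightarrow \quad s = \frac{m_{1} - m_{2}}{m_{2} - m_{1}^{2}}$

$ν = m_{1} s, \quad ω = (1 - m_{1}) s$

Substituting the values from (2.2) gives $s \approx 11.9982$, and hence $ν \approx 4.9967$ and $ω \approx 7.0015$, in agreement with (2.3).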
