| Fit a Generalized Pareto Distribution |
| See Also |
Where only the maxima of groups of observations have been recorded, use the Fit a Generalized Extreme Value Distribution menu.
See GPARETO procedure for further details about the model, assumptions and graphs produced by this analysis.
This can be a single value (scalar) so that the same threshold is used for all the data, or may be a variate so that the observations above a varying limit is modelled. For example with a series of temperatures over a year, the threshold may be a periodic curve giving the expected value of a quantile over the year.
If you are not certain of the threshold to use, a missing value, *, can be entered, in which case the analysis will produce the Mean Residual life and Stability plots for sigma and eta to help you in your selection of the threshold.
The clusters are defined using the Gaps to use between Clusters parameter. Units above the threshold are grouped together into a single cluster, until the gap between values above the threshold exceeds the gap size specified. For example, the series: 7, 9, 1, 9, 5, 2, 7, 8, 7, 4, 3 using a threshold of 6, has the status of being above the threshold of: 1,1,0,1,0,0,1,1,1,0,0 and the clusters above the threshold would be (7,9), (9), (7,8,7) with a gap size of 1, giving the series 9,9,7 when the maximum of the clusters is taken. If a gap size of >2 was used, the first two clusters would coalesce to give clusters (7,9,9) as the gap between the two 9s is only one unit (< 2), giving the series 9,8 when the maxima are taken.
The value Theta is printed by the analysis, and is the reciprocal of the mean cluster size. A value of theta close to 1 indicates all observations over the threshold occur in isolation, and a small value of theta indicates that the observations exceeding the threshold are strongly clustered together.
| Estimate | maximum likelihood estimation is used to obtain the value of eta which gives the best fit of the data to the model |
| Fix as | allows you to specify a value of eta in the space provided to fix the value of eta used in the model. Setting eta to 0 will force the Gumbel distribution to be fitted |
Return Levels - You provide a return probability and a return level which occurs at this rate in the data is calculated (e.g. 5% = 1/20 so the predicted return level would occur once in every 20 maxima).
Return Probabilities - You provide a return level and the probability that a random observation would exceed this value is calculated.
The Observations in each period value is used to specify that you would like return periods measured on a different scale to the individual units in your series. Thus if you had daily data, and wanted a 1 in 20 year occurrence, you would enter a value of 365 in this edit box to specify that you wanted 365 days grouped together to form the units that the repeats are measured on.
| Run | Run the analysis. |
| Cancel | Close the menu without further changes. |
| Options | Opens a dialog where additional options and settings can be specified for the analysis. |
| Defaults | Set the menu settings back to the default settings. Clicking the right mouse on this button produces a shortcut menu where you can choose to set the menu using the currently stored defaults or the GenStat default settings. |
| Store | Opens a dialog to specify names of structures to store the results from the analysis. The names to save the structures should be supplied before running the analysis. |