HLIST directive

Lists the data matrix in abbreviated form.


Options

GROUPS = factor
Defines groupings of the units; used to split the printed table at appropriate places and to label the groups; default *

UNITS = text or variate
Names for the rows (i.e. units) of the table; default *


Parameters

DATA = variates
The data values

TEST = strings
Test type, defining how each variate is treated in the calculation of the similarity between each unit (simplematching, jaccard, russellrao, dice, antidice, sneathsokal, rogerstanimoto, cityblock, manhattan, ecological, euclidean, pythagorean, minkowski, divergence, canberra, braycurtis, soergel); default * ignores that variate

RANGE = scalars
Range of possible values of each variate; if omitted, the observed range is taken


Description

HLIST lists the values of the data matrix in a condensed form, either in their original order or, more usefully, in the order determined by a cluster analysis (see HCLUSTER). This representation can be very helpful for revealing patterns in the data, associated with clusters, or for an initial scan of the data to pick out interesting features of the variates.

   The DATA parameter specifies a list of variates, all of which must be of the same length. The TEST parameter specifies a list of strings, one for each variate in the DATA parameter list, to define the "type" of each variate; these are used by FSIMILARITY to determine how differences in variate values for each unit contribute to the overall similarity between units. However, HLIST distinguishes only between qualitative variates (settings simplematching - rogerstanimoto) and quantitative variates (other settings). The values of qualitative variates are printed directly. If the range of a quantitative variate is greater than 10, the printed values are scaled to lie in the range 0 to 10. This scaling is done by subtracting the minimum value from the variate, dividing by the range and then multiplying by 10. If the range is less than 10, the values are printed unscaled; so variates with values that are all less than 1 will appear as 0 in the abbreviated table. The values are printed with no decimal places, and in a field-width of 3.

   The RANGE parameter contains a list of scalars, one for each variate in the DATA list. This allows you to check that the values of each variate lie within the given range. The range is also used to standardize quantitative variates, so that you can impose a standard range for example when variates are measured on commensurate scales. You can omit the RANGE parameter for all or any of the variates by giving a missing identifier or a scalar with a missing value; GenStat then uses the observed range.

   The UNITS option allows you to change the labelling of the units in the table; you can specify a text or a pointer or a variate.

   You can use the GROUPS option to specify a factor that will split the units into groups. The table from HLIST is then divided into sections corresponding to the groups. If the factor has labels, these are used to annotate the sections; otherwise a group number is used.

 

Options: GROUPS, UNITS.

Parameters: DATA, TEST, RANGE.


Action with RESTRICT

You can restrict any of the DATA variates to list only a subset of the units. If more than one of these is restricted, then they must all be restricted to the same set of units.