Multivariate and cluster analysis


Several standard multivariate methods are provided by GenStat directives. These include methods that analyse data in the form of units-by-variates, and methods that use a similarity or distance matrix.


The following directives carry out standard multivariate analyses:


CVA
canonical variates analysis

PCP
principal components analysis

PCO
principal coordinates analysis

ROTATE
Procrustes rotation

MDS
non-metric multidimensional scaling


Separate directives are available to process results from multivariate analyses:


FACROTATE
rotates factor loadings from a PCP or CVA

ADDPOINTS
adds points for new objects to a PCO

RELATE
relates principal coordinates to original data variates


The following directives are used for hierarchical or non-hierarchical cluster analysis:


FSIMILARITY
forms a similarity matrix or a between-group similarity matrix from a units-by-variates data matrix

REDUCE
forms a reduced similarity matrix (by groups)

HCLUSTER
hierarchical cluster analysis from a similarity matrix

CLUSTER
non-hierarchical clustering from a data matrix


Separate directives that process the results from hierarchical cluster analyses are:


HDISPLAY
displays results associated with hierarchical clustering

HLIST
lists a data matrix in abbreviated form

HSUMMARIZE
summarizes data variates by clusters


Other multivariate techniques are provided by procedures in the mva module of the Library:


AMMI
allows exploratory analysis of genotype × environment interactions

BCLASSIFICATION
constructs a classification tree

BCDISPLAY
displays a classification tree

BCIDENTIFY
identifies specimens using a classification tree

BCVALUES
forms values for nodes of a classification tree

BIPLOT
produces a biplot from a set of variates

BKEY
constructs an identification key

BKDISPLAY
displays an identification key

BKIDENTIFY
identifies specimens using a key

IDENTIFY
identifies an unknown specimen from a defined set of objects

CANCORRELATION
does canonical correlation analysis

CINTERACTION
clusters rows and columns of a two-way interaction table

CLASSIFY
obtains a starting classification for non-hierarchical clustering

CONVEXHULL
finds the points of a single or a full peel of convex-hulls

CORANALYSIS
does correspondence analysis, or reciprocal averaging (synonym CORRESP)

CVAPLOT
plots the mean and unit scores from a canonical variates analysis

CVASCORES
calculates scores for individual units in canonical variates analysis

DDENDROGRAM
draws dendrograms with control over structure and style

DISCRIMINATE
performs discriminant analysis

DMST
gives a high resolution plot of an ordination with minumum spanning tree

DPARALLEL
displays multivariate data using parallel coordinates

GENPROCRUSTES
performs a generalized Procrustes analysis

LRVSCREE
prints a scree diagram and/or a difference table of latent roots

MANOVA
performs multivariate analysis of variance and covariance

MANTEL
assesses the association between similarity matrices

MULTMISSING
estimates missing values for units in a multivariate data set

NORMTEST
performs tests of univariate and/or multivariate normality

PCOPROCRUSTES
performs a multiple Procrustes analysis

PLS
fits a partial least squares regression model

RIDGE
produces ridge regression and principal component regression analyses

RLFUNCTIONAL
fits a linear functional relationship model

RMULTIVARIATE
performs multivariate linear regression with accumulated testing of terms (synonym FITMULTIVARIATE)

ROBSSPM
forms robust estimates of sum-of-squares-and-products matrices

SKEWSYMMETRY
provides an analysis of skew-symmetry for an asymmetric matrix