Hu Chunyan, Hu Liangping. Cluster analysis of disordered samples: based on the methods of the third set of distances and the average linkage method[J]. SICHUAN MENTAL HEALTH, 2024, 37(S1): 84-89
Cluster analysis of disordered samples: based on the methods of the third set of distances and the average linkage method
DOI:10.11886/scjsws20240310003
English keywords: Symmetric nominal variables; Matching coefficient; Hamann coefficient; Hierarchical cluster analysis; Dendrogram
Author Name; Affiliation; Postcode
Hu Chunyan; Graduate School, Academy of Military Sciences PLA China, Beijing 100850, China; 100850
Hu Liangping*; Graduate School, Academy of Military Sciences PLA China, Beijing 100850, China, and Specialty Committee of Clinical Scientific Research Statistics of World Federation of Chinese Medicine Societies, Beijing 100029, China; 100029
English abstract:
      This article introduces the basic concepts and calculation methods of cluster analysis of disordered samples, illustrates them with two examples, and shows how to carry out the analysis in SAS. The basic concepts include symmetric nominal variables, missing values, the matching coefficient, and the Hamann coefficient. The calculation methods comprise the third set of distance measures (8 in total) and the average linkage method. The two example data sets are "survey data reflecting the air pollution conditions in 39 cities in the United States" and "population growth and decline data of stem borer in a certain area from 1962 to 1988". Cluster analysis of disordered samples was performed on both data sets using SAS software, and the SAS output is explained.
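To make the matching coefficient, the Hamann coefficient, and average linkage concrete, here is a minimal Python sketch. It is not the article's SAS code. It uses the common textbook definitions: with m matches and x mismatches over the variables that are non-missing in both samples, the matching coefficient is m / (m + x) and the Hamann coefficient is (m - x) / (m + x). The function names, the toy samples, the use of `None` for missing values, and the choice of 1 minus the matching coefficient as the dissimilarity are all assumptions made for illustration.

```python
from itertools import product


def match_counts(u, v):
    """Return (matches, mismatches) over positions where neither value is missing (None)."""
    pairs = [(a, b) for a, b in zip(u, v) if a is not None and b is not None]
    matches = sum(a == b for a, b in pairs)
    return matches, len(pairs) - matches


def matching_coefficient(u, v):
    """Simple matching coefficient: share of comparable variables on which two samples agree."""
    m, x = match_counts(u, v)
    return m / (m + x)  # raises ZeroDivisionError if no variable is comparable


def hamann_coefficient(u, v):
    """Hamann coefficient: (matches - mismatches) / (matches + mismatches), in [-1, 1]."""
    m, x = match_counts(u, v)
    return (m - x) / (m + x)


def average_linkage(cluster_a, cluster_b):
    """Average of pairwise dissimilarities (assumed here: 1 - matching coefficient)."""
    dists = [1 - matching_coefficient(u, v) for u, v in product(cluster_a, cluster_b)]
    return sum(dists) / len(dists)


# Hypothetical samples described by five symmetric nominal variables.
s1 = ["A", "B", "A", None, "C"]
s2 = ["A", "C", "A", "B", "C"]
s3 = ["B", "C", "B", "B", "A"]

print(matching_coefficient(s1, s2))   # 3 matches out of 4 comparable variables
print(hamann_coefficient(s1, s2))
print(average_linkage([s1], [s2, s3]))
```

In hierarchical clustering with average linkage, the two clusters with the smallest such average dissimilarity are merged at each step, and the sequence of merges is what a dendrogram displays.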