Hu Chunyan, Hu Liangping. Cluster analysis of disordered samples: based on the methods of Euclidean distance and the minimum distance[J]. SICHUAN MENTAL HEALTH, 2024, 37(S1): 48-53
Cluster analysis of disordered samples: based on the methods of Euclidean distance and the minimum distance
DOI:10.11886/scjsws20240110002
English keywords: Disordered samples; Euclidean distance; Minimum distance; Cluster analysis; Dendrogram
Author Name | Affiliation | Postcode
Hu Chunyan | Graduate School, Academy of Military Sciences PLA China, Beijing 100850, China | 100850
Hu Liangping* | Graduate School, Academy of Military Sciences PLA China, Beijing 100850, China; Specialty Committee of Clinical Scientific Research Statistics of World Federation of Chinese Medicine Societies, Beijing 100029, China | 100029
English abstract:
      This article introduces the basic concepts and calculation methods of cluster analysis of disordered samples, together with two worked examples and their implementation in SAS. The basic concepts include disordered and ordered samples, factors affecting distance calculation, clustering rules, hierarchical clustering and its two steps, and the difference between clustering data and classification data. The calculation methods cover the Minkowski distance formula (of which Euclidean distance is a special case) and the minimum distance formula. The two example datasets are "body morphological data of 16-year-old boys from 27 ethnic minorities in China" and "labor health supervision data of 14 provinces and municipalities in China in 1995". Cluster analysis of disordered samples was performed on both datasets with SAS software, and the SAS output is explained.
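
The two calculation methods named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' SAS program: the function names `minkowski` and `single_linkage` and the four toy points are made up for illustration, no standardization of variables is applied, and the article's datasets are not reproduced here. The sketch assumes the Minkowski distance of order q, d(x, y) = (sum_k |x_k - y_k|^q)^(1/q), with q = 2 giving Euclidean distance, and takes the minimum distance between two clusters to be the smallest pairwise distance between their samples.

```python
from itertools import combinations


def minkowski(x, y, q=2):
    """Minkowski distance of order q; q=2 is the Euclidean distance."""
    return sum(abs(a - b) ** q for a, b in zip(x, y)) ** (1.0 / q)


def single_linkage(samples, q=2):
    """Hierarchical clustering with the minimum-distance rule.

    Returns the merge history as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a tuple of sample indices.
    """
    clusters = [(i,) for i in range(len(samples))]
    merges = []

    def gap(pair):
        # Distance between two clusters = smallest pairwise sample distance.
        a, b = pair
        return min(minkowski(samples[i], samples[j], q) for i in a for j in b)

    while len(clusters) > 1:
        # Step 1: find the closest pair of clusters.
        a, b = min(combinations(clusters, 2), key=gap)
        merges.append((a, b, gap((a, b))))
        # Step 2: merge that pair and repeat on the reduced set.
        clusters = [c for c in clusters if c not in (a, b)]
        clusters.append(tuple(sorted(a + b)))
    return merges


# Hypothetical toy data (not taken from the article).
toy = [(0.0, 0.0), (3.0, 4.0), (0.0, 1.0), (10.0, 10.0)]
for a, b, d in single_linkage(toy):
    print(a, b, round(d, 3))
```

The merge history printed by the loop is the information a dendrogram displays: which clusters join, and at what distance.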