Hu Chunyan, Hu Liangping. Cluster analysis of disordered samples: based on the methods of the fourth set of distance and the centroid method[J]. SICHUAN MENTAL HEALTH, 2024, 37(S1): 90-94
Cluster analysis of disordered samples: based on the methods of the fourth set of distance and the centroid method
DOI:10.11886/scjsws20240310004
English keywords: Asymmetric nominal variables; Disordered samples; Dice coefficient; Hierarchical cluster analysis; Dendrogram
Author Name | Affiliation | Postcode
Hu Chunyan | Graduate School, Academy of Military Sciences PLA China, Beijing 100850, China | 100850
Hu Liangping* | Graduate School, Academy of Military Sciences PLA China, Beijing 100850, China; Specialty Committee of Clinical Scientific Research Statistics of World Federation of Chinese Medicine Societies, Beijing 100029, China | 100029
English abstract:
      This paper introduces the basic concepts, calculation methods, two examples, and the SAS implementation of cluster analysis of disordered samples. The basic concepts include asymmetric nominal variables, asymmetric ratio variables, the Dice coefficient, and the Kulczynski 1 coefficient. The calculation methods measure the distance between two samples using the fourth set of distance methods (8 in total) and the distance between two classes of samples using the centroid method. The data in the two examples are "whether the nine divorce reasons apply to each state in the United States" and "the first symptoms of ischemic stroke patients". Using SAS software, cluster analysis of disordered samples is performed on the data of the first example, and the SAS output is explained.
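
To make the two named coefficients concrete, the following is a minimal sketch in Python (the paper itself uses SAS; Python is used here only for illustration). It assumes the commonly cited forms for 0/1 data, Dice = 2a / (2a + b + c) and Kulczynski 1 = a / (b + c), where a counts variables present in both samples and b and c count variables present in only one of them. Joint absences are ignored, which is what makes the variables "asymmetric". The function names and the two sample vectors are hypothetical and are not taken from the paper's data.

```python
import math


def match_counts(x, y):
    """Return (a, b, c) for two equal-length 0/1 vectors.

    a: positions where both are 1
    b: positions where only x is 1
    c: positions where only y is 1
    Positions where both are 0 are deliberately not counted.
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    a = sum(1 for xi, yi in zip(x, y) if xi == 1 and yi == 1)
    b = sum(1 for xi, yi in zip(x, y) if xi == 1 and yi == 0)
    c = sum(1 for xi, yi in zip(x, y) if xi == 0 and yi == 1)
    return a, b, c


def dice(x, y):
    """Dice similarity, assumed form 2a / (2a + b + c)."""
    a, b, c = match_counts(x, y)
    denom = 2 * a + b + c
    # Undefined when neither sample has any positive value.
    return 2 * a / denom if denom else math.nan


def kulczynski1(x, y):
    """Kulczynski 1 similarity, assumed form a / (b + c)."""
    a, b, c = match_counts(x, y)
    mismatches = b + c
    if mismatches == 0:
        # No mismatches: unbounded if there are matches, undefined otherwise.
        return math.inf if a > 0 else math.nan
    return a / mismatches


# Hypothetical samples: 1 = the attribute applies, 0 = it does not.
sample_a = [1, 1, 0, 1, 0, 0, 1, 0, 1]
sample_b = [1, 0, 0, 1, 0, 1, 1, 0, 0]

print(match_counts(sample_a, sample_b))
print(dice(sample_a, sample_b))
print(kulczynski1(sample_a, sample_b))
```

Both values are similarities, so a clustering procedure that expects distances would need them converted first, for example as 1 minus the Dice value; whether the paper uses that particular conversion is not stated in the abstract.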