胡纯严,胡良平.无序样品聚类分析——基于第三组距离法和类平均法[J].四川精神卫生杂志,2024,37(S1):84-89.Hu Chunyan,Hu Liangping,Cluster analysis of disordered samples: based on the methods of the third set of distances and the average linkage method[J].SICHUAN MENTAL HEALTH,2024,37(S1):84-89
无序样品聚类分析——基于第三组距离法和类平均法
Cluster analysis of disordered samples: based on the methods of the third set of distances and the average linkage method
投稿时间:2024-03-10  
DOI:10.11886/scjsws20240310003
中文关键词:  对称名义变量  匹配系数  海曼系数  谱系聚类分析  树形图
英文关键词:Symmetric nominal variables  Matching coefficient  Hamann coefficient  Hierarchical cluster analysis  Dendrogram
基金项目:
作者单位邮编
胡纯严 军事科学院研究生院北京 100850 100850
胡良平* 军事科学院研究生院北京 100850
世界中医药学会联合会临床科研统计学专业委员会北京 100029 
100029
摘要点击次数:
全文下载次数:
中文摘要:
      本文目的是介绍与无序样品聚类分析有关的基本概念、计算方法、两个实例以及使用SAS实现计算的方法。基本概念包括对称名义变量、缺失值、匹配系数和海曼(Hamann)系数;计算方法涉及第三组距离法(共8个)和类平均法(average法);两个实例分别为“反映美国39座城市空气污染情况的调查数据”和“某地1962年—1988年三化螟种群消长资料”;借助SAS软件,对两个实例的数据进行了无序样品聚类分析,并对SAS输出结果做出了解释。
英文摘要:
      The purpose of this article was to introduce the basic concepts, calculation methods, two examples, and the implementation of SAS calculation methods related to the cluster analysis of disordered samples. The basic concepts included symmetric nominal variables, missing values, matching coefficients, and Hamann coefficients. The calculation method involved the third set of distance method (8 in total) and average linkage method. The data in the two examples were "survey data reflecting the air pollution conditions in 39 cities in the United States" and "population growth and decline data of stem borer in a certain area from 1962 to 1988". Using SAS software, Cluster analysis of disordered samples was performed on the data from two instances, and the SAS output results were explained.
查看全文  查看/发表评论  下载PDF阅读器
关闭