Hu Chunyan, Hu Liangping. Cluster analysis of disordered samples: based on the methods of the first set of distances and the Ward's method[J]. Sichuan Mental Health, 2024, 37(S1): 71-77.
Cluster analysis of disordered samples: based on the methods of the first set of distances and the Ward's method
Received: 2024-03-10
DOI:10.11886/scjsws20240310001
Keywords: Nominal variable; Ordered variable; Interval variable; Ratio variable; Hierarchical cluster analysis
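Author; Affiliation; Postal code
Hu Chunyan; Graduate School, Academy of Military Sciences, Beijing 100850; 100850
Hu Liangping*; Graduate School, Academy of Military Sciences, Beijing 100850, and Specialty Committee of Clinical Scientific Research Statistics, World Federation of Chinese Medicine Societies, Beijing 100029; 100029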
Abstract:
      This article introduces the basic concepts, calculation methods, and two worked examples of cluster analysis of disordered samples, together with its implementation in SAS. The basic concepts are nominal variables, ordered variables, qualitative variables, interval variables, ratio variables, and distance. The calculation methods comprise the first set of distance formulas (15 in total) and the formula for Ward's method (the sum of squared deviations method), which measures the distance between two clusters of samples. The two examples use "physical morphology data of 16-year-old boys from 27 ethnic minorities in China" and "occupational health supervision data from 14 provinces and municipalities in China in 1995". Cluster analysis of disordered samples was performed on both data sets, and the SAS output was interpreted.
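To make the abstract's reference to Ward's method concrete, the following is a minimal Python sketch of the standard Ward distance between two clusters, which equals the increase in the within-cluster sum of squared deviations caused by merging them. It is not the authors' SAS code; the function names (`centroid`, `within_ss`, `ward_distance`) and the toy data are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

Point = Sequence[float]


def centroid(cluster: List[Point]) -> List[float]:
    """Return the coordinate-wise mean of the samples in a cluster."""
    n = len(cluster)
    return [sum(col) / n for col in zip(*cluster)]


def within_ss(cluster: List[Point]) -> float:
    """Sum of squared deviations of each sample from the cluster centroid."""
    c = centroid(cluster)
    return sum((x - m) ** 2 for row in cluster for x, m in zip(row, c))


def ward_distance(a: List[Point], b: List[Point]) -> float:
    """Ward distance: n_a * n_b / (n_a + n_b) * squared centroid distance."""
    ca, cb = centroid(a), centroid(b)
    sq_dist = sum((x - y) ** 2 for x, y in zip(ca, cb))
    na, nb = len(a), len(b)
    return na * nb / (na + nb) * sq_dist


# Toy data (made up for illustration, not taken from the article).
cluster_a = [[0.0, 0.0], [2.0, 0.0]]
cluster_b = [[5.0, 4.0]]

d = ward_distance(cluster_a, cluster_b)
# The same quantity computed directly as the increase in within-cluster
# sum of squares when the two clusters are merged.
increase = within_ss(cluster_a + cluster_b) - within_ss(cluster_a) - within_ss(cluster_b)
print(d, increase)
```

At each step, a hierarchical procedure based on this criterion would merge the pair of clusters with the smallest such distance; the two printed values should agree up to floating-point rounding.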