胡良平.提高回归模型拟合优度的策略(Ⅳ)———优化计分变换与其他变量变换[J].四川精神卫生杂志,2019,32(1):21-28.,Strategy of improving the goodness of fit of the regression model(Ⅳ) ——the optimal scoring transformation and the other variable transformations[J].SICHUAN MENTAL HEALTH,2019,32(1):21-28 |
提高回归模型拟合优度的策略(Ⅳ)———优化计分变换与其他变量变换 |
Strategy of improving the goodness of fit of the regression model(Ⅳ) ——the optimal scoring transformation and the other variable transformations |
|
DOI:10.11886/j.issn.1007-3256.2019.01.004 |
中文关键词: 优化计分变换 单调变换 样条变换 BOX-COX变换 |
英文关键词:Optimal scoring transformation Spline transformation Monotonic transformation Box - cox transformation |
基金项目:国家高技术研究发展计划课题资助(2015AA020102) |
|
摘要点击次数: |
全文下载次数: |
中文摘要: |
【摘要】 本文目的是介绍第四种提高回归模型拟合优度的策略,即优化计分变换与其他变量变换。 具体方法包括以下几个方面:①第一,对多值名义自变量采取“优化计分变换”;②对有序自变量分别采取“单调变换”与“优化计分变换”;③对定量自变量分别采取“样条变换”和“单调样条变换”;④对定量因变量分别采取“样条变换”“单调样条变换”和“BOX - COX变换”。 全部变量变换方法组合起来共 12 种,共创建了 12 个多重非线性回归模型。 依据“拟合优度评价指标”的取值,从 12个回归模型中挑选出一个,即本文中的“模型 1”,其“均方误差平方根 = 0. 30935?R2 = 0. 9586?调整 R2 = 0. 9527”。 结合本期科研方法专题同类文章的结果和结论,得出提高回归模型拟合优度的策略主要在于以下四点:①应对“定量因变量”“定量自变量”和“多值有序自变量”采取合适的变量变换方法;②在拟合回归模型的过程中,应尽可能多地引入派生变量;③应假定回归模型中不含截距项;④在构建回归模型的过程中,应尽可能多地使用筛选自变量的策略,如“前进法”“后退法”和“逐步法”。 |
英文摘要: |
The purpose of this paper was to introduce the fourth strategy of improving the goodness of fit of the regression models, the optimal scoring transformation and the other variable transformations. The concrete approaches were as follows: ①“ The optimal scoring transformation” was adopted to the multi - value nominal independent variable. ②“ The monotonic transformation” and “ the optimal scoring transformation” were adopted to the multi - value ordered independent variable, respectively. ③ “ The spline transformation” and “ the monotonic spline transformation” were adopted to the quantitative independent variables, respectively. ④“ The spline transformation ” “ the monotonic spline transformation ” and “ the BOX - COX transformation ” were adopted to the quantitative dependent variable, respectively. There were twelve variable transformation ways, so the twelve multiple nonlinear regression models were built. One best regression model, which was “ the model one” in this article, was selected from the twelve models mentioned above in terms of the results of the goodness of fit evaluation. The results were as follows: Root MSE = 0. 30935, R - Square = 0. 9586, and the adjusted R - Square = 0. 9527. Combined the results of this article with the other results of the previous three articles in the similar titles in this journal, the final conclusions were acquired as follows: ① “ The quantitative dependent variable” “ the quantitative independent variables” and “ the multi - value ordered independent variables” should be transformed in an appropriate form. ②The derived variables should be introduced as many as possible in fitting the regression model. ③No intercept term should be applied in fitting the regression models. ④The strategies of screening independent variables should be adopted as many as possible during fitting the regression models, such as “ forward selection” “ backward selection” and “ stepwise selection” . |
查看全文 查看/发表评论 下载PDF阅读器 |
关闭 |