版权说明 帮助中心
首页 > 成果 > 详情

Incorporating variable importance into kernel PLS for modeling the structure-activity relationship

SCI-E
认领
导出
Link by Springer Journal
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Huang, Xin;Luo, Yi-Ping;Xu, Qing-Song;Liang, Yi-Zeng
通讯作者:
Huang, X
作者机构:
[Huang, Xin; Luo, Yi-Ping] Hunan City Univ, Dept Math, Yiyang 413000, Peoples R China.
[Liang, Yi-Zeng] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Hunan, Peoples R China.
[Xu, Qing-Song] Cent S Univ, Sch Math & Stat, Changsha 410075, Hunan, Peoples R China.
通讯机构:
[Huang, Xin] Hunan City Univ, Dept Math, Yiyang 413000, Peoples R China.
语种:
英文
关键词:
Kernel partial least squares (KPLS);Variable importance (VI);Kernel methods;Regression coefficients;Structure-activity relationship (SAR)
期刊:
Journal of Mathematical Chemistry
ISSN:
0259-9791
年:
2018
卷:
56
期:
3
页码:
713-727
文献类别:
WOS:Article
所属学科:
ESI学科类别:化学;WOS学科类别:Chemistry, Multidisciplinary;Mathematics, Interdisciplinary Applications
入藏号:
基金类别:
National Bureau of Statistics of P.R. China [2015LY79]; Hunan Provincial Natural Science Foundation of China [2016JJ2011]; Hunan Provincial Education Department of China [16C0295]
机构署名:
本校为第一且通讯机构
院系归属:
理学院
摘要:
Kernel partial least squares (KPLS) has become popular techniques for chemical and biological modeling, which is a nonlinear extension of linear PLS. Training samples are transformed into a feature space via a nonlinear mapping, and then PLS algorithm can be carried out in the feature space. However, one of the main limitations of KPLS is that each feature is given the same importance in the kernel matrix, thus explaining the poor performance of KPLS for data with many irrelevant features. In this study, we provide a new strategy incorporated variable importance into KPLS, which is termed as the WKPLS approach. The WKPLS approach by modifying the kernel matrix provides a feasible way to differentiate between the true and noise variables. On the basis of the fact that the regression coefficients of the PLS model reflect the importance of variables, we firstly obtain the normalized regression coefficients by establishing the PLS model with all the variables. Then, Variable importance is incorporated into primary kernel. The performance of WKPLS is investigated with one simulated dataset and two structure–activity relationship (SAR) datasets. Compared with standard linear kernel PLS and Gaussian kernel PLS, The results show that WKPLS yields superior prediction performances to standard KPLS. WKPLS could be considered as a good mechanism by introducing extra information to improve the performance of KPLS for modeling SAR.
参考文献:
Avery MA, 2002, J MED CHEM, V45, P292, DOI 10.1021/jm0100234
Cao DS, 2013, BIOINFORMATICS, V29, P1092, DOI 10.1093/bioinformatics/btt105
Cao DS, 2011, ANAL CHIM ACTA, V706, P97, DOI 10.1016/j.aca.2011.08.025
Cao DS, 2010, CHEMOMETR INTELL LAB, V103, P129, DOI 10.1016/j.chemolab.2010.06.008
Centner V, 1996, ANAL CHEM, V68, P3851, DOI 10.1021/ac960321m

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com