当前位置:
X-MOL 学术
›
J. Hazard. Mater.
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
Construction of Interpretable Ensemble Learning Models for Predicting Bioaccumulation Parameters of Organic Chemicals in Fish
Journal of Hazardous Materials ( IF 12.2 ) Pub Date : 2024-11-20 , DOI: 10.1016/j.jhazmat.2024.136606 Minghua Zhu, Zijun Xiao, Tao Zhang, Guanghua Lu
Journal of Hazardous Materials ( IF 12.2 ) Pub Date : 2024-11-20 , DOI: 10.1016/j.jhazmat.2024.136606 Minghua Zhu, Zijun Xiao, Tao Zhang, Guanghua Lu
Accurate prediction of bioaccumulation parameters is essential for assessing exposure, hazards, and risks of chemicals. However, the majority of prediction models on bioaccumulation parameters are individual models based on a single algorithm and lack model interpretation, resulting in unsatisfactory prediction accuracy due to inherent constraints of the algorithm and weak interpretability. Ensemble learning (EL) that combine multiple algorithms, coupled with SHapley Additive exPlanation (SHAP) method, may overcome the limitations. Herein, EL models were constructed for three bioaccumulation parameters using datasets covering 2496 chemicals. The EL models demonstrated superior prediction accuracy compared to both individual models developed in this study and those from previous research, achieving a coefficient of determination of up to 0.861 on the validation sets. Applicability domains were characterized using a structure-activity landscape-based (abbreviated as ADSAL) methodology. The optimal EL models, together with the ADSAL, were successfully used to predict bioaccumulation parameters for 4,374 chemicals included in the Inventory of Existing Chemical Substances of China. Model interpretation using the SHAP method offered insight into key features influencing bioaccumulation potential, including hydrophobicity, water solubility, polarizability, ionization potential, weight, and volume of molecules. Overall, the study provides data and models to support the sound management and risk assessment of chemicals.
更新日期:2024-11-20