Online First

Machine Learning-Driven Prediction of Lactic Acid Production from Anaerobic Fermentation of Organic Solid Waste

Received Date:2026-01-26 Revised Date:2026-02-14 Accepted Date:2026-02-28

DOI:10.20078/j.eep.20260310

Abstract:Anaerobic fermentation of organic solid waste (OSW) for lactic acid production represents an effective pathway for high-... Open+
Abstract:Anaerobic fermentation of organic solid waste (OSW) for lactic acid production represents an effective pathway for high-value resource recovery. However, conventional experience-based parameter regulation methods are limited in their ability to accurately characterize the coupled effects of multiple operational factors, often leading to extended fermentation periods, suboptimal parameter design, and unstable performance. To identify key control variables and quantitatively optimize process conditions, an interpretable causal machine learning (ICML) framework was applied to evaluate lactic acid production during the anaerobic fermentation of OSW. Three machine learning models—random forest (RF), gradient boosting regression (GBR), and gradient boosting decision tree (XGBoost, XGB)—were systematically compared. The XGB model exhibited superior predictive performance in estimating lactic acid concentration, yield, and production rate. Based on the optimized XGB model, Shapley additive explanations (SHAP) were employed to quantify the relative importance of input variables. Hydraulic retention time (HRT), total solids (TS), volatile solids (VS), and organic loading rate (OLR) were identified as key factors influencing lactic acid production performance. To further elucidate causal relationships, the linear non-Gaussian acyclic model (LiNGAM) was used to distinguish direct and indirect causal effects between features and target variables. The results indicated that HRT exerted a significant direct effect on lactic acid yield, while OLR was the primary positive driver of the production rate. Additionally, substrate type and inoculum ratio regulated the process through direct or indirect pathways. Partial dependence plots (PDPs) were subsequently applied to determine optimal parameter ranges. Lactic acid concentration exceeds 20 g/L when VS > 50 g/L, TS is 200–250 g/L, HRT is 2.5–5.0 d, and OLR < 18 g VS/(L·d); lactic acid yield peaks above 0.30 g/g VS when HRT is 10–15 d, TS < 25 g/L, OLR < 10 g VS/(L·d), and temperature is 30–45 °C; and lactic acid production rate exceeds 7 g/(L·d) when organic loading rate > 30 g VS/(L·d), hydraulic retention time < 3 d, VS > 100 g/L, and TS > 130 g/L. Two-dimensional partial dependence analysis (2D-PDP) further demonstrated the interactive effects of HRT and TS, defining suitable operating windows for balancing concentration and production rate. These findings demonstrate that the ICML approach can effectively elucidate the nonlinear relationships between process parameters and lactic acid production performance, offering quantitative guidance for parameter optimization and experimental design in OSW-based lactic acid production, thereby contributing to the advancement of organic waste resource recovery. Close-

Authors:

  • GUO Xueqiang1
  • TAO Xue2
  • LIU Shiqi1
  • ZHANG Guangming1,*

Units

  • School of Energy & Environmental Engineering, Hebei University of Technology
  • 2.  Department of Resources and Environment, Moutai Institute

Keywords

  • Machine  learning
  • Organic  solid  waste
  • Anaerobic  fermentation
  • Lactic  acid production
  • Parameter optimization

Citation

GUO Xueqiang, TAO Xue, LIU Shiqi, ZHANG Guangming. Machine Learning-Driven Prediction of Lactic Acid Production from Anaerobic Fermentation of Organic Solid Waste[J/OL]. Energy Environmental Protection: 1-10[2026-03-18]. https://doi.org/10.20078/j.eep.20260310.

Add: No. 288 Gongxiu Road, Xiaoshan District, Hangzhou, Zhejiang 311201, China
Post Code: 311201
Tel: +86-571-82989702 / +86-19558121964
Email: office@eep1987.com
Website Copyright © Editorial Department of Energy Environmental Protection 京ICP备05086979号