Research Article: An individualized risk prediction tool for ectopic pregnancy within the first 10?weeks of gestation based on machine learning algorithms
Abstract:
As the main cause of maternal deaths in early pregnancy, delayed diagnosis of ectopic pregnancy (EP) may lead to severe consequences. Patients with pregnancy of unknown location (PUL) exhibit a significantly higher incidence of EP and associated risks compared to the general population. Therefore, this study aims to construct an early prediction model to identify EP risk among patients with PUL and provide a valuable direction for clinical intervention.
Retrospectively recruited 1896 patients with PUL within 10?weeks of gestation. Feature selection was done using the least absolute shrinkage and selection operator (LASSO). Logistic Regression (LR), Extreme Gradient Boosting (XGB), Random Forest (RFC), Support Vector Machine (SVM), and CatBoost were used to construct the early risk prediction model of EP. The model’s performance was evaluated by the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve (AUPRC), and the F1 score. SHapley Additive exPlanations (SHAP) algorithms ranked the feature importance for model output interpretation.
Among the PUL patients included in this study, 66 (4.08%) were diagnosed with EP. Key predictors selected for model construction included vaginal bleeding, progesterone, homogeneous adnexal mass, gravidity, hCG levels, history of cesarean section, abdominal tenderness, and history of pelvic surgery. Among the five models, the CatBoost algorithm demonstrated the best performance, achieving an AUROC of 0.930 (95% CI, 0.876–0.984) and an AUPRC of 0.685 (95% CI, 0.464–0.845). A user-friendly web-based platform was developed for EP risk assessment based on this model. According to SHAP analysis, the three most important clinical predictors were vaginal bleeding, progesterone levels, and the presence of a homogeneous adnexal mass.
This study employed the CatBoost algorithm to develop an individualized risk prediction model by integrating multiple features from the initial visit. This model enhances the detection rate of EP in patients with PUL during early pregnancy. Additionally, we created a web-based tool, offering potential for future clinical applications.
Introduction:
As the main cause of maternal deaths in early pregnancy, delayed diagnosis of ectopic pregnancy (EP) may lead to severe consequences. Patients with pregnancy of unknown location (PUL) exhibit a significantly higher incidence of EP and associated risks compared to the general population. Therefore, this study aims to construct an early prediction model to identify EP risk among patients with PUL and provide a valuable direction for clinical intervention.
Read more