Analytical Performance Specifications for Input Variables: Investigation of the Model of End-Stage Liver Disease,Clinical Chemistry

当前位置： X-MOL 学术 › Clin. Chem. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Analytical Performance Specifications for Input Variables: Investigation of the Model of End-Stage Liver Disease
Clinical Chemistry ( IF 7.1 ) Pub Date : 2024-02-28 , DOI: 10.1093/clinchem/hvae019
Eline S Andersen _{1,

2} , Richard Röttger ₃ , Claus L Brasen _{1,

2} , Ivan Brandslund _{1,

2}

Affiliation

Background Artificial intelligence models constitute specific uses of analysis results and, therefore, necessitate evaluation of analytical performance specifications (APS) for this context specifically. The Model of End-stage Liver Disease (MELD) is a clinical prediction model based on measurements of bilirubin, creatinine, and the international normalized ratio (INR). This study evaluates the propagation of error through the MELD, to inform choice of APS for the MELD input variables. Methods A total of 6093 consecutive MELD scores and underlying analysis results were retrospectively collected. “Desirable analytical variation” based on biological variation as well as current local analytical variation was simulated onto the data set as well as onto a constructed data set, representing a worst-case scenario. Resulting changes in MELD score and risk classification were calculated. Results Biological variation-based APS in the worst-case scenario resulted in 3.26% of scores changing by ≥1 MELD point. In the patient-derived data set, the same variation resulted in 0.92% of samples changing by ≥1 MELD point, and 5.5% of samples changing risk category. Local analytical performance resulted in lower reclassification rates. Conclusions Error propagation through MELD is complex and includes population-dependent mechanisms. Biological variation-derived APS were acceptable for all uses of the MELD score. Other combinations of APS can yield equally acceptable results. This analysis exemplifies how error propagation through artificial intelligence models can become highly complex. This complexity will necessitate that both model suppliers and clinical laboratories address analytical performance specifications for the specific use case, as these may differ from performance specifications for traditional use of the analyses.

中文翻译：

输入变量的分析性能规范：终末期肝病模型的调查

背景人工智能模型构成了分析结果的特定用途，因此需要专门针对此背景评估分析性能规范（APS）。终末期肝病模型（MELD）是一种基于胆红素、肌酐和国际标准化比值（INR）测量值的临床预测模型。本研究评估了误差通过 MELD 的传播，为 MELD 输入变量的 APS 选择提供信息。方法回顾性收集 6093 例连续 MELD 评分和相关分析结果。基于生物变化和当前局部分析变化的“理想分析变化”被模拟到数据集以及构建的数据集上，代表了最坏的情况。计算 MELD 评分和风险分类的结果变化。结果在最坏的情况下，基于生物变异的 APS 导致 3.26% 的分数变化了 ≥1 个 MELD 点。在患者来源的数据集中，相同的变化导致 0.92% 的样本变化了 ≥1 个 MELD 点，5.5% 的样本改变了风险类别。本地分析性能导致较低的重新分类率。结论通过 MELD 的误差传播很复杂，包括群体依赖性机制。生物变异衍生的 APS 对于 MELD 评分的所有用途都是可接受的。其他 APS 组合可以产生同样可接受的结果。此分析举例说明了通过人工智能模型传播的误差如何变得高度复杂。这种复杂性将要求模型供应商和临床实验室都解决特定用例的分析性能规格，因为这些规格可能不同于传统分析使用的性能规格。

更新日期：2024-02-28

点击分享查看原文

点击收藏

阅读更多本刊新发论文本刊介绍/投稿指南