PuSH - Publication Server of Helmholtz Zentrum München

Benchmarking explainable AI methods for toxicophore detection and toxicity prediction.

Digit. Discov., DOI: 10.1039/d5dd00576k (2026)
Publ. Version/Full Text DOI
Open Access Gold
Creative Commons Lizenzvertrag
Recent studies have reported inconsistent behavior across explainable AI (XAI) methods in molecular property prediction, raising concerns about their reliability. This work investigates whether such inconsistencies arise from the XAI methods themselves or from the accuracy of the underlying predictive model. A high-accuracy model was first trained on deterministic functional-group labels, where all evaluated XAI methods consistently highlighted the correct atoms corresponding to the true structural motifs. The analysis was extended to mutagenicity prediction, where the methods again identified known toxicophores and chemically meaningful scaffolds. Model performance was then systematically degraded by introducing controlled amounts of label noise. As predictive accuracy decreased, agreement between XAI methods weakened gradually, and the highlighted features became less chemically relevant. When accuracy reached around 0.65, this trend changed, with a much sharper loss of agreement, indicating an explainability cliff. These findings underline the importance of assessing model accuracy before drawing conclusions from XAI outputs.
Altmetric
Additional Metrics?
Edit extra informations Login
Publication type Article: Journal article
Document type Scientific Article
Keywords Carcinogenicity; Mutagenicity; Chemicals; Alerts; System
ISSN (print) / ISBN 2635-098X
e-ISSN 2635-098X
Publisher Royal Society of Chemistry (RSC)
Publishing Place Thomas Graham House, Science Park, Milton Rd, Cambridge Cb4 0wf, Cambs, England
Reviewing status Peer reviewed
Grants European Union's Horizon Europe programme