PuSH - Publication Server of Helmholtz Zentrum München

Li, Y.* ; Sun, H.* ; Fang, W.* ; Ma, Q.* ; Han, S.* ; Wang-Sattler, R. ; Du, W.* ; Yu, Q.*

SURE: Screening unlabeled samples for reliable negative samples based on reinforcement learning.

Information Sci. 629, 299-312 (2023)
Publ. Version/Full Text DOI
For many classification tasks, particularly in the bioinformatics field, only experimentally validated positive samples are available, and experimentally validated negative samples are not recorded. The lack of negative samples poses a challenge for using machine learning to perform such tasks. To address this problem, we propose a novel deep reinforcement learning-based model to screen reliable negative samples from unlabeled samples, named SURE. The model has two modules: a sample selector and a sample inspector. The sample selector screens reliable negative samples from unlabeled samples by two reinforcement strategies (learn to identify positive samples and reduce sample noise) and feeds the screened samples into the sample inspector. The sample inspector classifier provides rewards to the sample selector. The two modules are trained together to optimize the sample selector and sample inspector strategies. In this paper, we focus on one popular issue in the bioinformatics field: the ncRNA-protein interaction (NPI) prediction task, which lacks reliable negative samples. Thirty datasets for NPI prediction are used to test the screening effect of SURE. The experimental results show that our model has a robust negative sample screening capability and is superior to all outstanding sample screening methods used in the NPI prediction task. In addition, we refine 5 NPI datasets containing reliable negative samples screened by SURE, and a web server (ww.csbg-jlu.info/sure) is available offering the NPI prediction refined by SURE.
Impact Factor
Scopus SNIP
Altmetric
8.100
2.404
Tags
Annotations
Special Publikation
Hide on homepage

Edit extra information
Edit own tags
Private
Edit own annotation
Private
Hide on publication lists
on hompage
Mark as special
publikation
Publication type Article: Journal article
Document type Scientific Article
Keywords Deep Reinforcement Learning ; Ncrna-protein Interaction ; Negative Sample Screening
Language english
Publication Year 2023
HGF-reported in Year 2023
ISSN (print) / ISBN 0020-0255
e-ISSN 0020-0255
Quellenangaben Volume: 629, Issue: , Pages: 299-312 Article Number: , Supplement: ,
Publisher Elsevier
Publishing Place Ste 800, 230 Park Ave, New York, Ny 10169 Usa
Reviewing status Peer reviewed
Institute(s) Institute of Translational Genomics (ITG)
POF-Topic(s) 30205 - Bioengineering and Digital Health
Research field(s) Genetics and Epidemiology
PSP Element(s) G-506700-001
Scopus ID 85147542128
Erfassungsdatum 2023-11-28