Prediction approaches for partly missing multi-omics covariate data: A literature review and an empirical comparison study.
    
    
        
    
    
        
        WIREs Comput. Sta. 16:e1626 (2024)
    
    
    
      
      
	
	    As the availability of omics data has increased in the last few years, more multi-omics data have been generated, that is, high-dimensional molecular data consisting of several types such as genomic, transcriptomic, or proteomic data, all obtained from the same patients. Such data lend themselves to being used as covariates in automatic outcome prediction because each omics type may contribute unique information, possibly improving predictions compared to using only one omics data type. Frequently, however, in the training data and the data to which automatic prediction rules should be applied, the test data, the different omics data types are not available for all patients. We refer to this type of data as block-wise missing multi-omics data. First, we provide a literature review on existing prediction methods applicable to such data. Subsequently, using a collection of 13 publicly available multi-omics data sets, we compare the predictive performances of several of these approaches for different block-wise missingness patterns. Finally, we discuss the results of this empirical comparison study and draw some tentative conclusions. This article is categorized under: Applications of Computational Statistics > Genomics/Proteomics/Genetics Applications of Computational Statistics > Health and Medical Data/Informatics Statistical and Graphical Methods of Data Analysis > Analysis of High Dimensional Data.
	
	
	    
	
       
      
	
	    
		Impact Factor
		Scopus SNIP
		Web of Science
Times Cited
		Scopus
Cited By
		Altmetric
		
	     
	    
	 
       
      
     
    
        Publication type
        Article: Journal article
    
 
    
        Document type
        Review
    
 
    
        Thesis type
        
    
 
    
        Editors
        
    
    
        Keywords
        Missing Values ; Molecular Data ; Multi-omics ; Prediction; Imputation; Shrinkage
    
 
    
        Keywords plus
        
    
 
    
    
        Language
        english
    
 
    
        Publication Year
        2024
    
 
    
        Prepublished in Year
        2023
    
 
    
        HGF-reported in Year
        2023
    
 
    
    
        ISSN (print) / ISBN
        1939-5108
    
 
    
        e-ISSN
        1939-0068
    
 
    
        ISBN
        
    
    
        Book Volume Title
        
    
 
    
        Conference Title
        
    
 
	
        Conference Date
        
    
     
	
        Conference Location
        
    
 
	
        Proceedings Title
        
    
 
     
	
    
        Quellenangaben
        
	    Volume: 16,  
	    Issue: 1,  
	    Pages: ,  
	    Article Number: e1626 
	    Supplement: ,  
	
    
 
    
        
            Series
            
        
 
        
            Publisher
            Wiley
        
 
        
            Publishing Place
            111 River St, Hoboken 07030-5774, Nj Usa
        
 
	
        
            Day of Oral Examination
            0000-00-00
        
 
        
            Advisor
            
        
 
        
            Referee
            
        
 
        
            Examiner
            
        
 
        
            Topic
            
        
 
	
        
            University
            
        
 
        
            University place
            
        
 
        
            Faculty
            
        
 
    
        
            Publication date
            0000-00-00
        
 
         
        
            Application date
            0000-00-00
        
 
        
            Patent owner
            
        
 
        
            Further owners
            
        
 
        
            Application country
            
        
 
        
            Patent priority
            
        
 
    
        Reviewing status
        Peer reviewed
    
 
     
    
        POF-Topic(s)
        30205 - Bioengineering and Digital Health
    
 
    
        Research field(s)
        Enabling and Novel Technologies
    
 
    
        PSP Element(s)
        G-503800-001
    
 
    
        Grants
        
    
 
    
        Copyright
        
    
 	
    
    
    
    
        Erfassungsdatum
        2023-12-08