Proteomics biomarker discovery for individualized prevention of familial pancreatic cancer using statistical learning.
    
    
        
    
    
        
        PLoS ONE 18:e0280399 (2023)
    
    
    
		
		
			
				BACKGROUND: The low five-year survival rate of pancreatic ductal adenocarcinoma (PDAC) and the low diagnostic rate of early-stage PDAC via imaging highlight the need to discover novel biomarkers and improve the current screening procedures for early diagnosis. Familial pancreatic cancer (FPC) describes the cases of PDAC that are present in two or more individuals within a circle of first-degree relatives. Using innovative high-throughput proteomics, we were able to quantify the protein profiles of individuals at risk from FPC families in different potential pre-cancer stages. However, the high-dimensional proteomics data structure challenges the use of traditional statistical analysis tools. Hence, we applied advanced statistical learning methods to enhance the analysis and improve the results' interpretability. METHODS: We applied model-based gradient boosting and adaptive lasso to deal with the small, unbalanced study design via simultaneous variable selection and model fitting. In addition, we used stability selection to identify a stable subset of selected biomarkers and, as a result, obtain even more interpretable results. In each step, we compared the performance of the different analytical pipelines and validated our approaches via simulation scenarios. RESULTS: In the simulation study, model-based gradient boosting showed a more accurate prediction performance in the small, unbalanced, and high-dimensional datasets than adaptive lasso and could identify more relevant variables. Furthermore, using model-based gradient boosting, we discovered a subset of promising serum biomarkers that may potentially improve the current screening procedure of FPC. CONCLUSION: Advanced statistical learning methods helped us overcome the shortcomings of an unbalanced study design in a valuable clinical dataset. The discovered serum biomarkers provide us with a clear direction for further investigations and more precise clinical hypotheses regarding the development of FPC and optimal strategies for its early detection.
			
			
				
			
		 
		
			
				
					
					Impact Factor
					Scopus SNIP
					Web of Science
Times Cited
					Scopus
Cited By
					
					Altmetric
					
				 
				
			 
		 
		
     
    
        Publikationstyp
        Artikel: Journalartikel
    
 
    
        Dokumenttyp
        Wissenschaftlicher Artikel
    
 
    
        Typ der Hochschulschrift
        
    
 
    
        Herausgeber
        
    
    
        Schlagwörter
        Binding Protein; R Package; Expression; Risk; Adenocarcinoma; Regularization; Prediction; Molecule; Homolog; Cloning
    
 
    
        Keywords plus
        
    
 
    
    
        Sprache
        englisch
    
 
    
        Veröffentlichungsjahr
        2023
    
 
    
        Prepublished im Jahr 
        0
    
 
    
        HGF-Berichtsjahr
        2023
    
 
    
    
        ISSN (print) / ISBN
        1932-6203
    
 
    
        e-ISSN
        
    
 
    
        ISBN
        
    
 
    
        Bandtitel
        
    
 
    
        Konferenztitel
        
    
 
	
        Konferzenzdatum
        
    
     
	
        Konferenzort
        
    
 
	
        Konferenzband
        
    
 
     
		
    
        Quellenangaben
        
	    Band: 18,  
	    Heft: 1,  
	    Seiten: ,  
	    Artikelnummer: e0280399 
	    Supplement: ,  
	
    
 
  
        
            Reihe
            
        
 
        
            Verlag
            Public Library of Science (PLoS)
        
 
        
            Verlagsort
            Lawrence, Kan.
        
 
	
        
            Tag d. mündl. Prüfung
            0000-00-00
        
 
        
            Betreuer
            
        
 
        
            Gutachter
            
        
 
        
            Prüfer
            
        
 
        
            Topic
            
        
 
	
        
            Hochschule
            
        
 
        
            Hochschulort
            
        
 
        
            Fakultät
            
        
 
    
        
            Veröffentlichungsdatum
            0000-00-00
        
 
         
        
            Anmeldedatum
            0000-00-00
        
 
        
            Anmelder/Inhaber
            
        
 
        
            weitere Inhaber
            
        
 
        
            Anmeldeland
            
        
 
        
            Priorität
            
        
 
    
        Begutachtungsstatus
        Peer reviewed
    
 
     
    
        POF Topic(s)
        30501 - Systemic Analysis of Genetic and Environmental Factors that Impact Health
30203 - Molecular Targets and Therapies
    
 
    
        Forschungsfeld(er)
        Genetics and Epidemiology
Enabling and Novel Technologies
    
 
    
        PSP-Element(e)
        G-504100-001
A-630700-001
G-505700-001
    
 
    
        Förderungen
        GAUFF-Foundation
German Research Foundation
Munich Center of Health Sciences (MC-Health), Ludwig Maximilian University of Munich, as part of LMUinnovativ
Wilhelm Sander-Stiftung
    
 
    
        Copyright
        
    
 	
    
    
    
    
    
        Erfassungsdatum
        2023-02-01