Paper
27 August 2024 DPPred-indel:a pathogenic inframe indel prediction method based on biological language models and fusion of DNA and protein features
Ming Lin
Author Affiliations +
Proceedings Volume 13252, Fourth International Conference on Biomedicine and Bioinformatics Engineering (ICBBE 2024); 132521N (2024) https://doi.org/10.1117/12.3044406
Event: 2024 Fourth International Conference on Biomedicine and Bioinformatics Engineering (ICBBE 2024), 2024, Kaifeng, China
Abstract
Inframe insertion/deletion (indel) mutations play a crucial role in human genetic variation and are closely associated with various diseases. Currently, most methods for predicting pathogenic inframe indel mutations are based on machine learning, utilizing manually engineered features for prediction, which may overlook certain mutation data. Moreover, existing deep learning methods mainly focus on protein sequence features, neglecting other aspects. This paper proposes a method for predicting pathogenic inframe indel mutations, termed DPPred-indel, based on a combination of a biological language model and feature fusion. DPPred-indel leverages transfer learning to address the limited data availability and integrates features from both protein and DNA sequence levels for prediction. Subsequent experiments validate the effectiveness of transfer learning and feature fusion. DPPred-indel demonstrates comparable performance to state-of-the-art methods on two independent test sets constructed in this study.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Ming Lin "DPPred-indel:a pathogenic inframe indel prediction method based on biological language models and fusion of DNA and protein features", Proc. SPIE 13252, Fourth International Conference on Biomedicine and Bioinformatics Engineering (ICBBE 2024), 132521N (27 August 2024); https://doi.org/10.1117/12.3044406
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Proteins

Data modeling

Pathogens

Performance modeling

Biological samples

Databases

Feature extraction

Back to Top