Performance evaluation of features for gene essentiality prediction

dc.contributor.authorAromolaran, Olufemi
dc.contributor.authorOyelade, Jelili
dc.contributor.authorAdebiyi, Ezekiel
dc.date.accessioned2023-08-31T13:41:04Z
dc.date.available2023-08-31T13:41:04Z
dc.date.issued2021
dc.description.abstract. Essential genes are subset of genes required by an organism for growth and sustenance of life and as well responsible for phenotypic changes when their activities are altered. They have been utilized as drug targets, disease control agent, etc. Essential genes have been widely identified especially in microorganisms, due to the extensive experimental studies on some of them such as Escherichia coli and Saccharomyces cerevisiae. Experimental approach has been a reliable method to identify essential genes. However, it is complex, costly, labour and time intensive. Therefore, computational approach has been developed to complement the experimental approach in order to minimize resources required for essentiality identification experiments. Machine learning approaches have been widely used to predict essential genes in model organisms using different categories of features with varying degrees of accuracy and performance. However, previous studies have not established the most important categories of features that provide the distinguishing power in machine learning essentiality predictions. Therefore, this study evaluates the discriminating strength of major categories of features used in essential gene prediction task as well as the factors responsible for effective computational prediction. Four categories of features were considered and kfold cross-validation machine learning technique was used to build the classification model. Our results show that ontology features with an AUROC score of 0.936 has the most discriminating power to classify essential and non-essential genes. This studyconcludes that more ontology related features will further improve the performance of machine learning approach and also sensitivity, precision and AUPRC are realistic measures of performance in essentiality prediction.en_US
dc.description.sponsorshipACE: Applied Informatics and Communicationen_US
dc.identifier.issn1755-1307
dc.identifier.urihttps://datad.aau.org/handle/123456789/2104
dc.language.isoenen_US
dc.publisherIOP Conference Series: Earth and Environmental Scienceen_US
dc.relation.ispartofseriesIOP Conference Series: Earth and Environmental Science;
dc.subjectEssential genesen_US
dc.subjectEssential proteinsen_US
dc.subjectEssential proteinsen_US
dc.subjectMachine-learningen_US
dc.subjectDigital Developmenten_US
dc.subjectCovenant Universityen_US
dc.subjectACE: Applied Informatics and Communicationen_US
dc.subjectNigeriaen_US
dc.titlePerformance evaluation of features for gene essentiality predictionen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Performance.pdf
Size:
794.01 KB
Format:
Adobe Portable Document Format
Description:
Main article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections