Predicting functional residues of protein sequence alignments as a feature selection task

Chris Haddow, Justin Perry, Marcus Durrant, Joe Faith

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

Determining which residues within a multiple alignment of protein sequences are most responsible for protein function is a difficult and important task in bioinformatics. Here, we show that this task is an application of the standard Feature Selection (FS) problem. We show the comparison of standard FS techniques with more specialised algorithms on a range of data sets backed by experimental evidence, and find that some standard algorithms perform as well as specialised ones. We also discuss how considering the discriminating power of combinations of residue positions, rather than the power of each position individually, has the potential to improve the performance of such algorithms.
Original languageEnglish
Pages (from-to)691-705
JournalInternational Journal of Data Mining in Bioinformatics
Volume5
Issue number6
DOIs
Publication statusPublished - 2011

Keywords

  • feature selection
  • protein structure
  • functional residue prediction
  • targeted projection pursuit
  • information gain

Fingerprint

Dive into the research topics of 'Predicting functional residues of protein sequence alignments as a feature selection task'. Together they form a unique fingerprint.

Cite this