TY - BOOK
T1 - Multiple gold standards address bias in functional network integration
AU - James, K.
AU - Lycett, Samantha J.
AU - Wipat, A.
AU - Hallinan, Jennifer S.
PY - 2011/11/1
Y1 - 2011/11/1
N2 - Network integration is a widely used method of combining large, diverse datasets. Edge weights, representing the probability that an edge actually exists, can add greatly to the value of the networks. The edge weights are usually calculated using a Gold Standard dataset. However, all Gold Standards suffer from incomplete coverage of the genome, and from bias in the type of interactions detected by different experimental techniques. Consequently, the use of a single Gold Standard tends to bias the integrated network. We describe a novel Bayesian Data Fusion method for selecting and using multiple Gold Standards for scoring datasets prior to integration. We demonstrate the utility of networks scored against multiple Gold Standards for the prediction of Gene Ontology annotations for genes from KEGG pathways. Finally, we apply the networks to the functional prediction of genes which were uncharacterised in datasets from 2007, and evaluate the network results in the light of recent annotations.
AB - Network integration is a widely used method of combining large, diverse datasets. Edge weights, representing the probability that an edge actually exists, can add greatly to the value of the networks. The edge weights are usually calculated using a Gold Standard dataset. However, all Gold Standards suffer from incomplete coverage of the genome, and from bias in the type of interactions detected by different experimental techniques. Consequently, the use of a single Gold Standard tends to bias the integrated network. We describe a novel Bayesian Data Fusion method for selecting and using multiple Gold Standards for scoring datasets prior to integration. We demonstrate the utility of networks scored against multiple Gold Standards for the prediction of Gene Ontology annotations for genes from KEGG pathways. Finally, we apply the networks to the functional prediction of genes which were uncharacterised in datasets from 2007, and evaluate the network results in the light of recent annotations.
M3 - Other report
VL - CS-TR-1302
T3 - Newcastle University Technical Report Series
BT - Multiple gold standards address bias in functional network integration
CY - Newcastle upon Tyne
ER -