Projects per year
Abstract
Proteins are macromolecules that perform essential biological functions which depend on their three-dimensional structure. Determining this structure involves complex laboratory and computational work. For the computational work, multiple software pipelines have been developed to build models of the protein structure from crystallographic data. Each of these pipelines performs differently depending on the characteristics of the electron-density map received as input. Identifying the best pipeline to use for a protein structure is difficult, as the pipeline performance differs significantly from one protein structure to another. As such, researchers often select pipelines that do not produce the best possible protein models from the available data. Here, a software tool is introduced which predicts key quality measures of the protein structures that a range of pipelines would generate if supplied with a given crystallographic data set. These measures are crystallographic quality-of-fit indicators based on included and withheld observations, and structure completeness. Extensive experiments carried out using over 2500 data sets show that the tool yields accurate predictions for both experimental phasing data sets (at resolutions between 1.2 and 4.0 Å) and molecular-replacement data sets (at resolutions between 1.0 and 3.5 Å). The tool can therefore provide a recommendation to the user concerning the pipelines that should be run in order to proceed most efficiently to a depositable model.
Original language | English |
---|---|
Pages (from-to) | 1591-1601 |
Number of pages | 11 |
Journal | Acta crystallographica. Section D, Structural biology |
Volume | 77 |
Issue number | Pt 12 |
DOIs | |
Publication status | Published - 1 Dec 2021 |
Projects
- 1 Finished
-
CCP4 Advanced integrated approaches to macromolecular structure determination
Cowtan, K. (Principal investigator) & Agirre, J. (Co-investigator)
BBSRC (BIOTECHNOLOGY AND BIOLOGICAL SCIENCES RESEARCH COUNCIL)
1/04/19 → 31/03/24
Project: Research project (funded) › Research
Datasets
-
Predicting the performance of automated crystallographic model-building pipelines
Alharbi, E. (Creator), Bond, P. (Contributor), Calinescu, R. (Contributor) & Cowtan, K. D. (Contributor), University of York, 20 Oct 2021
DOI: 10.15124/ee9d169f-c34b-44f2-8c75-3b68e7cd68a8
Dataset