Membrane protein topology : prediction, experimental mapping and genome-wide analysis
Author: Nilsson, Johan
Date: 2004-06-02
Location: Samuelssonsalen, Scheelelaboratoriet, Karolinska Institutet
Time: 10.00
Department: Institutionen för medicinsk biokemi och biofysik (MBB) / Department of Medical Biochemistry and Biophysics
Abstract
Membrane proteins constitute a large and important class of proteins.
They are responsible for many crucial processes in living cells, and
about 25% of all proteins encoded by a typical genome are predicted to be
membrane proteins. Despite their importance, very few structures have
been determined at atomic resolution. As an alternative, the topology of
a membrane protein may provide valuable information on e.g. the location
of ligand binding sites on receptors and transporters.
In this thesis, bioinformatics and experimental approaches have been applied to study membrane protein topologies. A method to identify reliably predicted topologies or partial topologies, using a consensus of five different prediction algorithms, has been developed. The approach has been used in combination with limited fusion protein analysis to rapidly produce topology models of E. coli inner-membrane proteins. In addition, a comprehensive statistical analysis has been performed on predicted membrane protein topologies from the genomes of 108 organisms. The correlation between the number of independent prediction methods that agree on a topology model, and the reliability of the predicted topology was investigated. Analysis on a data set of E. coli inner-membrane proteins with experimentally determined topologies revealed that predicted topologies were very reliable (>90% correct) when all methods or all but one agreed on the prediction. When the consensus approach was applied to all putative membrane proteins in a number of completely sequenced genomes, we found that highly reliable topology predictions could be made for nearly half of all membrane proteins in most prokaryotic organisms. The method was subsequently extended to predict also parts of a global topology with an equally high reliability. With the partial consensus topology predictions, the average fraction of prokaryotic membrane proteins for which either a complete or a partial topology of high reliability could be predicted was increased to approximately 70%. Fusion protein analysis has been used in combination with the consensus topology predictions to reduce the amount of experimental efforts required to establish a topology model. The topologies of 43 E. coli inner-membrane proteins have been mapped using the combined approach.
Comparative studies of amino acid distributions in predicted membrane proteins from a total of 108 genomes were performed. The main objective of the study was to establish if the so called positive-inside rule (the asymmetric distribution of Lys and Arg residues between polar loops on opposite sides of a membrane) is valid for all organisms where the genome has been sequenced. An algorithm for identification of membrane spanning helices was optimised to yield a minimum number of incorrect predictions. The optimised method was then applied to all putative membrane proteins in each genome, and the average frequency bias of each amino acid residue type between polar segments on opposite sides of the membrane was estimated. The study strongly suggested that the positive-inside rule is valid in most, if not all, organisms investigated, although it is less pronounced in eukaryotes than in eubacteria and archaea.
In this thesis, bioinformatics and experimental approaches have been applied to study membrane protein topologies. A method to identify reliably predicted topologies or partial topologies, using a consensus of five different prediction algorithms, has been developed. The approach has been used in combination with limited fusion protein analysis to rapidly produce topology models of E. coli inner-membrane proteins. In addition, a comprehensive statistical analysis has been performed on predicted membrane protein topologies from the genomes of 108 organisms. The correlation between the number of independent prediction methods that agree on a topology model, and the reliability of the predicted topology was investigated. Analysis on a data set of E. coli inner-membrane proteins with experimentally determined topologies revealed that predicted topologies were very reliable (>90% correct) when all methods or all but one agreed on the prediction. When the consensus approach was applied to all putative membrane proteins in a number of completely sequenced genomes, we found that highly reliable topology predictions could be made for nearly half of all membrane proteins in most prokaryotic organisms. The method was subsequently extended to predict also parts of a global topology with an equally high reliability. With the partial consensus topology predictions, the average fraction of prokaryotic membrane proteins for which either a complete or a partial topology of high reliability could be predicted was increased to approximately 70%. Fusion protein analysis has been used in combination with the consensus topology predictions to reduce the amount of experimental efforts required to establish a topology model. The topologies of 43 E. coli inner-membrane proteins have been mapped using the combined approach.
Comparative studies of amino acid distributions in predicted membrane proteins from a total of 108 genomes were performed. The main objective of the study was to establish if the so called positive-inside rule (the asymmetric distribution of Lys and Arg residues between polar loops on opposite sides of a membrane) is valid for all organisms where the genome has been sequenced. An algorithm for identification of membrane spanning helices was optimised to yield a minimum number of incorrect predictions. The optimised method was then applied to all putative membrane proteins in each genome, and the average frequency bias of each amino acid residue type between polar segments on opposite sides of the membrane was estimated. The study strongly suggested that the positive-inside rule is valid in most, if not all, organisms investigated, although it is less pronounced in eukaryotes than in eubacteria and archaea.
List of papers:
I. Nilsson J, Persson B, von Heijne G (2000). "Consensus predictions of membrane protein topology. " FEBS Lett 486(3): 267-9
Pubmed
II. Nilsson J, Persson B, Von Heijne G (2002). "Prediction of partial membrane protein topologies using a consensus approach. " Protein Sci 11(12): 2974-80
Pubmed
III. Drew D, Sjostrand D, Nilsson J, Urbig T, Chin CN, de Gier JW, von Heijne G (2002). "Rapid topology mapping of Escherichia coli inner-membrane proteins by prediction and PhoA/GFP fusion analysis." Proc Natl Acad Sci U S A 99(5): 2690-5. Epub 2002 Feb 26
Pubmed
IV. Rapp M, Drew D, Daley DO, Nilsson J, Carvalho T, Melen K, De Gier JW, Von Heijne G (2004). "Experimentally based topology models for E. coli inner membrane proteins." Protein Sci 13(4): 937-45
Pubmed
V. Nilsson J, Persson B, von Heijne G (2004). "Comparative analysis of amino acid distributions in integral membrane proteins from 108 genomes." (Manuscript)
I. Nilsson J, Persson B, von Heijne G (2000). "Consensus predictions of membrane protein topology. " FEBS Lett 486(3): 267-9
Pubmed
II. Nilsson J, Persson B, Von Heijne G (2002). "Prediction of partial membrane protein topologies using a consensus approach. " Protein Sci 11(12): 2974-80
Pubmed
III. Drew D, Sjostrand D, Nilsson J, Urbig T, Chin CN, de Gier JW, von Heijne G (2002). "Rapid topology mapping of Escherichia coli inner-membrane proteins by prediction and PhoA/GFP fusion analysis." Proc Natl Acad Sci U S A 99(5): 2690-5. Epub 2002 Feb 26
Pubmed
IV. Rapp M, Drew D, Daley DO, Nilsson J, Carvalho T, Melen K, De Gier JW, Von Heijne G (2004). "Experimentally based topology models for E. coli inner membrane proteins." Protein Sci 13(4): 937-45
Pubmed
V. Nilsson J, Persson B, von Heijne G (2004). "Comparative analysis of amino acid distributions in integral membrane proteins from 108 genomes." (Manuscript)
Issue date: 2004-05-12
Publication year: 2004
ISBN: 91-7349-963-3
Statistics
Total Visits
Views | |
---|---|
Membrane ...(legacy) | 278 |
Membrane ... | 119 |
Total Visits Per Month
March 2024 | April 2024 | May 2024 | June 2024 | July 2024 | August 2024 | September 2024 | |
---|---|---|---|---|---|---|---|
Membrane ... | 0 | 0 | 0 | 1 | 4 | 5 | 2 |
Top country views
Views | |
---|---|
United States | 82 |
China | 52 |
Sweden | 43 |
Germany | 41 |
South Korea | 11 |
Ireland | 8 |
Russia | 7 |
Finland | 6 |
Denmark | 3 |
Spain | 3 |
Top cities views
Views | |
---|---|
Sunnyvale | 22 |
Shenzhen | 19 |
Kiez | 17 |
Seoul | 9 |
Beijing | 8 |
Dublin | 8 |
University Park | 5 |
Ashburn | 4 |
Mountain View | 4 |
Tianjin | 4 |