蛋白质结构预测

2019-08-30 23:36

鼠伤寒沙门氏菌H-1鞭毛蛋白的结构预测

蛋白质序列截图

>sp|P06179|FLIC_SALTY Flagellin OS=Salmonella typhimurium GN=fliC PE=1 SV=4

MAQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKG LTQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRL NEIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDTLNVQQKYK VSDTAATVTGYADTTIALDNSTFKASATGLGGTDQKIDGDLKFDDTTGKYYAKVTVTGGT GKDGYYEVSVDKTNGEVTLAGGATSPLTGGLPATATEDVKNVQVANADLTEAKAALTAAG VTGTASVVKMSYTDNNGKTIDGGLAVKVGDDYYSATQNKDGSISINTTKYTADDGTSKTA LNKLGGADGKTEVVSIGGKTYAASKAEGHNFKAQPDLAEAAATTTENPLQKIDAALAQVD TLRSDLGAVQNRFNSAITNLGNTVNNLTSARSRIEDSDYATEVSNMSRAQILQQAGTSVL AQANQVPQNVLSLLR

等电点、相对分子质量计算

10 20 30 40 50 60 MAQVINTNSL SLLTQNNLNK SQSALGTAIE RLSSGLRINS AKDDAAGQAI ANRFTANIKG

70 80 90 100 110 120 LTQASRNAND GISIAQTTEG ALNEINNNLQ RVRELAVQSA NSTNSQSDLD SIQAEITQRL

130 140 150 160 170 180 NEIDRVSGQT QFNGVKVLAQ DNTLTIQVGA NDGETIDIDL KQINSQTLGL DTLNVQQKYK

190 200 210 220 230 240

VSDTAATVTG YADTTIALDN STFKASATGL GGTDQKIDGD LKFDDTTGKY YAKVTVTGGT

250 260 270 280 290 300 GKDGYYEVSV DKTNGEVTLA GGATSPLTGG LPATATEDVK NVQVANADLT EAKAALTAAG

310 320 330 340 350 360 VTGTASVVKM SYTDNNGKTI DGGLAVKVGD DYYSATQNKD GSISINTTKY TADDGTSKTA

370 380 390 400 410 420 LNKLGGADGK TEVVSIGGKT YAASKAEGHN FKAQPDLAEA AATTTENPLQ KIDAALAQVD

430 440 450 460 470 480 TLRSDLGAVQ NRFNSAITNL GNTVNNLTSA RSRIEDSDYA TEVSNMSRAQ ILQQAGTSVL

490

AQANQVPQNV LSLLR

蛋白质参数预测

Number of amino acids: 495

Molecular weight: 51611.7

Theoretical pI: 4.79

Amino acid composition: Ala (A) 61 12.3% Arg (R) 14 2.8% Asn (N) 42 8.5% Asp (D) 37 7.5% Cys (C) 0 0.0% Gln (Q) 32 6.5% Glu (E) 17 3.4% Gly (G) 43 8.7% His (H) 1 0.2% Ile (I) 25 5.1% Leu (L) 42 8.5% Lys (K) 28 5.7% Met (M) 3 0.6% Phe (F) 6 1.2% Pro (P) 5 1.0% Ser (S) 38 7.7% Thr (T) 57 11.5% Trp (W) 0 0.0%

Tyr (Y) 12 2.4% Val (V) 32 6.5% Pyl (O) 0 0.0% Sec (U) 0 0.0%

(B) 0 0.0% (Z) 0 0.0% (X) 0 0.0%

Total number of negatively charged residues (Asp + Glu): 54 Total number of positively charged residues (Arg + Lys): 42

Atomic composition:

Carbon C 2194 Hydrogen H 3597 Nitrogen N 641 Oxygen O 785 Sulfur S 3

Formula: C2194H3597N641O785S3 Total number of atoms: 7220

Extinction coefficients:

This protein does not contain any Trp residues. Experience shows that this could result in more than 10% error in the computed extinction coefficient.

Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water.

Ext. coefficient 17880 Abs 0.1% (=1 g/l) 0.346

Estimated half-life:

The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo).

>10 hours (Escherichia coli, in vivo).

Instability index:

The instability index (II) is computed to be 24.98 This classifies the protein as stable.

Aliphatic index: 83.86

Grand average of hydropathicity (GRAVY): -0.395

氨基酸组成、电荷分布、疏水区域、跨膜区域等 SAPS. Version of April 11, 1996. Date run: Mon Nov 22 12:41:22 2010

SAPS (Statistical Analysis of Protein Sequences) evaluates by statistical

criteria a wide variety of protein sequence properties. A full description

of the methods is given in the paper referred to below. The output is or-

ganized in the following sections: file name, sequence printout, composi-

tional analysis, charge distributional analysis (charge clusters; high

scoring (un)charged segments; charge runs and patterns), distribution of

other amino acid types (high scoring hydrophobic and transmembrane seg-

ments; cysteine spacings), repetitive structures (in the amino acid alpha-

bet and in a 11-letter reduced alphabet), multiplets (counts, spacings,

and clusters in the amino acid and charge alphabets), periodicity

analysis, spacing analysis. Each section is annotated below under its sec-

tion title.

The SAPS program was developed in the group of Prof. Samuel Karlin at

Stanford University. Correspondence relating to SAPS should be addressed

to either Volker Brendel or Samuel Karlin at the Department of Mathemat-

ics, Stanford University, Stanford CA 94305, U.S.A.; phone: (415) 723-

2209; fax: (415) 725-2040; email: volker@gnomic.stanford.edu. Users of the

program should cite the following reference: Brendel, V., Bucher, P.,

Nourbakhsh, I., Blaisdell, B.E., Karlin, S. (1992) Methods and algorithms

for statistical analysis of protein sequences. Proc. Natl. Acad. Sci. USA

89: 2002-2006.

Protein 1 (File: wwwtmp/.SAPS.19208.6283.seq)

SWISS-PROT ANNOTATION: ID unknown

DE unknown, 549 bases, 502107F4 checksum.

number of residues: 545; molecular weight: 57.1 kdal

1 SPPFLICSAL TYFLAGELLI NSSALMNELL ATYPHIMRIM GNFLICPESV MAQVINTNSL

61 SLLTQNNLNK SQSALGTAIE RLSSGLRINS AKDDAAGQAI ANRFTANIKG LTQASRNAND

121 GISIAQTTEG ALNEINNNLQ RVRELAVQSA NSTNSQSDLD SIQAEITQRL NEIDRVSGQT

181 QFNGVKVLAQ DNTLTIQVGA NDGETIDIDL KQINSQTLGL DTLNVQQKYK VSDTAATVTG

241 YADTTIALDN STFKASATGL GGTDQKIDGD LKFDDTTGKY YAKVTVTGGT GKDGYYEVSV

301 DKTNGEVTLA GGATSPLTGG LPATATEDVK NVQVANADLT EAKAALTAAG VTGTASVVKM

361 SYTDNNGKTI DGGLAVKVGD DYYSATQNKD GSISINTTKY TADDGTSKTA LNKLGGADGK

421 TEVVSIGGKT YAASKAEGHN FKAQPDLAEA AATTTENPLQ KIDAALAQVD TLRSDLGAVQ

481 NRFNSAITNL GNTVNNLTSA RSRIEDSDYA TEVSNMSRAQ ILQQAGTSVL AQANQVPQNV


蛋白质结构预测.doc 将本文的Word文档下载到电脑 下载失败或者文档不完整,请联系客服人员解决!

下一篇:贵州省情期末复习题答案2011.7

相关阅读
本类排行
× 注册会员免费下载(下载后可以自由复制和排版)

马上注册会员

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信: QQ: