Protein Sequence Comparative Analysis   (PSCA)
 
 
TM0023    TM0023            JCSG 281904 Protein Purification 04-JUL-20

Protein Sequence Information



ACCESSION TM0023
DESCRIPTION Methyl-accepting chemotaxis protein.
ORGANISM Thermotoga maritima
COMMENT 
DR UNIPROT; Q9WXN0_THEMA
DR SPTR; Q9WXN0 (flat text); Q9WXN0 (good view)
DR GenBank; AAD35117
DR Pfam; PF00015; MCPsignal;
DR Pfam; PF00672; HAMP;
DR Interpro; IPR004089; Chmtaxis_transd;
DR Interpro; IPR003660; HAMP;
DR FFAS; 281904; Fold and function assignment.
DR TVPC; TM0023; Homologs in PDB, JCSG and SG center.
DR OVP; TM0023; Ortholog view popup.
DR TPM; TM0023; Target PDB monitor.
DR FSS; TM0023; Target function coverage.
PROPERTY
Residues: 656 aa
Molecular Weight: 72039.03 Da
Isoelectric Point: 4.8
Extinction Coefficient: 37820
GRAVY Index: -0.15
Number of Met residues: 20
Percentage of Met residues: 3.05 %
Number of Cys residues: 0
Percentage of Cys residues: 0.00 %
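
The residue counts and percentages above can be recomputed directly from the amino acid sequence. The following is a minimal sketch, not the method this page uses (which is not documented here). It assumes the Kyte-Doolittle hydropathy scale for the GRAVY index and 280 nm extinction coefficients of 5500 per Trp and 1490 per Tyr, ignoring any disulfide contribution. The function name `sequence_properties` is hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale for the GRAVY index).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumed molar extinction coefficients at 280 nm (M^-1 cm^-1).
EXT_TRP = 5500
EXT_TYR = 1490


def sequence_properties(seq):
    """Return simple composition statistics for a protein sequence."""
    seq = "".join(seq.split()).upper()  # drop whitespace and line breaks
    n = len(seq)
    met = seq.count("M")
    cys = seq.count("C")
    return {
        "residues": n,
        "met_count": met,
        "met_percent": round(100.0 * met / n, 2),
        "cys_count": cys,
        "cys_percent": round(100.0 * cys / n, 2),
        # Disulfide (cystine) absorbance is ignored in this sketch.
        "extinction": EXT_TRP * seq.count("W") + EXT_TYR * seq.count("Y"),
        "gravy": round(sum(KYTE_DOOLITTLE[a] for a in seq) / n, 2),
    }


# Toy input for illustration; paste the full FASTA body to check the record.
print(sequence_properties("MKWYYC"))
```

With 20 Met residues in 656 aa, the percentage formula gives 100 × 20 / 656 ≈ 3.05 %, matching the value listed above.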
SEQUENCE  amino acids 656 aa
>TM0023  TM0023 methyl-accepting chemotaxis protein
MNIRFRIIFIMVVILVSFVLSAYLVQRSTSSLLISNAKDYMEKSVSSLSKYISQKLNEVQ
RNLKTLLGSSLIGGYTIASNLQSVLQGATDTVSVGIVVDEMSESAYLVLPEKIEEKNYSE
YEKYLSLMKEKNKTSLVLAESIDGKPVLLFVEGVTTFGSEPSGLIALGISLSENTDLWKA
VVEEGKASKSGYGLLVTSDGKVLIHKDMGNFMKDVKELGGFEKAFEEAKSGGEKYVEYEY
NGEKKYTVWEKVPGYDFYIFSTGYLDELLAEGRKATFGTIVTYVVFGGVIFAVLFVSMMP
VVKRMRQQVEKVKRFGEGDLTVEFEAKGKDELTQIEESLKEAVLSLKEMIVSIIEASKEL
SGASEEIKVLSEESHKMSENLHEEAKKILDEANNMSSALTEVTSGVEEVAASAQNISKIT
QDLTERSEAVTKAAREGTERVEAVGGVINKLKGSAERQRDYLRELVDSAKTIGEIVDTIS
SIAEQTNLLALNAAIEAARAGEAGRGFAVVADEIRKLAEESQRATEDIAKMLSSLRATIE
HVENGSKEMFEGVDEIAVMGEEVTKRFREILGRIEEINSMIENTAATAQEQGAAAEEMAS
AMDNVTKIVEGVVESLNRMESLIEDQTESAARVSEAAERLSELSEQLSTLVQKFKV

CDS  cDNA 1968 bp
   1 atgaacataa gattcagaat catattcata atggttgtga ttcttgtttc ttttgttctc    60
  61 agtgcttatc tggttcagag aagtacctcg tctcttctca tcagcaacgc aaaagattac   120
 121 atggagaagt cggtgtcttc tctttcaaaa tacatctccc agaagctgaa cgaggttcaa   180
 181 agaaatctga aaaccctcct tggcagctct ctcataggag ggtacacgat tgcttcgaac   240
 241 cttcagtctg ttcttcaggg ggctacggat acagtctccg ttggtattgt tgtcgatgag   300
 301 atgagcgaat ccgcttattt ggttttacca gaaaaaatag aagagaagaa ctattcagag   360
 361 tacgaaaagt acctcagcct tatgaaggaa aaaaacaaaa cttcgcttgt ccttgccgaa   420
 421 tccatcgatg gaaagcctgt tttgcttttc gtggagggag ttacaacgtt cggttcagaa   480
 481 ccatctggac tgatagcatt aggtatatcc ctctctgaga acaccgatct ctggaaagcg   540
 541 gttgtagaag aagggaaagc gagcaagagc ggctacgggc ttcttgtcac aagcgacggg   600
 601 aaagtactga tccacaaaga catggggaac ttcatgaaag acgtgaagga actaggtggc   660
 661 tttgagaaag cattcgagga agcaaagagt ggtggagaga aatacgtaga gtacgaatac   720
 721 aacggagaga agaaatacac cgtgtgggag aaagtgcccg gatacgactt ctacatcttc   780
 781 tcgacggggt acctggatga acttcttgca gaaggaagga aagcgacctt tgggacgata   840
 841 gtgacgtatg tagtgtttgg cggcgtgatc tttgcggtgc tgtttgtttc gatgatgccg   900
 901 gtagtgaaga gaatgaggca gcaggtagag aaagtgaaga gattcggaga aggggacctg   960
 961 acagtagagt tcgaagcgaa agggaaagat gaactgaccc agatagaaga gagcctgaaa  1020
1021 gaagcggtac tatcactcaa agagatgata gtgagcatca tagaagcttc gaaagagctg  1080
1081 agcggagcat cagaagagat aaaagttctc tcagaagaga gccacaagat gtcagagaac  1140
1141 ctgcacgaag aagccaagaa gatactggac gaggcgaaca acatgagcag tgcgctgaca  1200
1201 gaagtgacga gcggcgtaga agaagtggca gcgagtgcgc agaacatctc aaagatcacc  1260
1261 caggatctga cagaaagatc agaagcggtg acgaaagcgg caagagaagg aacagagaga  1320
1321 gtagaagcgg tgggaggagt cataaacaaa ctcaaagggt cagcagaaag acagagggac  1380
1381 tacctgagag aactcgttga ctcagccaag acgataggag agatagtgga cacgatcagc  1440
1441 tcgatagcag agcagacgaa cctgctcgcg ttgaacgcag cgatagaagc agctcgagct  1500
1501 ggggaagcgg gaaggggctt tgcggttgtg gcagacgaga taaggaaact tgcagaagag  1560
1561 agccagaggg cgacagaaga catagcgaag atgttgagca gtttgagggc aacgatagaa  1620
1621 cacgtggaaa acggctcgaa agagatgttc gaaggagtgg acgagatagc ggtgatggga  1680
1681 gaagaagtca caaagagatt cagagagatc cttggaagga tagaagagat caacagcatg  1740
1741 atagagaaca cagctgccac tgcgcaggaa cagggtgcgg cggcggagga gatggcgagc  1800
1801 gcgatggaca acgtcacaaa gatagtcgaa ggagttgtgg aaagtctaaa cagaatggag  1860
1861 tctctcatag aagatcagac cgagtctgct gccagggtct ctgaggccgc tgaaagactg  1920
1921 tctgagcttt ctgaacagct ctcaacgctc gttcagaagt tcaaagta  1968
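
The numbered cDNA block can be turned back into a contiguous sequence and translated to check it against the protein sequence. The sketch below assumes the standard genetic code and that every letter run of a/c/g/t on a line is sequence while the digits are position markers. The helper names `parse_numbered_dna` and `translate` are hypothetical.

```python
import re

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_numbered_dna(text):
    """Join the base groups of a numbered block, dropping position numbers."""
    return "".join(re.findall(r"[acgtACGT]+", text)).upper()


def translate(dna):
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = ("   1 atgaacataa gattcagaat catattcata "
              "atggttgtga ttcttgtttc ttttgttctc    60")
dna = parse_numbered_dna(first_line)
print(len(dna), translate(dna))  # 60 MNIRFRIIFIMVVILVSFVL
```

The translated first line reproduces the first 20 residues of the protein sequence, and the full length is consistent as well: 1968 bp = 3 × 656 codons, with no stop codon included in the listed cDNA.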
GENE PREDICTION
LEFT END  RIGHT END  FRAME  PREDICTOR    SCORE
19428     21398      -3     Glimmer3     score 16.19 (good)
                            Genemark     probabilities 0.96, 0.99 (good)
                            GenemarkHMM  class 1 (good)
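
The predicted coordinates can be cross-checked against the sequence lengths given earlier. This is a small sketch under stated assumptions: the coordinates are 1-based and inclusive, a negative frame denotes the reverse strand, and the predicted span includes the stop codon. The function name `cds_span` is hypothetical.

```python
def cds_span(left_end, right_end, frame):
    """Return (length in bp, strand) for an inclusive coordinate pair."""
    length = right_end - left_end + 1
    strand = "-" if frame < 0 else "+"
    return length, strand


length, strand = cds_span(19428, 21398, -3)
codons = length // 3          # includes the assumed stop codon
print(length, strand, codons - 1)  # 1971 - 656
```

Under these assumptions the span is 1971 bp, that is 657 codons, which matches the 656-residue protein plus one stop codon and the 1968 bp cDNA listed without it.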

Notes

The start codon is ATG (methionine) in most sequences, but GTG (V), TTG (L), or CTG (L) can serve as the start codon in some cases when expressed in E. coli. A start codon warning is marked in red in the sequence (sample: 10174951).
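
The start codon check described in this note could look like the sketch below. It is illustrative only and is not the site's implementation. The function name `check_start_codon` and the returned labels are hypothetical, and each alternative codon is mapped to the amino acid it encodes in the standard genetic code.

```python
# Alternative start codons and the residues they normally encode.
ALTERNATIVE_STARTS = {"GTG": "V", "TTG": "L", "CTG": "L"}


def check_start_codon(cdna):
    """Classify the first codon of a coding sequence."""
    codon = "".join(cdna.split()).upper()[:3]
    if codon == "ATG":
        return "ok"
    if codon in ALTERNATIVE_STARTS:
        return "warning"  # would be highlighted in red on the page
    return "invalid"


print(check_start_codon("atgaacataa gattcagaat"))
```

For TM0023 the cDNA begins with atg, so no warning applies to this record.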

Protein sequence information contains annotation content from both JCSG and SWISS-PROT. The SWISS-PROT/TrEMBL annotation is accessed from SWALL (SPTR) on the EBI SRS server. PDB homologs show both identical and highly similar proteins, with release date and protein function.

