LOADING...
   NP_251051.1;SP19117A
Protein Sequence Comparative Analysis   (PSCA)
 
  Back to Target List
 
NP_251051.1    SP19117A            JCSG 413324 Proposed Target 09-NOV-12

Protein Sequence Information


TOPSAN
JCSG Internal annotations

ACCESSIONNP_251051.1
DESCRIPTIONgi|15597557|ref|NP_251051.1| hypothetical protein [Pseudomonas aeruginosa PA01]
ORGANISMPSEUDOMONAS AERUGINOSA
COMMENT 
DR UNIPROT;
DR SPTR;
DR GenBank;
DR FFAS; 413324; Fold and function assignment.
DR TVPC; NP_251051.1; Homologs in PDB, JCSG and SG center.
DR OVP; SP19117A; Ortholog view popup.
DR TPM; NP_251051.1; Target PDB monitor.
DR FSS; NP_251051.1; Target function coverage.
PROPERTY Residues: 1271 aa
Molecule Weight: 141339.53 Dalton
Isoelectric Point: 5.84
Extinction Coefficient: 249230
Gravy Index: -.22
Number of Met residues: 17
Percentage of Met residues: 1.34 %
Number of Cys residues: 7
Percentage of Cys residues: 0.55 %
SEQUENCE  amino acids 1271 aa
>NP_251051.1  gi|15597557|ref|NP_251051.1| hypothetical protein [Pseudomonas aeruginosa PA01]
MSGATLFMLVLLLVLLALLLGALGWWWRTRGGTEIRSFYAAVRQMEREQGWQGRYEAPWL
LMLGNETEGEQLCSTWRLLPVARPAWFGRWWSDGEGAILLVPESVFLPDEGLRRQSGAWL
RLLRLFLRLRGRRALDGVVWNIPLARLQDGEQAANLGLAARRRYVELTQRLGLSLPVYVV
ITGMEDLPGFQELLAALPEEARERALGWSSPFAAEAAWQSRWCEQALEEITATLTESIVE
LGTLRGQVDNELYCLPPRLESLRGSLQALLEPVFQGNARGEAPRFRGLYLSGSEAAGAAA
DEVLPAVDAPRRRSSFASQLWARRILAEEGLAQAVPRILQLRQRWQRGIGLAALCLCLLW
GGLMTWVWRDALRDAGELSQLLHGASERYQPLDDDTRRAAQVRQNVQAWWQLVSQAPRWR
FTSLAFPSSWFSSLDARIDNAYRRVSERLLVRPLRSLLEGEASDLRAIRSDGQPGLKEGD
DPSQWKDYLAAKDLVARATTLERHNRLFAQAIDNRRTPLDDLLQLSNDALGSSYNAGSLA
RLAYYNRTLFAERPADLAALDLGRVAATAADNFHDLMARWLNAYFLTDSLERTGNALTQE
LAQLGQERDATAAQLLGIGELINHLQQQVNRINLIWGNGVGQDLVPGYQNLLKSAQQSSL
LGNRVEELQNLTVQLQQQFRSEWIAPPEAAGDGLLVKQGATLKLADDLLGLKRAIDDLKT
KDFVSVALADKTLADHSLLSIDDIGVTQALSFFNDFTGYYESTLPSLNPRYRYSVLHAAA
SAATEAMWQSLGPRSRSLASRNASRFDVQVKQVQVLQAALLELQDLQAGARLLLSLNALA
VADIGQALRDIDEQAVLREPVDFSRWSGAPNFGLQMFRAQDSAELKQSLNSQFNAMAAVA
ERHAPALEWLQAQQSSLATADYNAFVRFSALNAELQKFKADNPASTAAQLAKLVGNDFNQ
MDIGSCADILNGVLLPSSRSDLATRLVDLRQGALGRCQSLQQQQAAQAWKDLADYFNQYL
AGRFPFAYSLEAADAEPGRVRHLLQLMETRLPQVREGLAQVRSPDLPAAEDFVRRLEQAQ
RWLGPLFERDKSGLLGVALDIDWRSDRSLERGADQVIAWSLYSGDQESRFPGAQDKGLTW
NVGDPLKIMLRWAKNGSQRPADDPRQASLAVADLEAGWSYQGSWALLRMMRAHFSRQRPP
NVDYTEFPLVLQLPVYAPYSPENEARMFLRLSLMSLGGKTPLSIQPLPVRAPQSPFATLL
PATVASTGGTP

CDS  cDNA 3816 bp
   1 atgagcggcg cgacgctgtt catgctggtt ctcctcctgg tgctgctggc gctgctgctg    60
  61 ggggcgctgg gctggtggtg gcgaacccgc ggtggtacgg agatccgcag cttctacgcg   120
 121 gcggtccggc agatggagcg cgagcagggc tggcagggcc gctacgaagc gccgtggctg   180
 181 ctgatgctcg gcaacgagac cgagggcgag cagttgtgct ccacctggcg tctgctgccg   240
 241 gtggcgcgcc cggcctggtt cggccgctgg tggtcggacg gcgagggggc gatcctgctg   300
 301 gtgcccgagt cggtgttcct gcccgacgag ggcctgcgcc ggcaaagcgg cgcctggctg   360
 361 cgcctgttgc gtctgttcct gcgcctgcgc ggcaggcgcg cgctggacgg agtggtctgg   420
 421 aacattccgc tggcgcggct gcaggatggc gagcaggcgg ccaacctcgg cctcgccgcg   480
 481 cgccgtcgct acgtggaact gacccagcgc cttggcctga gcctgccggt ttacgtggtg   540
 541 atcactggca tggaagacct gcctggcttc caggaacttc tcgcggcgct gcccgaggag   600
 601 gcgcgcgagc gcgccctcgg ctggtcctcg ccgttcgccg ccgaagcggc ctggcagtcg   660
 661 cgctggtgcg agcaggcgct ggaggaaatc accgccaccc tgaccgagtc gatcgtcgaa   720
 721 ctcggtacct tgcgcggcca ggtggacaat gagctgtatt gcctgccgcc acgcctggaa   780
 781 agtctccgcg gttcgttgca ggcgctgctc gaaccggtct tccagggcaa cgccaggggc   840
 841 gaggcgccgc gctttcgcgg tctctacctg agcggcagcg aggcggcggg cgcggcagcc   900
 901 gacgaagtcc tgccggcggt cgacgcgccg cgtcggcgca gcagcttcgc cagccaactg   960
 961 tgggcacggc ggatccttgc cgaggaaggc ctggcccagg cggtgccgcg catcctccag  1020
1021 ctgcgtcagc gctggcagcg cgggatcggc ctcgccgcgt tgtgcctgtg cctgctctgg  1080
1081 ggcggcctga tgacctgggt ctggcgcgac gcgctgcgcg acgccggtga actgtcgcaa  1140
1141 ctgctgcatg gcgccagtga gcgctaccag ccgctcgacg acgacactcg gcgcgccgcc  1200
1201 caggtccggc agaacgtcca ggcctggtgg caactggtgt cacaggcgcc gcgctggcgc  1260
1261 ttcacttcgc tggcgtttcc cagttcctgg ttttcctccc tggatgcacg tatcgacaac  1320
1321 gcgtatcgcc gcgtttccga gcgcctgctg gtgcgaccgc tgcgcagcct gctggaaggc  1380
1381 gaggcgagcg acctgcgggc gatccgcagc gatggccaac ccggcctgaa ggagggcgac  1440
1441 gaccccagcc agtggaagga ctacctggcg gcgaaggacc tggtcgcccg cgccactacg  1500
1501 ctggagcgcc acaatcgctt gttcgcccag gccatcgaca accgcaggac gccgctcgac  1560
1561 gaccttctgc aactgagtaa cgatgccttg ggcagcagct acaacgccgg cagcctggcg  1620
1621 cgcctggcct attacaaccg cacgctgttc gccgagcgtc cggccgatct cgccgccctc  1680
1681 gacctgggcc gggtcgccgc cacggcggcg gataacttcc atgacctgat ggcacgctgg  1740
1741 ctgaacgcct acttcctcac cgacagcctg gaacgcaccg gcaatgccct gacccaggaa  1800
1801 ctcgcgcagc tcggccagga acgagacgcc acggcggccc agttgctggg gatcggcgaa  1860
1861 ttgatcaacc acctgcagca acaggtcaac cgcatcaacc tgatctgggg caatggcgtc  1920
1921 ggccaggatc tcgtccctgg ctaccagaac ctgctcaaga gcgcccagca aagcagcttg  1980
1981 ctgggaaacc gcgtggagga actgcagaac ctcaccgtgc agttgcagca gcagtttcgc  2040
2041 agcgaatgga tcgcgcctcc ggaggctgcc ggcgacggcc tgctggtgaa acagggggcg  2100
2101 acgctcaagc tggccgatga cctgctcggt ctgaaacgcg ccatcgatga cctgaagacc  2160
2161 aaggacttcg tctccgtggc cctggcggac aagacactgg cggaccactc cttgctgagc  2220
2221 atcgacgata tcggcgtgac tcaggcgttg agcttcttca acgacttcac cggctactac  2280
2281 gagagcaccc tgccatcgct caacccgcgc tatcgctatt cggtcctgca cgccgccgcc  2340
2341 agcgcggcga ccgaggccat gtggcaaagc ctcgggccgc gcagtcgtag cctggccagt  2400
2401 cgcaatgcct cgcgtttcga cgtgcaggtc aagcaggtcc aggtgttgca ggcggcgctg  2460
2461 ctggagttgc aggacctgca ggccggcgcg cgcctgctgc tcagcctcaa cgcgctggcc  2520
2521 gtggccgata tcggccaggc cctgcgcgac atcgacgagc aggcggtgct gcgcgagccg  2580
2581 gtcgatttca gccgctggtc gggcgcgccg aatttcggcc tgcagatgtt ccgcgcccag  2640
2641 gacagtgccg agctgaagca aagcctgaac agccagttca atgccatggc ggccgtcgcc  2700
2701 gaacgccacg ccccggcgct ggagtggttg caggcgcagc aatccagcct ggccaccgcc  2760
2761 gactacaacg cgttcgtgcg cttcagcgcg ctcaacgccg aactgcagaa attcaaggcc  2820
2821 gacaacccgg cgagcaccgc ggcgcagttg gcgaagctgg tgggtaacga tttcaaccag  2880
2881 atggacatcg gcagttgcgc cgacattctc aacggcgtgc tcctgccgtc cagccgcagc  2940
2941 gacctggcga cgcgcctggt ggacctgcgc cagggcgcgc tcgggcgctg ccagtcgttg  3000
3001 caacagcagc aggcggcgca ggcctggaag gacctcgccg actacttcaa ccagtacctc  3060
3061 gccggacgct tcccgttcgc ctacagcctg gaagcggccg atgccgagcc ggggcgggtg  3120
3121 cggcacctgc tgcaactgat ggagacgcgc ttgccgcagg tgcgcgaagg gctggcccag  3180
3181 gttcgctcgc cggacctgcc ggccgccgag gacttcgtcc gccgcctgga gcaggcgcag  3240
3241 cgctggctcg gcccgctgtt cgagcgcgac aagtcgggcc tgctcggcgt cgccctggac  3300
3301 atcgactggc gcagcgaccg cagcctggaa cgcggcgccg accaggtgat cgcctggagc  3360
3361 ctgtactcgg gcgaccagga aagccgtttc cccggcgcgc aggacaaggg cctgacctgg  3420
3421 aatgtcggcg atccgctgaa aatcatgctg cgctgggcga agaacggctc gcagcgtccg  3480
3481 gcagacgacc cgcgccaggc cagcctggcg gtggccgacc tcgaggccgg ctggagctac  3540
3541 cagggctcct gggcgctgtt gcgcatgatg cgcgcgcact tctcccggca gcgcccgccg  3600
3601 aacgtcgact acacggagtt tccgctggtc ctgcaactgc cggtgtatgc gccctacagc  3660
3661 ccggaaaacg aggcgcggat gttcctccgc ctgtcgttga tgagcctggg cggcaagacc  3720
3721 ccgctgtcca tccagccgct gccggtgcgc gcgccgcaat cgccgttcgc gacgctgttg  3780
3781 ccggccaccg tcgccagcac cggaggtacc ccatga  3816
Target constructs
Download sequence in PIR or FASTA format
Chemical properties of sequence with tag
GENE PREDICTION
LEFT END RIGHT END FRAME PREDICTOR SCORE 
2608229 2612044 -2 Glimmer3 score 16.59 good
Genemark probabilities .97   .99 good
GenemarkHMM class 1 good

Notes

The start codon is a ATG/Methionine in most of sequences, but the GTG/V, TTG/L, CTG/I can be the start codons in some cases as expressing in E.coli. The start codon warning is labelled by RED color in sequence(sample: 10174951).

Protein sequence information contains the annotation contents from both of JCSG and SWISS-PROT. The SWISS-PROT/TrEMBL annotation is accessed from SWALL(SPTR) on the EBI SRS server. PDB homologes show both identical and highly similar proteins with released date and protein function.


Contact Webmaster JCSG Menu