Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0537 |
Symbol | |
ID | 4884660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 506850 |
End bp | 509453 |
Gene Length | 2604 bp |
Protein Length | 867 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640126465 |
Product | phosphoryl transfer system, HPr/phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001057590 |
Protein GI | 126439503 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1925] Phosphotransferase system, HPr-related proteins [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00830] PTS system, glucose subfamily, IIA component [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.394986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGCAAC AAGCATCCCA CGACCAGATC GTGCTGGTCG CTCCGCTGAC GGGGCCCGTC GTGCCGCTCG CCGACGTACC CGATCCCGTG TTCTCGGGCG GCATGTTCGG CGACGGCATC GGCATCGATC CGCTCGAGGG CCGGCTGCTC GCGCCGTGCG CGGGCGTCGT GTCGCACGTC GCGCGCACCG GCCACGCGGT GACGATCGCG GCCGACGGCG GCGCGGAGAT CCTGCTGCAC ATCGGCATCG ACACGGTCGA GCTGAACGGG CTCGGCTTCA CGGCGAAGAT CGCCGAGGGC GCGCGCGTCG CGGCGGGCGA TCTGCTGATC GAATTCGATC AGGACGCGAT CGCGCGCGCC GCGCACAGCC TCGTATCGGT GATCGCGATC GCGAACTCGG ATGCGTTCGA AGTCGTCGAG CGCGCGGGCG CGGGCGTCGT GAAAGCGGGC GAGACGCCGC TGCTCGCGCT GCGCGCGCGC GGCGCGGATG CAAGTGCGGA TGCAAGTGCG GGTACAAGTG CGAGTGCGGG CGCGGCTGCT GACGCGAGCT GCGCGCAGCC CGCCGCCGAA GCGCGCAGGT CGATCACGCT CACGCAGCCG GGCGGCCTGC ACGCGCGGCC GGCCGCGCGC GCGCGCGAGG CGGCGCGCGG GCTCGACGCG CACGTCGACG TGCACTTCGA AGGGCGCAAG GCGGCGCTGC AAAGCGTGGT CGGGCTGCTC GGGCTCGGCG CGGGCGAGCA TGCGACGATC GAGCTCGTCG CGACGGGCCG CGACGCGGCG AAGGCGCTCG AGCGCGTCGC GCACGAGCTG CTGCGCGAGG CGCACGGCGA GGCCGAAGAG AAGCCGGCGC GCATCGTGTC GCCCGCGCCC GCCGCCGCCG CGGGCATCGC GCGCGCGCCG CTCGAGCCGA ACACGCTCGC GGGCGTGTGC GCGGCGCCCG GCATCGCGGT CGGCACGCTC GTGCGCTGGG ATGATGCGCA GATCGTGCCG CCCGAGCTCG CGAGCGGCAC GCCGGCGGCC GAGAGCCGGC TGCTCGACCG CGCGCTCGCC GAAGTCGACG CGCAACTCGA GACGACGGTG CGCGAAGCGT CGCGGCGCGG CGCGATCGGC GAAGCGGGCA TCTTCGCCGT GCATCGCGTG CTGCTCGAAG ATCCGGCGCT CGTCGACGCC GCGCGCGACC TGATCAGCCT CGGCAAGAGC GCGGGCTACG CGTGGCGCGA GACGATCCGC GCGCAGACGG CCGTGCTCGC CGACGTCGAC GACACGCTCC TCGCCGAGCG CGCGGCCGAT CTGCGCGACA TCGACAAGCG CGTGCTGCGC GCGCTCGGCT ATGCGAGCGC GAGCGCGCGC GAGCTGCCCG CCGAAGCGGT GCTCGCGGCG GAGGAGTTCA CGCCGTCCGA TCTCGCGTCG CTCGATCGCG AGCGCGTCGC GGCGCTCGTG ATGGCGCGCG GCGGCGCAAC CTCGCATGCG GCGATCATCG CGCGGCAGTT GGGCATTCCG GCGCTCGTCG CGGTGGGCGA CGCGCTGTAC GCGATTGCGC AGCGCACACA GGTCGTCGTC GACGCGAGCG CCGGCCGCCT CGAATACGCG CCGAGCGCGC TCGACGTCGA GCGCGCGCGT CACGAGCGGC AGCGCCTTGC CGGCGTGCGC GAGGCGAACC GGCGGATGTC GGGCGAGGCG GCGCTCACGC GCGACGGCCA CCGGATCGAG GTGGCCGCGA ACATCGCGAC GCTCGACGAC GCGCGCGTCG CGCTCGACAA CGGCGCCGAC GCGGTCGGCC TGCTGCGCAC CGAACTGATG TTCATCCATC GTCAGGCGGC GCCGACGGCG TCCGAGCATC AGCAGAGCTA TCAATCGATC GTCGACGCGC TGCAAGGGCG CGCCGCGATC ATCCGCACGC TCGACGTCGG CGCGGACAAG GAAGTCGATT ACCTGACGCT GCCGCCCGAG CCGAACCCGG CGCTCGGCCT GCGCGGGATC CGTCTCGCGC AGGTGCGCCC CGATCTGCTC GACGACCAGT TGCGGGGCCT GCTCGCCGTG AAGCCGTACG GCTCGGTGCG CATCCTGCTG CCGATGGTGA CGGACGTGGG CGAGCTCGTG CGGATCCGCA AGCGCATCGA CGATTTCGCG CGCGCGATGG GCCGCGCGCA GGCCGTCGAG GTCGGCGTGA TGATCGAAGT GCCGTCGGCC GCGCTTCTCG CGGATCAACT CGCGCAGCAC GCGGACTTCC TGTCGATCGG CACGAACGAT CTCACGCAGT ACACGCTCGC GATGGACCGC TGCCAGGCGG ATCTCGCCGC GCAGGCGGAC GGCCTGCATC CGGCCGTGCT GCGGCTCGTC GACGCGACGG TGCGCGGCGC CGAGAAGCAC GGCAAGTGGG TCGGCGTGTG CGGCGCGCTG GGCGGCGATC CGGTCGCGGT GCCGGTGCTC GTCGGCCTCG GCGTGACGGA GTTGTCGGTG GACCCGGTGT CGGTGCCGGG CATCAAGGCG CAGGTGCGCC GTCTCGATTA CCAGCTGTGC CGCCAGCGCG CGCAAGACCT GCTCGCGCTC GAATCGGCGC AGGCGGTGAG GGCAGCAAGC CGCGAGATCT GGCCGGCGGA ATGA
|
Protein sequence | MKQQASHDQI VLVAPLTGPV VPLADVPDPV FSGGMFGDGI GIDPLEGRLL APCAGVVSHV ARTGHAVTIA ADGGAEILLH IGIDTVELNG LGFTAKIAEG ARVAAGDLLI EFDQDAIARA AHSLVSVIAI ANSDAFEVVE RAGAGVVKAG ETPLLALRAR GADASADASA GTSASAGAAA DASCAQPAAE ARRSITLTQP GGLHARPAAR AREAARGLDA HVDVHFEGRK AALQSVVGLL GLGAGEHATI ELVATGRDAA KALERVAHEL LREAHGEAEE KPARIVSPAP AAAAGIARAP LEPNTLAGVC AAPGIAVGTL VRWDDAQIVP PELASGTPAA ESRLLDRALA EVDAQLETTV REASRRGAIG EAGIFAVHRV LLEDPALVDA ARDLISLGKS AGYAWRETIR AQTAVLADVD DTLLAERAAD LRDIDKRVLR ALGYASASAR ELPAEAVLAA EEFTPSDLAS LDRERVAALV MARGGATSHA AIIARQLGIP ALVAVGDALY AIAQRTQVVV DASAGRLEYA PSALDVERAR HERQRLAGVR EANRRMSGEA ALTRDGHRIE VAANIATLDD ARVALDNGAD AVGLLRTELM FIHRQAAPTA SEHQQSYQSI VDALQGRAAI IRTLDVGADK EVDYLTLPPE PNPALGLRGI RLAQVRPDLL DDQLRGLLAV KPYGSVRILL PMVTDVGELV RIRKRIDDFA RAMGRAQAVE VGVMIEVPSA ALLADQLAQH ADFLSIGTND LTQYTLAMDR CQADLAAQAD GLHPAVLRLV DATVRGAEKH GKWVGVCGAL GGDPVAVPVL VGLGVTELSV DPVSVPGIKA QVRRLDYQLC RQRAQDLLAL ESAQAVRAAS REIWPAE
|
| |