Gene BURPS668_0537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0537 
Symbol 
ID4884660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp506850 
End bp509453 
Gene Length2604 bp 
Protein Length867 aa 
Translation table11 
GC content73% 
IMG OID640126465 
Productphosphoryl transfer system, HPr/phosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001057590 
Protein GI126439503 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.394986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGCAAC AAGCATCCCA CGACCAGATC GTGCTGGTCG CTCCGCTGAC GGGGCCCGTC 
GTGCCGCTCG CCGACGTACC CGATCCCGTG TTCTCGGGCG GCATGTTCGG CGACGGCATC
GGCATCGATC CGCTCGAGGG CCGGCTGCTC GCGCCGTGCG CGGGCGTCGT GTCGCACGTC
GCGCGCACCG GCCACGCGGT GACGATCGCG GCCGACGGCG GCGCGGAGAT CCTGCTGCAC
ATCGGCATCG ACACGGTCGA GCTGAACGGG CTCGGCTTCA CGGCGAAGAT CGCCGAGGGC
GCGCGCGTCG CGGCGGGCGA TCTGCTGATC GAATTCGATC AGGACGCGAT CGCGCGCGCC
GCGCACAGCC TCGTATCGGT GATCGCGATC GCGAACTCGG ATGCGTTCGA AGTCGTCGAG
CGCGCGGGCG CGGGCGTCGT GAAAGCGGGC GAGACGCCGC TGCTCGCGCT GCGCGCGCGC
GGCGCGGATG CAAGTGCGGA TGCAAGTGCG GGTACAAGTG CGAGTGCGGG CGCGGCTGCT
GACGCGAGCT GCGCGCAGCC CGCCGCCGAA GCGCGCAGGT CGATCACGCT CACGCAGCCG
GGCGGCCTGC ACGCGCGGCC GGCCGCGCGC GCGCGCGAGG CGGCGCGCGG GCTCGACGCG
CACGTCGACG TGCACTTCGA AGGGCGCAAG GCGGCGCTGC AAAGCGTGGT CGGGCTGCTC
GGGCTCGGCG CGGGCGAGCA TGCGACGATC GAGCTCGTCG CGACGGGCCG CGACGCGGCG
AAGGCGCTCG AGCGCGTCGC GCACGAGCTG CTGCGCGAGG CGCACGGCGA GGCCGAAGAG
AAGCCGGCGC GCATCGTGTC GCCCGCGCCC GCCGCCGCCG CGGGCATCGC GCGCGCGCCG
CTCGAGCCGA ACACGCTCGC GGGCGTGTGC GCGGCGCCCG GCATCGCGGT CGGCACGCTC
GTGCGCTGGG ATGATGCGCA GATCGTGCCG CCCGAGCTCG CGAGCGGCAC GCCGGCGGCC
GAGAGCCGGC TGCTCGACCG CGCGCTCGCC GAAGTCGACG CGCAACTCGA GACGACGGTG
CGCGAAGCGT CGCGGCGCGG CGCGATCGGC GAAGCGGGCA TCTTCGCCGT GCATCGCGTG
CTGCTCGAAG ATCCGGCGCT CGTCGACGCC GCGCGCGACC TGATCAGCCT CGGCAAGAGC
GCGGGCTACG CGTGGCGCGA GACGATCCGC GCGCAGACGG CCGTGCTCGC CGACGTCGAC
GACACGCTCC TCGCCGAGCG CGCGGCCGAT CTGCGCGACA TCGACAAGCG CGTGCTGCGC
GCGCTCGGCT ATGCGAGCGC GAGCGCGCGC GAGCTGCCCG CCGAAGCGGT GCTCGCGGCG
GAGGAGTTCA CGCCGTCCGA TCTCGCGTCG CTCGATCGCG AGCGCGTCGC GGCGCTCGTG
ATGGCGCGCG GCGGCGCAAC CTCGCATGCG GCGATCATCG CGCGGCAGTT GGGCATTCCG
GCGCTCGTCG CGGTGGGCGA CGCGCTGTAC GCGATTGCGC AGCGCACACA GGTCGTCGTC
GACGCGAGCG CCGGCCGCCT CGAATACGCG CCGAGCGCGC TCGACGTCGA GCGCGCGCGT
CACGAGCGGC AGCGCCTTGC CGGCGTGCGC GAGGCGAACC GGCGGATGTC GGGCGAGGCG
GCGCTCACGC GCGACGGCCA CCGGATCGAG GTGGCCGCGA ACATCGCGAC GCTCGACGAC
GCGCGCGTCG CGCTCGACAA CGGCGCCGAC GCGGTCGGCC TGCTGCGCAC CGAACTGATG
TTCATCCATC GTCAGGCGGC GCCGACGGCG TCCGAGCATC AGCAGAGCTA TCAATCGATC
GTCGACGCGC TGCAAGGGCG CGCCGCGATC ATCCGCACGC TCGACGTCGG CGCGGACAAG
GAAGTCGATT ACCTGACGCT GCCGCCCGAG CCGAACCCGG CGCTCGGCCT GCGCGGGATC
CGTCTCGCGC AGGTGCGCCC CGATCTGCTC GACGACCAGT TGCGGGGCCT GCTCGCCGTG
AAGCCGTACG GCTCGGTGCG CATCCTGCTG CCGATGGTGA CGGACGTGGG CGAGCTCGTG
CGGATCCGCA AGCGCATCGA CGATTTCGCG CGCGCGATGG GCCGCGCGCA GGCCGTCGAG
GTCGGCGTGA TGATCGAAGT GCCGTCGGCC GCGCTTCTCG CGGATCAACT CGCGCAGCAC
GCGGACTTCC TGTCGATCGG CACGAACGAT CTCACGCAGT ACACGCTCGC GATGGACCGC
TGCCAGGCGG ATCTCGCCGC GCAGGCGGAC GGCCTGCATC CGGCCGTGCT GCGGCTCGTC
GACGCGACGG TGCGCGGCGC CGAGAAGCAC GGCAAGTGGG TCGGCGTGTG CGGCGCGCTG
GGCGGCGATC CGGTCGCGGT GCCGGTGCTC GTCGGCCTCG GCGTGACGGA GTTGTCGGTG
GACCCGGTGT CGGTGCCGGG CATCAAGGCG CAGGTGCGCC GTCTCGATTA CCAGCTGTGC
CGCCAGCGCG CGCAAGACCT GCTCGCGCTC GAATCGGCGC AGGCGGTGAG GGCAGCAAGC
CGCGAGATCT GGCCGGCGGA ATGA
 
Protein sequence
MKQQASHDQI VLVAPLTGPV VPLADVPDPV FSGGMFGDGI GIDPLEGRLL APCAGVVSHV 
ARTGHAVTIA ADGGAEILLH IGIDTVELNG LGFTAKIAEG ARVAAGDLLI EFDQDAIARA
AHSLVSVIAI ANSDAFEVVE RAGAGVVKAG ETPLLALRAR GADASADASA GTSASAGAAA
DASCAQPAAE ARRSITLTQP GGLHARPAAR AREAARGLDA HVDVHFEGRK AALQSVVGLL
GLGAGEHATI ELVATGRDAA KALERVAHEL LREAHGEAEE KPARIVSPAP AAAAGIARAP
LEPNTLAGVC AAPGIAVGTL VRWDDAQIVP PELASGTPAA ESRLLDRALA EVDAQLETTV
REASRRGAIG EAGIFAVHRV LLEDPALVDA ARDLISLGKS AGYAWRETIR AQTAVLADVD
DTLLAERAAD LRDIDKRVLR ALGYASASAR ELPAEAVLAA EEFTPSDLAS LDRERVAALV
MARGGATSHA AIIARQLGIP ALVAVGDALY AIAQRTQVVV DASAGRLEYA PSALDVERAR
HERQRLAGVR EANRRMSGEA ALTRDGHRIE VAANIATLDD ARVALDNGAD AVGLLRTELM
FIHRQAAPTA SEHQQSYQSI VDALQGRAAI IRTLDVGADK EVDYLTLPPE PNPALGLRGI
RLAQVRPDLL DDQLRGLLAV KPYGSVRILL PMVTDVGELV RIRKRIDDFA RAMGRAQAVE
VGVMIEVPSA ALLADQLAQH ADFLSIGTND LTQYTLAMDR CQADLAAQAD GLHPAVLRLV
DATVRGAEKH GKWVGVCGAL GGDPVAVPVL VGLGVTELSV DPVSVPGIKA QVRRLDYQLC
RQRAQDLLAL ESAQAVRAAS REIWPAE