Gene BURPS1710b_0723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_0723 
Symbol 
ID3690231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp732723 
End bp735356 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content73% 
IMG OID637727179 
ProductPTS system, glucose-specific EIIA/HPr/phosphoenolpyruvate-protein phosphotransferase components 
Protein accessionYP_332137 
Protein GI76808751 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGTG CCGAGGAGTC ACAGTTGAAG CAACAAGCAT CCCACGACCA GATCGTGCTG 
GTCGCTCCGC TGACGGGGCC CGTCGTGCCG CTCGCCGACG TACCCGATCC CGTGTTCTCG
GGCGGCATGT TCGGCGACGG CATCGGCATC GATCCGCTCG AGGGCCGGCT GCTCGCGCCG
TGCGCGGGCG TCGTGTCGCA CGTCGCGCGC ACCGGCCACG CGGTGACGAT CGCGGCCGAC
GGCGGCGCGG AGATCCTGCT GCACATCGGC ATCGACACGG TCGAGCTGAA CGGGCTCGGC
TTCACGGCGA AGATCGCCGA GGGCGCGCGC GTCGCGGCGG GCGATCTGCT GATCGAATTC
GATCAGGACG CGATCGCGCG CGCCGCGCAC AGCCTCGTAT CGGTGATCGC GATCGCGAAC
TCGGATGCGT TCGAAGTCGT CGAGCGCGCG GGCGCGGGCG TCGTGAAAGC GGGCGAGACG
CCGCTGCTCG CGCTGCGCGC GCGCGGCGCG GATGCAAGTG CGGGTACAAG TGCGGGTACA
AGTGCGAGTG CGAGTGCGGG CGCGGCTGCT GACGCGAGCT GCGCGCAGCC CGCCGCCGAA
GCGCGCAAGT CGATCACGCT CACGCAGCCG GGCGGCCTGC ACGCGCGGCC GGCCGCGCGC
GCGCGCGAGG CGGCGCGCGG GCTCGACGCG CACGTCGACG TGCACTTCGA AGGGCGCAAG
GCGGCGCTGC AAAGCGTGGT CGGGCTGCTC GGGCTCGGCG CGGGCGAGCA TGCGACGATC
GAGCTCGTCG CGACGGGCCG CGACGCGGCG AAGGCGCTCG AGCGCGTCGC GCACGAGCTG
CTGCGCGAGG CGCACGGCGA GGCCGAAGAG AAGCCGGCAC GCATCGTGTC GCCCGCGCCC
GCCGCCGCCG CGGGCATCGC GCGCGCGCCG CTCGAGCCGA ACACGCTCGC GGGCGTGTGC
GCGGCGCCCG GCATCGCGGT CGGCACGCTC GTGCGCTGGG ATGATGCGCA GATCGTGCCG
CCCGAGCTCG CGAGCGGCAC GCCGGCGGCC GAGAGCCGGC TGCTCGACCG CGCGCTCGCC
GAAGTCGACG CGCAACTCGA GACGACGGTG CGCGAAGCGT CGCGGCGCGG CGCGATCGGC
GAAGCGGGCA TCTTCGCCGT GCATCGCGTG CTGCTCGAAG ATCCGGCGCT CGTCGACGCC
GCGCGCGACC TGATCAGCCT CGGCAAGAGC GCGGGCTACG CGTGGCGCGA GACGATCCGC
GCGCAGACGG CCGTGCTCGC CGACGTCGAC GACACGCTCC TCGCCGAGCG CGCGGCCGAT
CTGCGCGACA TCGACAAGCG CGTGCTGCGC GCGCTCGGCT ATGCGAGCGC GAGCGCGCGC
GAGCTGCCCG CCGAAGCGGT GCTCGCCGCG GAGGAGTTCA CGCCGTCCGA TCTCGCGTCG
CTCGATCGCG AGCGCGTCGC GGCGCTCGTG ATGGCGCGCG GCGGCGCAAC CTCGCATGCG
GCGATCATCG CGCGGCAGTT GGGCATTCCG GCGCTCGTCG CGGTGGGCGA CGCGCTGTAC
GCGATTGCGC AGCGCACACA GGTCGTCGTC GACGCGAGCG CCGGCCGCCT CGAATACGCG
CCGAGCGCGC TCGACGTCGA GCGCGCGCAT CACGAGCGGC AGCGCCTTGC CGGCGTGCGC
GAGGCGAACC GGCGGATGTC GGGCGAGGCG GCGCTCACGC GCGACGGCCA CCGGATCGAG
GTGGCCGCGA ACATCGCGAC GCTCGACGAC GCGCGCGTCG CGCTCGACAA CGGCGCCGAC
GCGGTCGGCC TGCTGCGCAC CGAGCTGATG TTCATCCATC GTCAGGCGGC GCCGACGGCG
TCCGAGCATC AGCAGAGCTA TCAATCGATC GTCGACGCGC TGCAAGGGCG CACCGCGATC
ATCCGCACGC TCGACGTCGG CGCGGACAAG GAAGTCGATT ACCTGACGCT GCCGCCCGAG
CCGAACCCGG CGCTCGGCCT GCGCGGGATC CGTCTCGCGC AGGTGCGCCC CGATCTGCTC
GACGACCAGT TGCGGGGCCT GCTCGCCGTG AAGCCGTACG GCTCGGTGCG CATCCTGCTG
CCGATGGTGA CGGACGTGGG CGAGCTCGTG CGGATCCGCA AGCGCATCGA CGATTTCGCG
CGCGCGATGG GCCGCGCGCA GGCCGTCGAG GTCGGCGTGA TGATCGAAGT GCCGTCGGCC
GCGCTTCTCG CGGATCAACT CGCGCAGCAC GCGGACTTCC TGTCGATCGG CACGAACGAT
CTCACGCAGT ACACGCTCGC GATGGACCGC TGCCAGGCGG ATCTCGCCGC GCAGGCGGAC
GGCCTGCATC CGGCCGTGCT GCGGCTCGTC GACGCGACGG TGCGCGGTGC CGAGAAGCAC
GGCAAGTGGG TCGGCGTGTG CGGCGCGCTG GGCGGCGATC CGGTCGCGGT GCCGGTGCTC
GCCGGCCTCG GCGTGACGGA GTTGTCGGTG GACCCGGTGT CGGTGCCGGG CATCAAGGCG
CAGGTGCGCC GTCTCGATTA CCAGCTGTGC CGCCAGCGCG CGCAAGACCT GCTCGCGCTC
GAATCGGCGC AGGCGGTGAG GGCAGCAAGC CGCGAGATCT GGCCGGCGGA ATGA
 
Protein sequence
MRRAEESQLK QQASHDQIVL VAPLTGPVVP LADVPDPVFS GGMFGDGIGI DPLEGRLLAP 
CAGVVSHVAR TGHAVTIAAD GGAEILLHIG IDTVELNGLG FTAKIAEGAR VAAGDLLIEF
DQDAIARAAH SLVSVIAIAN SDAFEVVERA GAGVVKAGET PLLALRARGA DASAGTSAGT
SASASAGAAA DASCAQPAAE ARKSITLTQP GGLHARPAAR AREAARGLDA HVDVHFEGRK
AALQSVVGLL GLGAGEHATI ELVATGRDAA KALERVAHEL LREAHGEAEE KPARIVSPAP
AAAAGIARAP LEPNTLAGVC AAPGIAVGTL VRWDDAQIVP PELASGTPAA ESRLLDRALA
EVDAQLETTV REASRRGAIG EAGIFAVHRV LLEDPALVDA ARDLISLGKS AGYAWRETIR
AQTAVLADVD DTLLAERAAD LRDIDKRVLR ALGYASASAR ELPAEAVLAA EEFTPSDLAS
LDRERVAALV MARGGATSHA AIIARQLGIP ALVAVGDALY AIAQRTQVVV DASAGRLEYA
PSALDVERAH HERQRLAGVR EANRRMSGEA ALTRDGHRIE VAANIATLDD ARVALDNGAD
AVGLLRTELM FIHRQAAPTA SEHQQSYQSI VDALQGRTAI IRTLDVGADK EVDYLTLPPE
PNPALGLRGI RLAQVRPDLL DDQLRGLLAV KPYGSVRILL PMVTDVGELV RIRKRIDDFA
RAMGRAQAVE VGVMIEVPSA ALLADQLAQH ADFLSIGTND LTQYTLAMDR CQADLAAQAD
GLHPAVLRLV DATVRGAEKH GKWVGVCGAL GGDPVAVPVL AGLGVTELSV DPVSVPGIKA
QVRRLDYQLC RQRAQDLLAL ESAQAVRAAS REIWPAE