Gene BMA3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA3171 
Symbol 
ID3089994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006348 
Strand
Start bp3274426 
End bp3276990 
Gene Length2565 bp 
Protein Length854 aa 
Translation table11 
GC content74% 
IMG OID637563730 
ProductPTS system, glucose-specific EIIA/HPr/phosphoenolpyruvate-protein phosphotransferase components 
Protein accessionYP_104650 
Protein GI53724496 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGGTCG CTCCGCTGAC GGGGCCCGTC GTGCCGCTCG CCGACGTACC CGATCCCGTG 
TTCTCGGGCG GCATGTTCGG CGACGGCATC GGCATCGATC CGCTCGAGGG CCGGCTGCTC
GCGCCGTGCG CGGGCGTCGT GTCGCACGTC GCGCGCACCG GCCACGCGGT GACGATCGCG
GCCGACGGCG GCGCGGAGAT CCTGCTGCAC ATCGGCATCG ACACGGTCGA GCTGAACGGG
CTCGGCTTCA CGGCGAAGAT CGCCGAGGGC GCGCGCGTCG CGGCGGGCGA TCTGCTGATC
GAATTCGATC AGGACGCGAT CGCGCGCGCC GCGCACAGCC TCGTATCGGT GATCGCGATC
GCGAACTCGG ATGCGTTCGA AGTCGTCGAG CGCGCGGGCG CGGGCGTCGT GAAAGCGGGC
GAGACGCCGC TGCTCGCGCT GCGCGCGCGC GGCGCGGATG CAAGTGCGGA TGCAAGTGCG
AGTGCGAGTG CGGGCGCGGC TGCTGACGCG AGCTGCGCGC AGCCCGCCGC CGAAGCGCGC
AAGTCGATCA CGCTCACGCA GCCGGGCGGC CTGCACGCGC GGCCGGCCGC GCGCGCGCGC
GAGGCGGCGC GCGGGCTCGA CGCGCACGTC GACGTGCACT TCGAAGGGCG CAAGGCGGCG
CTGCAAAGCG TGGTCGGGCT GCTCGGGCTC GGCGCGGGCG AGCATGCGAC GATCGAGCTC
GTCGCGACGG GCCGCGACGC GGCGAAGGCG CTCGAGCGCG TCGCGCACGA GCTGCTGCGC
GAGGCGCACG GCGAGGCCGA AGAGAAGCCG GCGCGCATCG TGTCGCCCGC GCCCGCCGCC
GCGGGCATCG CGCGCGCGCC GCTCGAGCCG AACACGCTCG CGGGCGTGTG CGCGGCGCCC
GGCATCGCGG TCGGCACGCT CGTGCGCTGG GATGATGCGC AGATCGTGCC GCCCGAGCTC
GCGAGCGGCA CGCCGGCGGC CGAGAGCCGG CTGCTCGACC GCGCGCTCGC CGAAGTCGAC
GCGCAACTCG AGACGACGGT GCGCGAAGCG TCGCGGCGCG GCGCGATCGG CGAAGCGGGC
ATCTTCGCCG TGCATCGCGT GCTGCTCGAA GATCCGGCGC TCGTCGACGC CGCGCGCGAC
CTGATCAGCC TCGGCAAGAG CGCGGGCTAC GCGTGGCGCG AGACGATCCG CGCGCAGACG
GCCGTGCTCG CCGACGTCGA CGACACGCTC CTCGCCGAGC GCGCGGCCGA TCTGCGCGAC
ATCGACAAGC GCGTGCTGCG CGCGCTCGGC TATGCGAGCG CGAGCGCGCG CGAGCTGCCC
GCCGAAGCGG TGCTCGCCGC GGAGGAGTTC ACGCCGTCCG ATCTCGCGTC GCTCGATCGC
GAGCGCGTCG CGGCGCTCGT GATGGCGCGC GGCGGCGCAA CCTCGCATGC GGCGATCATC
GCGCGGCAGT TGGGCATTCC GGCACTCGTC GCGGTGGGCG ATGCGCTGTA CGCGATTGCG
CAGCGCACAC AGGTCGTCGT CGACGCGAGC GCCGGCCGCC TCGAATACGC GCCGAGCGCG
CTCGACGTCG AGCGCGCGCG TCACGAGCGA CAGCGCCTTG CCGGCGTGCG CGAGGCGAAC
CGGCGGATGT CGGGCGAGGC GGCGCTCACG CGCGACGGCC ACCGGATCGA GGTGGCCGCG
AACATCGCGA CGCTCGACGA CGCGCGCGTC GCGCTCGACA ACGGCGCCGA CGCGGTCGGC
CTGCTGCGCA CCGAGCTGAT GTTCATCCAT CGTCAGGCGG CGCCGACGGC GTCCGAGCAT
CAGCAGAGCT ATCAATCGAT CGTCGACGCG CTGCAAGGGC GCACCGCGAT CATCCGCACG
CTCGACGTCG GCGCGGACAA GGAAGTCGAT TACCTGACGC TGCCGCCCGA GCCGAACCCG
GCGCTCGGCC TGCGCGGGAT CCGTCTCGCG CAGGTGCGCC CCGATCTGCT CGACGACCAG
TTGCGGGGCC TGCTCGCCGT GAAGCCGTAC GGCTCGGTGC GCATCCTGCT GCCGATGGTG
ACGGACGTGG GCGAGCTCGT GCGGATCCGC AAGCGCATCG ACGATTTCGC GCGCGCGATG
GGCCGCGCGC AGGCCGTCGA GGTCGGCGTG ATGATCGAAG TGCCGTCGGC CGCGCTTCTC
GCGGATCAAC TCGCGCAGCA CGCGGACTTC CTGTCGATCG GCACGAACGA TCTCACGCAG
TACACGCTCG CGATGGACCG CTGCCAGGCG GATCTCGCCG CGCAGGCGGA CGGCCTGCAT
CCGGCCGTGC TGCGGCTCGT CGACGCGACC GTGCGCGGCG CCGAGAAGCA CGGCAAGTGG
GTCGGCGTGT GCGGCGCGCT GGGCGGCGAT CCGGTCGCGG TGCCGGTGCT CGTCGGCCTC
GGCGTGACGG AGTTGTCGGT GGACCCGGTG TCGGTGCCGG GCATCAAGGC GCAGGTGCGC
CGTCTCGATT ACCAGCTGTG CCGCCAGCGC GCGCAAGACC TGCTCGCGCT CGAATCGGCG
CAGGCGGTGA GGGCAGCAAG CCGCGAGATC TGGCCGGCGG AATGA
 
Protein sequence
MLVAPLTGPV VPLADVPDPV FSGGMFGDGI GIDPLEGRLL APCAGVVSHV ARTGHAVTIA 
ADGGAEILLH IGIDTVELNG LGFTAKIAEG ARVAAGDLLI EFDQDAIARA AHSLVSVIAI
ANSDAFEVVE RAGAGVVKAG ETPLLALRAR GADASADASA SASAGAAADA SCAQPAAEAR
KSITLTQPGG LHARPAARAR EAARGLDAHV DVHFEGRKAA LQSVVGLLGL GAGEHATIEL
VATGRDAAKA LERVAHELLR EAHGEAEEKP ARIVSPAPAA AGIARAPLEP NTLAGVCAAP
GIAVGTLVRW DDAQIVPPEL ASGTPAAESR LLDRALAEVD AQLETTVREA SRRGAIGEAG
IFAVHRVLLE DPALVDAARD LISLGKSAGY AWRETIRAQT AVLADVDDTL LAERAADLRD
IDKRVLRALG YASASARELP AEAVLAAEEF TPSDLASLDR ERVAALVMAR GGATSHAAII
ARQLGIPALV AVGDALYAIA QRTQVVVDAS AGRLEYAPSA LDVERARHER QRLAGVREAN
RRMSGEAALT RDGHRIEVAA NIATLDDARV ALDNGADAVG LLRTELMFIH RQAAPTASEH
QQSYQSIVDA LQGRTAIIRT LDVGADKEVD YLTLPPEPNP ALGLRGIRLA QVRPDLLDDQ
LRGLLAVKPY GSVRILLPMV TDVGELVRIR KRIDDFARAM GRAQAVEVGV MIEVPSAALL
ADQLAQHADF LSIGTNDLTQ YTLAMDRCQA DLAAQADGLH PAVLRLVDAT VRGAEKHGKW
VGVCGALGGD PVAVPVLVGL GVTELSVDPV SVPGIKAQVR RLDYQLCRQR AQDLLALESA
QAVRAASREI WPAE