Gene BTH_I0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I0449 
Symbol 
ID3849091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp495574 
End bp498174 
Gene Length2601 bp 
Protein Length866 aa 
Translation table11 
GC content73% 
IMG OID637840122 
ProductPTS system, glucose-specific EIIA/HPr/phosphoenolpyruvate-protein phosphotransferase components 
Protein accessionYP_441007 
Protein GI83719492 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.237423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGTG CCGAGGAGTC ACAGTTGAAG CAACAAGCAT CCCACGACCA GATCGTGCTG 
GTCGCTCCGC TGACGGGGCC CGTCGTGCCG CTCGCCGACG TGCCCGATCC CGTGTTCTCG
GGCGGCATGT TCGGCGACGG CATCGGCATC GATCCGCTCG AGGGCCGGCT TCTCGCGCCG
TGCGGGGGCG TCGTGTCGCA CGTCGCGCGC ACGGGCCACG CGGTGACGAT CGCGGCCGAC
GGCGGCGCCG AGATCCTGCT GCACATCGGC ATCGACACGG TCGAGCTGAA CGGGCTCGGC
TTCTCGGCGA AGATCGCCGA AGGCGCGCGC GTGGCGCCGG GCGATCTGCT GATCGAGTTC
GATCAGGACA CGATCGCGCG CGCCGCGCAC AGCCTCGTAT CGGTGATCGC GATCGCGAAT
TCGGATGCGT TCGAAGTCGT CGAGCGCGCG GGCGCGGGCG TCGCGAAGGC GGGCGAGACG
CCGCTGCTCA CGCTGCGCGC GCGCGGGGCG GCGGCGGGCG CGGCGGCAAG CGCGGGTTCG
GCGGCGGGCC GCGCAGACGC GGGCGCCGAA GCGCGCAAGT CGATCACGCT CACGCAGCCG
GGCGGCCTGC ACGCGCGACC GGCCGCGCGC GCGCGCGAGG CGGCGCGCGG GCTCGACGCG
CACGTCGACG TGCATTTCGG AGGGCGCAAG GCGGCGCTGC AAAGCGTCGT CGGGCTGCTC
GGCCTCGGCG CGGGCGAGCA CGCGACGATC GAGATCGTCG CGACGGGCCG CGACGCCGCG
AAGGCGGTCG AGCTCGTCGC GCACGAACTG CTGCGCGAAG CGCACGGCGA AGCCGAGGAG
AAGCCGGCGC GCGTCGTGTC GCCCGCGCCC GTCGCCGCCG CCGTCGGGCG CGCGCCGCTC
GATCCGAACA CGCTCGCGGG CGTGTGTGCG GCGCCCGGCA TCGCCGTAGG CGCGCTCGTG
CGCTGGGACG AGACGGACAT CGCGCCGCCC GAGCTCGCGA GCGGCACGCC CGCCGCCGAG
AGCCGGCTGC TCGACCGCGC GCTCGCCGCG GTCGACGCGG AGCTCGAGAC GACGGTGCGC
GAAGCGTCGC AGCGCGGCGC GATCGGCGAA GCGGGCATCT TCGCCGTGCA CCGCGTGCTG
CTCGAGGACC CGTCGCTCGT CGACGCCGCG CGCGACCTGA TCAGCCTCGG CAAGAGCGCG
GGCTACGCGT GGCGCGAGAC GATCCGTGCG CAGACGGCCG TGCTCGCCGG CGTCGACGAC
GCGCTGCTCG CCGAGCGCGC GGCCGATCTG CGCGACATCG ACAAGCGCGT GCTGCGCGCG
CTCGGCTATG CGAGCGCGAC TGCGCGGGAG CTGCCGGCCG AAGCGGTGCT CGCGGCGGAG
GAGTTCACGC CGTCCGATCT CGCGTCGCTC GATCGCGAGC GCGTGACGGC GCTCGTGATG
GCGCGCGGCG GCGCGACTTC GCATGCGGCG ATCATCGCGC GCCAGCTCGG CATTCCGTCG
CTCGTCGCGG TGGGCGATGC GCTGTATGCG ATTCCGCAGC GCACGCAGGT CGTCGTCGAC
GCGAGCGCGG GCCGCCTCGA ATACGCGCCG ACCGCGCTCG ACGTCGAGCG CGCGCGCCAC
GAGCGGCAGC GCCTTGCCGG CGTGCGCGAG GCGAACCGGC GGATGTCGGG CGAAGCGGCG
GTCACGCGCG ACGGCCACAA GATCGAGGTC GCCGCGAACA TCGCGACGCT CGACGACGCG
CGCGTCGCGG TCGACAACGG CGCCGACGCG GTCGGCCTGC TGCGCACCGA GCTGATGTTC
ATCCACCGGC AGGCGGCGCC GACGACGTCC GAGCATCAGC AGAGCTATCA ATCGATCGTC
GACGCGCTGC AAGGCCGCAC CGCGATCATC CGCACGCTCG ACGTCGGCGC GGACAAGGAA
GTCGATTACC TGACGCTGCC GCCCGAGCCG AACCCCGCGC TCGGCCTGCG CGGGATCCGT
CTCGCGCAGG TGCGCCCGGA CCTGCTCGAC GATCAGTTGC AGGGCCTCCT TTCGGTGAAG
CCGTACGGCT CGGTGCGAAT CCTGCTGCCG ATGGTGACGG ACGTCGGCGA GCTCGTGCGC
ATCCGCGAGC GCATCGACGC CTTCGCGCGC GCGCTCGGCC GCGCCGATCC GATCGAAGTC
GGCGTGATGA TCGAGGTGCC GTCGGCCGCG CTCCTCGCGG ATCAGCTCGC GAAGCACGCG
GATTTCCTGT CGATCGGCAC GAACGACCTC ACGCAGTACA CGCTCGCGAT GGACCGCTGC
CAGGCCGATC TCGCCGCACA GGCGGACGGC CTGCATCCGG CCGTGCTGCG GCTCGTCGAC
GCGACCGTGC GCGGCGCCGA GAAGCACGGC AAGTGGGTGG GCGTGTGCGG CGCGCTCGGC
GGCGATCCGG TTGCGGTGCC GGTGCTCGTC GGCCTCGGCG TGACGGAATT GTCGGTGGAC
CCGGTGTCGG TGCCGGGCAT CAAGGCGCAG GTGCGCCGTC TCGATTACCA GCTGTGCCGC
CAGCGCGCGC AAGACCTGCT CGCGCTCGAA TCGGCGCAGG CGGTGAGGGC AGCAAGCCGC
GAGATCTGGC CGGCGGACTG A
 
Protein sequence
MRRAEESQLK QQASHDQIVL VAPLTGPVVP LADVPDPVFS GGMFGDGIGI DPLEGRLLAP 
CGGVVSHVAR TGHAVTIAAD GGAEILLHIG IDTVELNGLG FSAKIAEGAR VAPGDLLIEF
DQDTIARAAH SLVSVIAIAN SDAFEVVERA GAGVAKAGET PLLTLRARGA AAGAAASAGS
AAGRADAGAE ARKSITLTQP GGLHARPAAR AREAARGLDA HVDVHFGGRK AALQSVVGLL
GLGAGEHATI EIVATGRDAA KAVELVAHEL LREAHGEAEE KPARVVSPAP VAAAVGRAPL
DPNTLAGVCA APGIAVGALV RWDETDIAPP ELASGTPAAE SRLLDRALAA VDAELETTVR
EASQRGAIGE AGIFAVHRVL LEDPSLVDAA RDLISLGKSA GYAWRETIRA QTAVLAGVDD
ALLAERAADL RDIDKRVLRA LGYASATARE LPAEAVLAAE EFTPSDLASL DRERVTALVM
ARGGATSHAA IIARQLGIPS LVAVGDALYA IPQRTQVVVD ASAGRLEYAP TALDVERARH
ERQRLAGVRE ANRRMSGEAA VTRDGHKIEV AANIATLDDA RVAVDNGADA VGLLRTELMF
IHRQAAPTTS EHQQSYQSIV DALQGRTAII RTLDVGADKE VDYLTLPPEP NPALGLRGIR
LAQVRPDLLD DQLQGLLSVK PYGSVRILLP MVTDVGELVR IRERIDAFAR ALGRADPIEV
GVMIEVPSAA LLADQLAKHA DFLSIGTNDL TQYTLAMDRC QADLAAQADG LHPAVLRLVD
ATVRGAEKHG KWVGVCGALG GDPVAVPVLV GLGVTELSVD PVSVPGIKAQ VRRLDYQLCR
QRAQDLLALE SAQAVRAASR EIWPAD