Gene BURPS1106A_0554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0554 
Symbol 
ID4901539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp521594 
End bp524206 
Gene Length2613 bp 
Protein Length870 aa 
Translation table11 
GC content73% 
IMG OID640133784 
ProductPTS system, glucose-glucoside (Glc) family EIIA/phosphocarrier HPr/phosphoenolpyruvate-protein phosphotransferase components 
Protein accessionYP_001064837 
Protein GI126451463 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.71734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGCAAC AAGCATCCCA CGACCAGATC GTGCTGGTCG CTCCGCTGAC GGGGCCCGTC 
GTGCCGCTCG CCGACGTACC CGATCCCGTG TTCTCGGGCG GCATGTTCGG CGACGGCATC
GGCATCGATC CGCTCGAGGG CCGGCTGCTC GCGCCGTGCG CGGGCGTCGT GTCGCACGTC
GCGCGCACCG GCCACGCGGT GACGATCGCG GCCGACGGCG GCGCGGAGAT CCTGCTGCAC
ATCGGCATCG ACACGGTCGA GCTGAACGGG CTCGGCTTCA CGGCGAAGAT CGCCGAGGGC
GCGCGCGTCG CGGCGGGCGA TCTGCTGATC GAATTCGATC AGGACGCGAT CGCGCGCGCC
GCGCACAGCC TCGTATCGGT GATCGCGATC GCGAACTCGG ATGCGTTCGA AGTCGTCGAG
CGCGCGGGCG CGGGCGTCGT GAAAGCGGGC GAGACGCCGC TGCTCGCGCT GCGCGCGCGC
GGCGCGGATG CAAGTGCGGA TGCAAGTGCG GATGCAAGTG CGGATGCAAG TGCGAGTGCG
GGCGCGGCTG CTGACGCGAG CTGCGCGCAG CCCGCCGCCG AAGCGCGCAA GTCGATCACG
CTCACGCAGC CGGGCGGCCT GCACGCGCGG CCGGCCGCGC GCGCGCGCGA GGCGGCGCGC
GGGCTCGACG CGCACGTCGA CGTGCACTTC GAAGGGCGCA AGGCGGCGCT GCAAAGCGTG
GTCGGGCTGC TCGGGCTCGG CGCGGGCGAG CATGCGACGA TCGAGCTCGT CGCGACGGGC
CGCGACGCGG CGAAGGCGCT CGAGCGCGTC GCGCACGAGC TGCTGCGCGA GGCGCACGGC
GAGGCCGAAG AGAAGCCGGC GCGCATCGTG TCGCCCGCGC CCGCCGCCGC GGGCATCGCG
CGCGCGCCGC TCGAGCCGAA CACGCTCGCG GGCGTGTGCG CGGCGCCCGG CATCGCGGTC
GGCACGCTCG TGCGCTGGGA TGATGCGCAG ATCGTGCCGC CCGAGCTCGC GAGCGGCACG
CCGGCGGCCG AGAGCCGGCT GCTCGACCGC GCGCTCGCCG AAGTCGACGC GCAACTCGAG
ACGACGGTGC GCGAAGCGTC GCGGCGCGGC GCGATCGGCG AAGCGGGCAT CTTCGCCGTG
CATCGCGTGC TGCTCGAAGA TCCGGCGCTC GTCGACGCCG CGCGCGACCT GATCAGCCTC
GGCAAGAGCG CGGGCTACGC GTGGCGCGAG ACGATCCGCG CGCAGACGGC CGTGCTCGCC
GACGTCGACG ACACGCTCCT CGCCGAGCGC GCGGCCGATC TGCGCGACAT CGACAAGCGC
GTGCTGCGCG CGCTCGGCTA TGCGAGCGCG AGCGCGCGCG AGCTGCCCGC CGAAGCGGTG
CTCGCCGCGG AGGAGTTCAC GCCGTCCGAT CTCGCGTCGC TCGATCGCGA GCGCGTCGCG
GCGCTCGTGA TGGCGCGCGG CGGCGCAACC TCGCATGCGG CGATCATCGC GCGGCAGTTG
GGCATTCCGG CGCTCGTCGC GGTGGGCGAC GCGCTGTACG CGATTGCGCA GCGCACACAG
GTCGTCGTCG ACGCGAGCGC CGGCCGCCTC GAATACGCGC CGAGCGCGCT CGACGTCGAG
CGCGCGCGTC ACGAGCGACA GCGCCTTGCC GGCGTGCGCG AGGCGAACCG GCGGATGTCG
GGCGAGGCGG CGCTCACGCG CGACGGCCAC CGGATCGAGG TGGCCGCGAA CATCGCGACG
CTCGACGACG CGCGCGTCGC GCTCGACAAC GGCGCCGACG CGGTCGGCCT GCTGCGCACC
GAGCTGATGT TCATCCATCG TCAGGCGGCG CCGACGGCGT CCGAGCATCA GCAGAGCTAT
CAATCGATCG TCGACGCGCT GCAAGGGCGC ACCGCGATCA TCCGCACGCT CGACGTCGGC
GCGGACAAGG AAGTCGATTA CCTGACGCTG CCGCCCGAGC CGAACCCGGC GCTCGGCCTG
CGCGGGATCC GTCTCGCGCA GGTGCGCCCC GATCTGCTCG ACGACCAACT GCGGGGCCTG
CTCGCCGTGA AGCCGTACGG CTCGGTGCGC ATCCTGCTGC CGATGGTGAC GGACGTGGGC
GAGCTCGTGC GGATCCGCAA GCGCATCGAC GATTTCGCGC GCGCGATGGG CCGCGCGCAG
GCCGTCGAGG TCGGCGTGAT GATCGAAGTG CCGTCGGCCG CGCTTCTCGC GGATCAACTC
GCGCAGCACG CGGACTTCCT GTCGATCGGC ACGAACGATC TCACGCAGTA CACGCTCGCG
ATGGACCGCT GCCAGGCGGA TCTCGCCGCG CAGGCGGACG GCCTGCATCC GGCCGTGCTG
CGGCTCGTCG ACGCGACCGT GCGCGGCGCC GAGAAGCACG GCAAGTGGGT CGGCGTGTGC
GGCGCGCTGG GCGGCGATCC GGTCGCGGTG CCGGTGCTCG TCGGCCTCGG CGTGACGGAG
TTGTCGGTGG ACCCGGTGTC GGTGCCGGGC ATCAAGGCGC AGGTGCGCCG TCTCGATTAC
CAGCTGTGCC GCCAGCGCGC GCAAGACCTG CTCGCGCTCG AATCGGCGCA GGCGGTGAGA
GCAGCAAGCC GCGAGATCTG GCCGGCGGAA TGA
 
Protein sequence
MKQQASHDQI VLVAPLTGPV VPLADVPDPV FSGGMFGDGI GIDPLEGRLL APCAGVVSHV 
ARTGHAVTIA ADGGAEILLH IGIDTVELNG LGFTAKIAEG ARVAAGDLLI EFDQDAIARA
AHSLVSVIAI ANSDAFEVVE RAGAGVVKAG ETPLLALRAR GADASADASA DASADASASA
GAAADASCAQ PAAEARKSIT LTQPGGLHAR PAARAREAAR GLDAHVDVHF EGRKAALQSV
VGLLGLGAGE HATIELVATG RDAAKALERV AHELLREAHG EAEEKPARIV SPAPAAAGIA
RAPLEPNTLA GVCAAPGIAV GTLVRWDDAQ IVPPELASGT PAAESRLLDR ALAEVDAQLE
TTVREASRRG AIGEAGIFAV HRVLLEDPAL VDAARDLISL GKSAGYAWRE TIRAQTAVLA
DVDDTLLAER AADLRDIDKR VLRALGYASA SARELPAEAV LAAEEFTPSD LASLDRERVA
ALVMARGGAT SHAAIIARQL GIPALVAVGD ALYAIAQRTQ VVVDASAGRL EYAPSALDVE
RARHERQRLA GVREANRRMS GEAALTRDGH RIEVAANIAT LDDARVALDN GADAVGLLRT
ELMFIHRQAA PTASEHQQSY QSIVDALQGR TAIIRTLDVG ADKEVDYLTL PPEPNPALGL
RGIRLAQVRP DLLDDQLRGL LAVKPYGSVR ILLPMVTDVG ELVRIRKRID DFARAMGRAQ
AVEVGVMIEV PSAALLADQL AQHADFLSIG TNDLTQYTLA MDRCQADLAA QADGLHPAVL
RLVDATVRGA EKHGKWVGVC GALGGDPVAV PVLVGLGVTE LSVDPVSVPG IKAQVRRLDY
QLCRQRAQDL LALESAQAVR AASREIWPAE