Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0554 |
Symbol | |
ID | 4901539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 521594 |
End bp | 524206 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640133784 |
Product | PTS system, glucose-glucoside (Glc) family EIIA/phosphocarrier HPr/phosphoenolpyruvate-protein phosphotransferase components |
Protein accession | YP_001064837 |
Protein GI | 126451463 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1925] Phosphotransferase system, HPr-related proteins [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00830] PTS system, glucose subfamily, IIA component [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.71734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGCAAC AAGCATCCCA CGACCAGATC GTGCTGGTCG CTCCGCTGAC GGGGCCCGTC GTGCCGCTCG CCGACGTACC CGATCCCGTG TTCTCGGGCG GCATGTTCGG CGACGGCATC GGCATCGATC CGCTCGAGGG CCGGCTGCTC GCGCCGTGCG CGGGCGTCGT GTCGCACGTC GCGCGCACCG GCCACGCGGT GACGATCGCG GCCGACGGCG GCGCGGAGAT CCTGCTGCAC ATCGGCATCG ACACGGTCGA GCTGAACGGG CTCGGCTTCA CGGCGAAGAT CGCCGAGGGC GCGCGCGTCG CGGCGGGCGA TCTGCTGATC GAATTCGATC AGGACGCGAT CGCGCGCGCC GCGCACAGCC TCGTATCGGT GATCGCGATC GCGAACTCGG ATGCGTTCGA AGTCGTCGAG CGCGCGGGCG CGGGCGTCGT GAAAGCGGGC GAGACGCCGC TGCTCGCGCT GCGCGCGCGC GGCGCGGATG CAAGTGCGGA TGCAAGTGCG GATGCAAGTG CGGATGCAAG TGCGAGTGCG GGCGCGGCTG CTGACGCGAG CTGCGCGCAG CCCGCCGCCG AAGCGCGCAA GTCGATCACG CTCACGCAGC CGGGCGGCCT GCACGCGCGG CCGGCCGCGC GCGCGCGCGA GGCGGCGCGC GGGCTCGACG CGCACGTCGA CGTGCACTTC GAAGGGCGCA AGGCGGCGCT GCAAAGCGTG GTCGGGCTGC TCGGGCTCGG CGCGGGCGAG CATGCGACGA TCGAGCTCGT CGCGACGGGC CGCGACGCGG CGAAGGCGCT CGAGCGCGTC GCGCACGAGC TGCTGCGCGA GGCGCACGGC GAGGCCGAAG AGAAGCCGGC GCGCATCGTG TCGCCCGCGC CCGCCGCCGC GGGCATCGCG CGCGCGCCGC TCGAGCCGAA CACGCTCGCG GGCGTGTGCG CGGCGCCCGG CATCGCGGTC GGCACGCTCG TGCGCTGGGA TGATGCGCAG ATCGTGCCGC CCGAGCTCGC GAGCGGCACG CCGGCGGCCG AGAGCCGGCT GCTCGACCGC GCGCTCGCCG AAGTCGACGC GCAACTCGAG ACGACGGTGC GCGAAGCGTC GCGGCGCGGC GCGATCGGCG AAGCGGGCAT CTTCGCCGTG CATCGCGTGC TGCTCGAAGA TCCGGCGCTC GTCGACGCCG CGCGCGACCT GATCAGCCTC GGCAAGAGCG CGGGCTACGC GTGGCGCGAG ACGATCCGCG CGCAGACGGC CGTGCTCGCC GACGTCGACG ACACGCTCCT CGCCGAGCGC GCGGCCGATC TGCGCGACAT CGACAAGCGC GTGCTGCGCG CGCTCGGCTA TGCGAGCGCG AGCGCGCGCG AGCTGCCCGC CGAAGCGGTG CTCGCCGCGG AGGAGTTCAC GCCGTCCGAT CTCGCGTCGC TCGATCGCGA GCGCGTCGCG GCGCTCGTGA TGGCGCGCGG CGGCGCAACC TCGCATGCGG CGATCATCGC GCGGCAGTTG GGCATTCCGG CGCTCGTCGC GGTGGGCGAC GCGCTGTACG CGATTGCGCA GCGCACACAG GTCGTCGTCG ACGCGAGCGC CGGCCGCCTC GAATACGCGC CGAGCGCGCT CGACGTCGAG CGCGCGCGTC ACGAGCGACA GCGCCTTGCC GGCGTGCGCG AGGCGAACCG GCGGATGTCG GGCGAGGCGG CGCTCACGCG CGACGGCCAC CGGATCGAGG TGGCCGCGAA CATCGCGACG CTCGACGACG CGCGCGTCGC GCTCGACAAC GGCGCCGACG CGGTCGGCCT GCTGCGCACC GAGCTGATGT TCATCCATCG TCAGGCGGCG CCGACGGCGT CCGAGCATCA GCAGAGCTAT CAATCGATCG TCGACGCGCT GCAAGGGCGC ACCGCGATCA TCCGCACGCT CGACGTCGGC GCGGACAAGG AAGTCGATTA CCTGACGCTG CCGCCCGAGC CGAACCCGGC GCTCGGCCTG CGCGGGATCC GTCTCGCGCA GGTGCGCCCC GATCTGCTCG ACGACCAACT GCGGGGCCTG CTCGCCGTGA AGCCGTACGG CTCGGTGCGC ATCCTGCTGC CGATGGTGAC GGACGTGGGC GAGCTCGTGC GGATCCGCAA GCGCATCGAC GATTTCGCGC GCGCGATGGG CCGCGCGCAG GCCGTCGAGG TCGGCGTGAT GATCGAAGTG CCGTCGGCCG CGCTTCTCGC GGATCAACTC GCGCAGCACG CGGACTTCCT GTCGATCGGC ACGAACGATC TCACGCAGTA CACGCTCGCG ATGGACCGCT GCCAGGCGGA TCTCGCCGCG CAGGCGGACG GCCTGCATCC GGCCGTGCTG CGGCTCGTCG ACGCGACCGT GCGCGGCGCC GAGAAGCACG GCAAGTGGGT CGGCGTGTGC GGCGCGCTGG GCGGCGATCC GGTCGCGGTG CCGGTGCTCG TCGGCCTCGG CGTGACGGAG TTGTCGGTGG ACCCGGTGTC GGTGCCGGGC ATCAAGGCGC AGGTGCGCCG TCTCGATTAC CAGCTGTGCC GCCAGCGCGC GCAAGACCTG CTCGCGCTCG AATCGGCGCA GGCGGTGAGA GCAGCAAGCC GCGAGATCTG GCCGGCGGAA TGA
|
Protein sequence | MKQQASHDQI VLVAPLTGPV VPLADVPDPV FSGGMFGDGI GIDPLEGRLL APCAGVVSHV ARTGHAVTIA ADGGAEILLH IGIDTVELNG LGFTAKIAEG ARVAAGDLLI EFDQDAIARA AHSLVSVIAI ANSDAFEVVE RAGAGVVKAG ETPLLALRAR GADASADASA DASADASASA GAAADASCAQ PAAEARKSIT LTQPGGLHAR PAARAREAAR GLDAHVDVHF EGRKAALQSV VGLLGLGAGE HATIELVATG RDAAKALERV AHELLREAHG EAEEKPARIV SPAPAAAGIA RAPLEPNTLA GVCAAPGIAV GTLVRWDDAQ IVPPELASGT PAAESRLLDR ALAEVDAQLE TTVREASRRG AIGEAGIFAV HRVLLEDPAL VDAARDLISL GKSAGYAWRE TIRAQTAVLA DVDDTLLAER AADLRDIDKR VLRALGYASA SARELPAEAV LAAEEFTPSD LASLDRERVA ALVMARGGAT SHAAIIARQL GIPALVAVGD ALYAIAQRTQ VVVDASAGRL EYAPSALDVE RARHERQRLA GVREANRRMS GEAALTRDGH RIEVAANIAT LDDARVALDN GADAVGLLRT ELMFIHRQAA PTASEHQQSY QSIVDALQGR TAIIRTLDVG ADKEVDYLTL PPEPNPALGL RGIRLAQVRP DLLDDQLRGL LAVKPYGSVR ILLPMVTDVG ELVRIRKRID DFARAMGRAQ AVEVGVMIEV PSAALLADQL AQHADFLSIG TNDLTQYTLA MDRCQADLAA QADGLHPAVL RLVDATVRGA EKHGKWVGVC GALGGDPVAV PVLVGLGVTE LSVDPVSVPG IKAQVRRLDY QLCRQRAQDL LALESAQAVR AASREIWPAE
|
| |