Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0906 |
Symbol | |
ID | 3844982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1061555 |
End bp | 1064422 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637838209 |
Product | PTS system, glucose-specific EIIA/HPr/phosphoenolpyruvate-protein phosphotransferase components |
Protein accession | YP_439103 |
Protein GI | 83716398 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0183615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCGCA CCGGGCCGCT GTTCGACAGC ACGCGCGAGA CCGTCGCGAC CGACACGCCC GCCGCTTCCG CGACATCCTT GATTCGTAAC GCCATGACGA CTGAAATCGA TTTCTCTTCG GGGTCGGTAT TTACCCTGAA TCGCGGCGGC GGAAGCGAAC CCCGGCGTAT GCGCCGTCAC CGCCCGCGGA CAGGCCGTTC GCCTATTCTA TCGGCGCTTG CCGCACATTT CACGCGCTTT CCCAAAATCG GGGAACGCTT GACGGTCAAA GTGGTATCGA TTTCAATACG CACATTCCAA CGACTCATCC CAGGAGACGC GATGCCGCCG TTGCTCACCG CGGAACTCGT CCGGCTCGGC GCGAAGGCCG ATTCGAAACA CGATGCGATC GGCCAGGCCG GCGCGCTGCT CGTCGCGGCG GGCGTCATCG AGCCCGGCTA CGTCGACAGC CTGCTCGGCC GCGAGCGGGT GTCGAACACG TATCTCGGCA GCGGCGTCGC GATCCCGCAT GGATTGCAGG AGGACCGCCA CCTGATCCGC CGCACGGGCG TCGCGGTGCT GCAATTGCCG GACGGCGTCG AATGGCAGGG CGGCGAGCGC GCGCGGCTCG TCGTCGCGAT CGCCGCGCAG TCCGACGAGC ACATCGTGCT GCTGCAGCGC CTGACCCGGC TGATCGGCGA TCCGGCGCAG CTCGAGCGGC TGCTCGCCGC GCGCGATTCG CGCGCGATCG TCGACGCGCT GAACGGCGCG CGGCCGGCAA ACGCGGCCGG CGGCGCGCCC ACCGCCGCCG CCGCCGTCGC GGCGACGCCC GCCGACTACG CGCACAAGCT CGATCTGATC CTCGACTATC CGCACGGCCT GCACGCGCGG CCCGCGAGCG CGTGGGTCGC CACCGCGAAG CGCTACCAGG CGGCGCTGCG CGTGCGCTGC GGCGAGCTCG CCGCCGATCC GAAGAACCTC GTCAGTCTGT TGCAGCTCGG CGCGACCGCG CAGGCGCGGC TCGTCGTGTC GGCGCAAGGC GTCGACGCGG CCGATGCGCT CGCCGCGCTC GCACGCACGA TCGGCTCGCT CGCCGCCGAG GAGCACGCGC GCGCGGCGGC GGCGCAGGCG CGCCGGCAGA GCGCGCAGCC CGCGCTGTGG ACGCCCGAGG ACCCGGCGGC CGCGATCGAA GGCGTCGGCG CGGGGCCGGG CCTCGTGACG GGGCCGGTGC GCGTGCTGCG CGCGACGCGC GTCGATGTCG AGGACCGGCC GGGCGACGCG CTCGACGCGT CGCACCGGCT CGAGCGCGCG CTTGCCGACA CCGCGGCCGA GCTCGATGCG CTTGCACGGG AGACGGCGTC GCGGCTCGGC GCGGCCGAGG GCGAGATCTT CGTCGCGCAG CGCGAGCTGC TGAACGACGC CGCGCTGCTC GGCGAGGTCG CGCGGCTGAC CGTCGACGGG CACGGGCTCG CGTGGGCGTG GCATCGCGCG ACCGACGCGC AGGCCGAGCG GCTCGCCGCG CTGCCCGATC CGCTCCTCGC CGCGCGCGCC GCCGATTTGC GCGACGTCGC GCGGCGCGTG CTGCGGCATC TCGGCGAGAC GGGCGGCCCC GCGCGCGACG CGGCGGGCGG CGCCGGCGCG CGCGCGCCGG CGATCCTGAT CGCCGAAGAC CTGACGCCGT CCGACACCGC GCAGCTCGAC CCGGCCGTGA CGCTCGGCTT CTGCACGGTC GCGGGCGGCC CGACGTCGCA CACGGCGATC CTCGCGCGCA CGCTCGGCGT GCCGGCGGCG GTCGCGTGCG GCGCGACGCT GATGGACGTG GCCGACGGCG CGTGCGCGGT GCTCGACGGC AATGCCGGGC GGCTTTACGT CGGCGTGTCG GCGCGCGACG CCGAGCGCGC GCGGCAGGTC GAGCAGCGCC TCGCCGACGA GCGGCGGCGC GCGAGCGCGA ACCGCGCGCT GCCCGCGGCG ACGCTCGACG GCCACGTGAT CGAGATCGGC GCGAACATCA CGCGGCCCGC GCAGGTTGCC GACGCGCTCG CGCAGGGCGC GGACGGCGTC GGCCTGATGC GCACCGAGTT CCTGTTCCTC GAGCGGCGCG ACGCGCCCGA CGAGGACGAG CAATACGCGT GCTACCGGCA GATGGTCGAC GCGAGCGGCG GCCGGCGGCT CATCATCCGC ACGCTCGACA TCGGCGGCGA CAAGCAGGTG CCGTATCTGA ACCTGCCGCA CGAATCGAAT CCGTTCCTCG GCGTGCGCGG GCTGCGCCTG TGCCTGCGGC GGCCGGACCT GTTCGTGCCG CAGTTGCGCG CGCTGTATCG CGCGGCGAAG GCGGGGCCGC TCTGGATCAT GTTCCCGATG GTGTCGACGC TCGATGAGGC GCGCGAGGCG CTCGCGCTCG CCGAGACCGT GCGCGCCGAG CTCGATGCGC CGAAGGTGCC GCTCGGCATC ATGGTCGAGA CGCCGTCGGC CGCGGCGCTC GCCGATCACT TCGCGCAGCT CGTCGATTTC TTCTCGATCG GCACCAACGA TCTGACGCAA TACGTGCTCG CGATCGACCG CGAGCACCCG GAGCTCGCGC GGCTCGCCGA AAGCCTGCAT CCGGCCGTGC TGAGGATGAT CCGGCAGACA GTCGACGGCG CGCGCCGCCA TCGCAAGTGG GTCGGCGTGT GCGGCGGGCT CGCGGGCGAT CCGCTCGGCG CGTCGATTCT CGCGGGCTTG GGCGTCGACG AATTGTCGAT GAGCGCGCGC GACGTCGCCG CGGTGAAGGC GCGGCTGCGC GGCGCGCGGC TCGACGCGCT CACGGCGCTC GCCGCGCGCG CGCTCGACTG CGCGGACGTC GACGCGGTGC GCGCGCTCGA TTCGGCCGAG ATCCGGGTGG CCGCATGA
|
Protein sequence | MSRTGPLFDS TRETVATDTP AASATSLIRN AMTTEIDFSS GSVFTLNRGG GSEPRRMRRH RPRTGRSPIL SALAAHFTRF PKIGERLTVK VVSISIRTFQ RLIPGDAMPP LLTAELVRLG AKADSKHDAI GQAGALLVAA GVIEPGYVDS LLGRERVSNT YLGSGVAIPH GLQEDRHLIR RTGVAVLQLP DGVEWQGGER ARLVVAIAAQ SDEHIVLLQR LTRLIGDPAQ LERLLAARDS RAIVDALNGA RPANAAGGAP TAAAAVAATP ADYAHKLDLI LDYPHGLHAR PASAWVATAK RYQAALRVRC GELAADPKNL VSLLQLGATA QARLVVSAQG VDAADALAAL ARTIGSLAAE EHARAAAAQA RRQSAQPALW TPEDPAAAIE GVGAGPGLVT GPVRVLRATR VDVEDRPGDA LDASHRLERA LADTAAELDA LARETASRLG AAEGEIFVAQ RELLNDAALL GEVARLTVDG HGLAWAWHRA TDAQAERLAA LPDPLLAARA ADLRDVARRV LRHLGETGGP ARDAAGGAGA RAPAILIAED LTPSDTAQLD PAVTLGFCTV AGGPTSHTAI LARTLGVPAA VACGATLMDV ADGACAVLDG NAGRLYVGVS ARDAERARQV EQRLADERRR ASANRALPAA TLDGHVIEIG ANITRPAQVA DALAQGADGV GLMRTEFLFL ERRDAPDEDE QYACYRQMVD ASGGRRLIIR TLDIGGDKQV PYLNLPHESN PFLGVRGLRL CLRRPDLFVP QLRALYRAAK AGPLWIMFPM VSTLDEAREA LALAETVRAE LDAPKVPLGI MVETPSAAAL ADHFAQLVDF FSIGTNDLTQ YVLAIDREHP ELARLAESLH PAVLRMIRQT VDGARRHRKW VGVCGGLAGD PLGASILAGL GVDELSMSAR DVAAVKARLR GARLDALTAL AARALDCADV DAVRALDSAE IRVAA
|
| |