Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1354 |
Symbol | |
ID | 3848373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1531254 |
End bp | 1533494 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637841026 |
Product | exopolysaccharide tyrosine-protein kinase, putative |
Protein accession | YP_441901 |
Protein GI | 83721637 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.845392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCCA ATCCAACCGG CACATCGGCG CTCGCCGACA GCGAAGGCGA CACCGATTTC GTCGCCGTTC TCGACATCCT GATCGAAGGC CGCTGGCTGA TCGCGGCGAT CGCGCTCGGC TGCTTCGTCG TGGGCGTCGC GTATGCGGTG CTCAGCAAGC CCGTCTATCA GGCCGACATC CTGATCCAGG TCGAGGACAG CCCCGATACG TCAGCGGCGA AGAGTCTGCT CGGCGACGTG TCTTCGCTCT TCGACGTGAA GTCGTCCGCG GCGGCCGAAA CGCAGATTCT CGCGTCGCGG CTCGTCGTGT CGCGCGCCGT CGACAATCTG AAGCTCTTCA TCGACGCGAA GCCGAAGCGC TTTCCGGTGA TCGGCAATTG GCTCGCGCGC CGCAGCGAAG GGCTGTCGAA TCCGGGGCTC GCGGGCTTCG GCGGCTACGC GTGGGGCCAG GAGCGCATCA ACGTCGCGAC GTTCGACGTG CCGCGCGCGA TGGAAGGCGA CACGTTCGAG CTGACGATGC TCGATGCGCG CCGCTATCGG CTCGCCGGCG GCGATCTCGA ACGCAATGTC GAGGGTGCGA TCGGCACGCT CGAACGCTTT TCGGCGAAGG GCGGGGCGAT CGTGTTGCGC GTCGACGCGG TTGCCGCGAA GCCGGGCGCG ACGTTTGTGC TCGTACGCCA TTCGCGTTTG CGAACGATCG AGGATCTGCA GGACAACCTC GATGTGCAGG AGCGCGTCAA GCAGTCCGAC GTCGTCGTCG CGAGCCTGCG CGACACTGAC CCCGACCTCG TCAGCAGCGC GCTCAATGAA ATCGGGCGGC AGTACATCGC GCAGAACATT CAGCGCAAAT CGGCGGAAGC CGCGCAATCG CTCGAGTTCC TGAACGGACA GCTGCCGGCG CTCAAGCGGC AGTTGACCGA TTCCGAGGCG CGGCTCACGA AGCTGCGCGA CGAGCACGGC ACCGTCGATC TGACCGAAGA GGCGAAGCTC GCGCTCGCGC AGTCGGCCGA TGCGAAGACG CGTCTGCTCG AATTGCGGCA AAAACGGCAG GAGGCGCTGT CGCGCTTCAC GCCGAAGCAT CCGAGCGTCA TCGCAATCGA TCAGCAGATC GCCGCGCTCG ACGGCTATCG CGGTGCGGCC GAGCAGCAGA TCAAGCGGTT GCCGGATCTG CAGCAGCAGC TCGTGCGGCT GATGCTCGAC GTGAAGGTCA ATACCGATCT GTATACGGCG CTGCTGAACA ACATGCAGCA GTTGCAGCTC GTGCGCGCGG GCAAGGTCGG CAACGTGCGG CTCGTCGATA CGGCGGCGGT GCCGGAAGTG CCCGTCAAGC CGAAGAAGGC GCTCGTCGCG CTCGCGTCGC TGCTGCTCGG CGTGCTCGCC GGTTGCGGCA CGGCGGTCGG CCGCTCGATG CTGTTCCATG GCATTTCCGA TCCGAACGAG ATCGAGCGCC GTCTCGGCCT GAACGTCTAT GCGACCGTGC CGCGCAGCGA TCAGCAGCGG GCGCTGACCG AGCGCGCGAA GCGCAGGGAG CGCGCCCTGT CGCTGCTGTC CGTCGCGCAT CCGGACGAGC CGGCCGTCGA AAGCCTGCGC AGCCTGCGCA CCGCGCTGCA GTTCGCGATG CTCGACGCGA GGAACAACGT CGTCGTGATC GCGGGGCCCG CGCCGGGTGT CGGCAAGTCG TTCGTGTCGG CGAATCTCGC CGCGGTGCTG ACGATGGCGG GCAAGCGCGT GCTGCTGATC GACGGCGACA TTCGCAAGGG ACATCTGAAC GACTATCTCG GCCTCGCTCG CGGCAAGGGC TTTTCGGAGC TGATCGCGGG ATCGGCGCAG CCGGACGAGG TGCTGCACCG CGATGTGATC GTCGGACTCG ATTTCATTTC GACGGGCGCG ATGCCGAAGC ATCCCGCCGA GCTGTTGCTC CATCCGCGCT TGCCCGAATT GATCGGCGAA TTCTCGAAGC ATTACGACGT TGTCCTGATC GATTCGCCGC CGGTGCTTGC GGTGGCCGAC ACGGGCATTC TCGCCGCGAC GGCGGGCACC GCGTTCCTCG TCGCGCTCGC CGGCTCGACG AAGCTCGGCG AGATCGCGGA ATCCGCGAAG CGGCTCGCGC AGAACGGCGT GCGTCTGAGC GGCGTCGTGT TCAACGGCAT CAATCCGCGG CTCGGGCAGT ACGGCTATGG CTCGAAGTAC GGCGGCTATC GCTACGTCGC GTACGAATAC GGCGCGAAGC ACGATGCGTG A
|
Protein sequence | MNPNPTGTSA LADSEGDTDF VAVLDILIEG RWLIAAIALG CFVVGVAYAV LSKPVYQADI LIQVEDSPDT SAAKSLLGDV SSLFDVKSSA AAETQILASR LVVSRAVDNL KLFIDAKPKR FPVIGNWLAR RSEGLSNPGL AGFGGYAWGQ ERINVATFDV PRAMEGDTFE LTMLDARRYR LAGGDLERNV EGAIGTLERF SAKGGAIVLR VDAVAAKPGA TFVLVRHSRL RTIEDLQDNL DVQERVKQSD VVVASLRDTD PDLVSSALNE IGRQYIAQNI QRKSAEAAQS LEFLNGQLPA LKRQLTDSEA RLTKLRDEHG TVDLTEEAKL ALAQSADAKT RLLELRQKRQ EALSRFTPKH PSVIAIDQQI AALDGYRGAA EQQIKRLPDL QQQLVRLMLD VKVNTDLYTA LLNNMQQLQL VRAGKVGNVR LVDTAAVPEV PVKPKKALVA LASLLLGVLA GCGTAVGRSM LFHGISDPNE IERRLGLNVY ATVPRSDQQR ALTERAKRRE RALSLLSVAH PDEPAVESLR SLRTALQFAM LDARNNVVVI AGPAPGVGKS FVSANLAAVL TMAGKRVLLI DGDIRKGHLN DYLGLARGKG FSELIAGSAQ PDEVLHRDVI VGLDFISTGA MPKHPAELLL HPRLPELIGE FSKHYDVVLI DSPPVLAVAD TGILAATAGT AFLVALAGST KLGEIAESAK RLAQNGVRLS GVVFNGINPR LGQYGYGSKY GGYRYVAYEY GAKHDA
|
| |