Gene BTH_II0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II0547 
Symbol 
ID3845502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp649613 
End bp651832 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content68% 
IMG OID637837852 
Productexopolysaccharide tyrosine-protein kinase, putative 
Protein accessionYP_438747 
Protein GI83716548 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAATA CGCAAGCGAA ACATCCTTAT GCCGACCTCG CGGTGAAGAC CGACGAGGAA 
GACGTCGTCC TGGGCCAGAT GATCCAGGTG ATTCTCGACG ATATCTGGCT GCTCCTCGGC
ATCGCGTTGG TCGTGATCGC GCTCGCCGGG CTCTACTGCT ACGTCGCGAA GCCGGTCTAT
TCGGCCGATG CGCAGGTGCG AGTCGAGGCG AGCGACAACA CGTCGCAAGC GCTCACGCAG
ACGCAGACGG GCGCGATGAT CAACAGCGGG CCGCCGACGC CGCCGACCGA CGCGGAAATC
GAGATCATCA AGAGCCGCGG CGTCGTCGCG CCGGTCGTCG AGCAGTTCAA GCTGAACTCG
TCGGTCACGC CGAACACGTT CCCGATCCTC GGCGCGATCG CGGCGCGGCT CGCGACGCCG
GGCCATCCGG GCAAGCCGTG GCTCGGCCTG TCTTCGTACG CGTGGGGCGG CGAGGAGGCG
AAGGTCGATT CGATCGAGGT GACGCCCGCG CTCGAAGGCA AGAAGCTCAC GCTCACGGCC
GGCGCGGACG GCGGCTACAC GCTTGCCGAT CCGGACGGCT TCCCGCTCGT GCGCGGCAGG
GTCGGCGAGA GCGAGCAGGG CGGCGGCGTG ACGATCGACG TATCGAAGCT CGTCGCGCGT
CCCGGCACGC GGTTCACGGT GATCCGGCAG AACGATCTCG ATGCGATCAC CGCGTTCCAG
TCGGCGATCC AGGTGGCCGA GCAGGGCAAG CAGACGGGCG TGATCCAGAT CTCGCTCGAA
GGCAAGGACC CCGAACAGAC CGCGCAGATC GCGAATGCGC TCGCGCAGTC GTATTTGCAT
CAGCACGTGA CGAGCAAGCA GGCCGAAGCG ACGAAGATGC TCGAGTTCCT GAAGAACGAA
GAGCCGCGCC TGAAGTCGGA CCTCGAGCGC GCGGAGGCGG AGCTGACCCA GTATCAGCGC
ACGTCGGGCT CGATCAACGC GAGCGACGAA GCGAAGGTCT ATCTCGAAGG CAGCGTCCAG
TACGAGCAGC AGATCGCCGC GCAGCGGCTG CAGCTCGCGG CGCTCGCGCA GCGCTACACG
GACGAGCATC CGCTCGTCAT CGCGGCGAAG CAGCAGCTCG GACAGCTCGA AGCGGAGCGC
GCGAAGTACG ACGGCAAGTT CCGCGGGCTG CCGGCGACCG AAGTCAAGGC GGTCGCGCTG
CAGCGCAATG CGAAGGTCGC GGAAGACATC TACGTGCTGC TGCTCAACCG CGTGCAGGAG
CTGTCGGTGC AGAAGGCCGG CACCGGCGGC AACATCCGCC TCGTCGATGC GGCGCTGCGC
CCCGGCGTGC CGGTCAAGCC GAAGAAGATG CTGATCCTGT CGGCGGCGAC GCTGCTCGGC
CTGATCCTCG GCACGGGCGT CGTGTTCCTG CGCCGCAACC TGTTCCACGG CATCGAGGAC
CCGGATCGCG TCGAGCGCGC GTTCAACCTG CCGCTGTACG GCCTCGTGCC GATGAGCGCG
GAGCAGACGC GGTTCGACGC CGCGGACAAG GGCAACCGCG TGCGGCCGAT TCTCGCGTGC
GCCCGGCCGA AGGACCTGAG CGTCGAGAGC CTGCGCAGCC TGCGCACCGC GATGCAGTTC
GCGCTGATGG ACGCGAAGAA CCGCGTGATC GTGCTGACGG GCCCGACGCC CGGCATCGGC
AAGAGCTTTC TCGCGGTCAA TCTCGCCGCG CTCGTCGCGC ATTCGGGCAA GCGCGTGCTG
CTGATCGACG CGGACATGCG GCGCGGCACG CTGGAGCGCC ACTTCGGCAC CGGCGGGCGG
AGCGGCCTGT CGGAGCTGTT GAGCGACCAG GTCGCGCTCG AAGAGGCGAT TCGCGAGACG
CCGGTGCCGG GCCTGTCGTT CATTCCGTGC GGCGCGCGTC CGCCGAATCC GTCGGAGCTG
CTGATGTCGC CGCGCCTGTC GCAGTATCTC GACGGCCTCG CGAAGCGCTA CGACATGGTG
ATCGTCGATT CGCCGCCGAT CCTCGCCGTC ACCGACGCGA TGATCTTCGG CGAACTCGCC
GGCTCGACGT TCCTCGTGCT GCGCTCCGGC ATGCACACCG AGGGCGAGAT CAGCGATGCG
ATCAAGCGGC TGCGCACCGC GGGCGTGCAA CTGCAAGGCG GAATCTTCAA CGGCGTGCCG
GCGCGCACGC GAGGCTACGG CCGCGGCTAT GCGGCCGTGC ACGAATATCT GAGCGCATGA
 
Protein sequence
MVNTQAKHPY ADLAVKTDEE DVVLGQMIQV ILDDIWLLLG IALVVIALAG LYCYVAKPVY 
SADAQVRVEA SDNTSQALTQ TQTGAMINSG PPTPPTDAEI EIIKSRGVVA PVVEQFKLNS
SVTPNTFPIL GAIAARLATP GHPGKPWLGL SSYAWGGEEA KVDSIEVTPA LEGKKLTLTA
GADGGYTLAD PDGFPLVRGR VGESEQGGGV TIDVSKLVAR PGTRFTVIRQ NDLDAITAFQ
SAIQVAEQGK QTGVIQISLE GKDPEQTAQI ANALAQSYLH QHVTSKQAEA TKMLEFLKNE
EPRLKSDLER AEAELTQYQR TSGSINASDE AKVYLEGSVQ YEQQIAAQRL QLAALAQRYT
DEHPLVIAAK QQLGQLEAER AKYDGKFRGL PATEVKAVAL QRNAKVAEDI YVLLLNRVQE
LSVQKAGTGG NIRLVDAALR PGVPVKPKKM LILSAATLLG LILGTGVVFL RRNLFHGIED
PDRVERAFNL PLYGLVPMSA EQTRFDAADK GNRVRPILAC ARPKDLSVES LRSLRTAMQF
ALMDAKNRVI VLTGPTPGIG KSFLAVNLAA LVAHSGKRVL LIDADMRRGT LERHFGTGGR
SGLSELLSDQ VALEEAIRET PVPGLSFIPC GARPPNPSEL LMSPRLSQYL DGLAKRYDMV
IVDSPPILAV TDAMIFGELA GSTFLVLRSG MHTEGEISDA IKRLRTAGVQ LQGGIFNGVP
ARTRGYGRGY AAVHEYLSA