Gene BURPS1710b_A0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0914 
SymbolepsB 
ID3692061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1172873 
End bp1175092 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content68% 
IMG OID637731168 
ProductEpsB 
Protein accessionYP_336072 
Protein GI76819162 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.121592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAATA CGCAAGCGAA ACATCCTTAT GCCGACCTCG CGGTGAAGAC CGACGAGGAA 
GACGTCGTCC TGGGCCAGAT GATCCAGGTG ATTCTCGACG ATATCTGGCT GCTCCTCGGC
ATCGCGTTGG TCGTGGTCGC GCTCGCCGGG CTCTACTGCT ACGTCGCGAA GCCGCTCTAT
TCGGCCGATG CGCAGGTGCG GGTGGAGGCG AGCGACAACA CGTCGCAGGC GCTTACGCAG
ACGCAGACGG GCGCGATGAT CAACAGCGGG CCGCCGACGC CGCCCACCGA TGCGGAAATC
GAGATCATCA AGAGCCGCGG CGTCGTCGCG CCGGTCGTCG AGCAGTTCAA GCTGAACGCG
TCGGTCACGC CGAACACGTT GCCGATTCTC GGCGCGATCG CCGCGCGGCT CGCGACGCCG
GGCCATCCGG GCAAACCGTG GCTCGGCTTG TCGTCGTACG CGTGGGGCGG CGAGGAGGCG
AGCATCGATT CGATCGACGT GACGCCCGTG CTCGAAGGCA AGCAGCTCAC GCTCACGGCC
GGCGCGGACG GCGGCTACGC GCTCGCCGAT CCGGACGGCG CGGTGCTCGT GCGCGGCAAG
GTCGGCGAGC GCGAGCAGGG CGGCGGCGTG ACGATCAACG TCTCGAAGCT CGTCGCGCGC
CCCGGCACGC GCTTCACGGT GGTCCGGCAG AACGATCTCG ATGCGATCAC CGCGTTCCAG
TCGGCGATCC AGGTGGCCGA GCAGGGCAAG CAGACCGGCG TGATCCAGAT CTCGCTCGAA
GGCAAGGACC CCGAACAGAC CGCGCAGATC GCGAACGCGC TCGCGCAGTC GTATCTGCAT
CAGCACGTGA CGAGCAAGCA GGCCGAAGCG ACGAAGATGC TCGAGTTCCT GAAGAACGAA
GAGCCGCGCC TGAAATCGGA CCTCGAGCGC GCGGAGGCGG AGCTCACCCA GTATCAGCGC
ACGTCGGGCT CGATCAACGC GAGCGACGAA GCGAAGGTCT ACCTCGAAGG CAGCGTCCAG
TACGAGCAGC AGGTCGCCGC GCAGCGGCTG CAGCTCGCGG CGCTCGCGCA GCGCTACACG
GACGAGCATC CGCTCGTCGT CGCGGCGAAG CAGCAGCTCG GCCAGCTCGA GGCGGAGCGC
GCGAAGTACG ACGGCAAGTT CCGCGGGCTG CCGGCGACCG AAGTCAAGGC TGTCGCGTTG
CAGCGCAACG CGAAGGTTGC GGAAGACATC TACGTGCTGC TGCTCAACCG TGTGCAGGAG
CTGTCGGTGC AGAAGGCCGG CACGGGCGGC AACATCCGCC TCGTCGATGC GGCGCTGCGC
CCGGGCGTGC CGGTCAAGCC GAAGAAGGTG CTGATCCTGT CGGCGGCGAC GCTGCTCGGC
CTGATCCTCG GCACGAGCGT CGTGTTCCTG CGCCGCAACC TGTTCCATGG CATCGAGGAT
CCGGATCGCG TCGAGCGCGC GTTCAACCTG CCGCTGTACG GCCTCGTGCC GATGAGCGCG
GAGCAGGCGC GATTCGATGC CGCCGACAAG GGCAATCGCG TGCGGCCGAT TCTCGCGTGC
GCGCGGCCGA AGGATCTGAG CGTCGAAAGC CTGCGCAGCC TGCGCACCGC GATGCAGTTC
GCGCTGATGG ATGCGAAGAA CCGCGTGATC GTGCTGACCG GACCGACCCC CGGCATCGGC
AAGAGCTTTC TCGCGGTCAA CCTCGCCGCG CTCGTCGCGC ATTCGGGCAA GCGCGTGCTG
CTGATCGACG CGGACATGCG GCGCGGCTCG CTCGATCGCC ACTTCGGCAC CGGGGGAAGG
CGCGGCCTGT CGGAATTGCT GAGCGATCAG GTCGCGCTCG AAGAGGCGAT TCGCGAAACG
TCGGTGCCGG GGCTGTCGTT CATCCCGAGC GGCGCGCGCC CGCCGAATCC GTCGGAGCTG
CTGATGTCGC CGCGCCTGTC GCAATACCTC GACGGCCTCG CGAAGCGCTA CGACATGGTG
ATCGTCGATT CGCCGCCGAT CCTCGCCGTC ACCGACGCGA CGATCTTCGG CGAACTCGCC
GGCTCGACGT TCCTCGTGCT GCGCTCCGGC ATGCACACCG AAGGCGAGAT CGGCGACGCG
ATCAAGCGGC TGCGCACCGC GGGCGTGCAA CTGCAAGGCG GGATCTTCAA CGGCGTGCCG
GCGCGCACGC GCGGCTACGG CCGCGGCTAT GCGGCCGTGC ACGAATATCT GAGCGCATGA
 
Protein sequence
MVNTQAKHPY ADLAVKTDEE DVVLGQMIQV ILDDIWLLLG IALVVVALAG LYCYVAKPLY 
SADAQVRVEA SDNTSQALTQ TQTGAMINSG PPTPPTDAEI EIIKSRGVVA PVVEQFKLNA
SVTPNTLPIL GAIAARLATP GHPGKPWLGL SSYAWGGEEA SIDSIDVTPV LEGKQLTLTA
GADGGYALAD PDGAVLVRGK VGEREQGGGV TINVSKLVAR PGTRFTVVRQ NDLDAITAFQ
SAIQVAEQGK QTGVIQISLE GKDPEQTAQI ANALAQSYLH QHVTSKQAEA TKMLEFLKNE
EPRLKSDLER AEAELTQYQR TSGSINASDE AKVYLEGSVQ YEQQVAAQRL QLAALAQRYT
DEHPLVVAAK QQLGQLEAER AKYDGKFRGL PATEVKAVAL QRNAKVAEDI YVLLLNRVQE
LSVQKAGTGG NIRLVDAALR PGVPVKPKKV LILSAATLLG LILGTSVVFL RRNLFHGIED
PDRVERAFNL PLYGLVPMSA EQARFDAADK GNRVRPILAC ARPKDLSVES LRSLRTAMQF
ALMDAKNRVI VLTGPTPGIG KSFLAVNLAA LVAHSGKRVL LIDADMRRGS LDRHFGTGGR
RGLSELLSDQ VALEEAIRET SVPGLSFIPS GARPPNPSEL LMSPRLSQYL DGLAKRYDMV
IVDSPPILAV TDATIFGELA GSTFLVLRSG MHTEGEIGDA IKRLRTAGVQ LQGGIFNGVP
ARTRGYGRGY AAVHEYLSA