Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A0914 |
Symbol | epsB |
ID | 3692061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 1172873 |
End bp | 1175092 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637731168 |
Product | EpsB |
Protein accession | YP_336072 |
Protein GI | 76819162 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.121592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAATA CGCAAGCGAA ACATCCTTAT GCCGACCTCG CGGTGAAGAC CGACGAGGAA GACGTCGTCC TGGGCCAGAT GATCCAGGTG ATTCTCGACG ATATCTGGCT GCTCCTCGGC ATCGCGTTGG TCGTGGTCGC GCTCGCCGGG CTCTACTGCT ACGTCGCGAA GCCGCTCTAT TCGGCCGATG CGCAGGTGCG GGTGGAGGCG AGCGACAACA CGTCGCAGGC GCTTACGCAG ACGCAGACGG GCGCGATGAT CAACAGCGGG CCGCCGACGC CGCCCACCGA TGCGGAAATC GAGATCATCA AGAGCCGCGG CGTCGTCGCG CCGGTCGTCG AGCAGTTCAA GCTGAACGCG TCGGTCACGC CGAACACGTT GCCGATTCTC GGCGCGATCG CCGCGCGGCT CGCGACGCCG GGCCATCCGG GCAAACCGTG GCTCGGCTTG TCGTCGTACG CGTGGGGCGG CGAGGAGGCG AGCATCGATT CGATCGACGT GACGCCCGTG CTCGAAGGCA AGCAGCTCAC GCTCACGGCC GGCGCGGACG GCGGCTACGC GCTCGCCGAT CCGGACGGCG CGGTGCTCGT GCGCGGCAAG GTCGGCGAGC GCGAGCAGGG CGGCGGCGTG ACGATCAACG TCTCGAAGCT CGTCGCGCGC CCCGGCACGC GCTTCACGGT GGTCCGGCAG AACGATCTCG ATGCGATCAC CGCGTTCCAG TCGGCGATCC AGGTGGCCGA GCAGGGCAAG CAGACCGGCG TGATCCAGAT CTCGCTCGAA GGCAAGGACC CCGAACAGAC CGCGCAGATC GCGAACGCGC TCGCGCAGTC GTATCTGCAT CAGCACGTGA CGAGCAAGCA GGCCGAAGCG ACGAAGATGC TCGAGTTCCT GAAGAACGAA GAGCCGCGCC TGAAATCGGA CCTCGAGCGC GCGGAGGCGG AGCTCACCCA GTATCAGCGC ACGTCGGGCT CGATCAACGC GAGCGACGAA GCGAAGGTCT ACCTCGAAGG CAGCGTCCAG TACGAGCAGC AGGTCGCCGC GCAGCGGCTG CAGCTCGCGG CGCTCGCGCA GCGCTACACG GACGAGCATC CGCTCGTCGT CGCGGCGAAG CAGCAGCTCG GCCAGCTCGA GGCGGAGCGC GCGAAGTACG ACGGCAAGTT CCGCGGGCTG CCGGCGACCG AAGTCAAGGC TGTCGCGTTG CAGCGCAACG CGAAGGTTGC GGAAGACATC TACGTGCTGC TGCTCAACCG TGTGCAGGAG CTGTCGGTGC AGAAGGCCGG CACGGGCGGC AACATCCGCC TCGTCGATGC GGCGCTGCGC CCGGGCGTGC CGGTCAAGCC GAAGAAGGTG CTGATCCTGT CGGCGGCGAC GCTGCTCGGC CTGATCCTCG GCACGAGCGT CGTGTTCCTG CGCCGCAACC TGTTCCATGG CATCGAGGAT CCGGATCGCG TCGAGCGCGC GTTCAACCTG CCGCTGTACG GCCTCGTGCC GATGAGCGCG GAGCAGGCGC GATTCGATGC CGCCGACAAG GGCAATCGCG TGCGGCCGAT TCTCGCGTGC GCGCGGCCGA AGGATCTGAG CGTCGAAAGC CTGCGCAGCC TGCGCACCGC GATGCAGTTC GCGCTGATGG ATGCGAAGAA CCGCGTGATC GTGCTGACCG GACCGACCCC CGGCATCGGC AAGAGCTTTC TCGCGGTCAA CCTCGCCGCG CTCGTCGCGC ATTCGGGCAA GCGCGTGCTG CTGATCGACG CGGACATGCG GCGCGGCTCG CTCGATCGCC ACTTCGGCAC CGGGGGAAGG CGCGGCCTGT CGGAATTGCT GAGCGATCAG GTCGCGCTCG AAGAGGCGAT TCGCGAAACG TCGGTGCCGG GGCTGTCGTT CATCCCGAGC GGCGCGCGCC CGCCGAATCC GTCGGAGCTG CTGATGTCGC CGCGCCTGTC GCAATACCTC GACGGCCTCG CGAAGCGCTA CGACATGGTG ATCGTCGATT CGCCGCCGAT CCTCGCCGTC ACCGACGCGA CGATCTTCGG CGAACTCGCC GGCTCGACGT TCCTCGTGCT GCGCTCCGGC ATGCACACCG AAGGCGAGAT CGGCGACGCG ATCAAGCGGC TGCGCACCGC GGGCGTGCAA CTGCAAGGCG GGATCTTCAA CGGCGTGCCG GCGCGCACGC GCGGCTACGG CCGCGGCTAT GCGGCCGTGC ACGAATATCT GAGCGCATGA
|
Protein sequence | MVNTQAKHPY ADLAVKTDEE DVVLGQMIQV ILDDIWLLLG IALVVVALAG LYCYVAKPLY SADAQVRVEA SDNTSQALTQ TQTGAMINSG PPTPPTDAEI EIIKSRGVVA PVVEQFKLNA SVTPNTLPIL GAIAARLATP GHPGKPWLGL SSYAWGGEEA SIDSIDVTPV LEGKQLTLTA GADGGYALAD PDGAVLVRGK VGEREQGGGV TINVSKLVAR PGTRFTVVRQ NDLDAITAFQ SAIQVAEQGK QTGVIQISLE GKDPEQTAQI ANALAQSYLH QHVTSKQAEA TKMLEFLKNE EPRLKSDLER AEAELTQYQR TSGSINASDE AKVYLEGSVQ YEQQVAAQRL QLAALAQRYT DEHPLVVAAK QQLGQLEAER AKYDGKFRGL PATEVKAVAL QRNAKVAEDI YVLLLNRVQE LSVQKAGTGG NIRLVDAALR PGVPVKPKKV LILSAATLLG LILGTSVVFL RRNLFHGIED PDRVERAFNL PLYGLVPMSA EQARFDAADK GNRVRPILAC ARPKDLSVES LRSLRTAMQF ALMDAKNRVI VLTGPTPGIG KSFLAVNLAA LVAHSGKRVL LIDADMRRGS LDRHFGTGGR RGLSELLSDQ VALEEAIRET SVPGLSFIPS GARPPNPSEL LMSPRLSQYL DGLAKRYDMV IVDSPPILAV TDATIFGELA GSTFLVLRSG MHTEGEIGDA IKRLRTAGVQ LQGGIFNGVP ARTRGYGRGY AAVHEYLSA
|
| |