Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A0937 |
Symbol | |
ID | 4680832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | + |
Start bp | 933375 |
End bp | 935288 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639845211 |
Product | polysaccharide synthase family protein |
Protein accession | YP_992277 |
Protein GI | 121599405 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.217913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGAG CATTTTGCCC GAAGGCTTCG TGGCTTTCCC TGGGCGCCTT TCTGTTCGAT CTGATGGCGG TCGTATGCGC GTGGCTGATC GCGTATGTGG TCCGGTTCAA TGGTGCGATG CCGCCTGAAT TCCTGAAAGG CGCCTTCGTC GCGCTCGCAT GGGTGGTGCC GTTGTATGGC CTGCTGTTTC ATGTTTTCGG GTTGTATCGC GGTCTCTGGG TATTTGCGAG CCTGCCGGAT CTGCTGCGTA TCTCGAAGGC CGTGGTGGGC GGCGGCTTGA TCGTGATGAT CGGCGCGGTG ATGCTGCAGC CGGTTCCGAT CATTCCGCGT TCGGTTCTCG TGCTGTCGCC GATGCTGCTG TTTCTTGCGA TGGGTGGTGC ACGTGCGCTC TATCGGGCGA CGAAGGAGTT CTATCTGTAC GGCGGACTGA TCGGGCAGGG CAAGCCCGTG CTGATTCTCG GCGCCGGCAG TGCCGGCGCG AGCCTCGCCC GTGAACTGTC GCGCTCCGGT GAGTGGCGTC TTGCCGGTCT GCTCGATGAC GATCCCGCGA AACACGGGCG CGAAGTCTAC GGCTATAAGG TACTGGGTTC GATCGGCGAA GTCGCAGACT GGGCCGAGGC CGCGAAAGCC GAATACGCGA TCATCGCGAT TCCATCCGCA TCGGTCGAAG CGCAACGTCG CTTCGCCACC CTCTGCGTGC GCGCCGGCGT CAGGGCCATG GTGCTGCCGT CGCTGACCGC ATTGATGCCG GGCCAGGGCA TTCTGTCGCA GATCCGCCAG ATCGATCTCG AAGACCTGCT CGGCCGCGAG GCCGTGACGA TCGACACGCC GCACGTCGAG GCGCTGCTGC GCGGCCGCGT CGTGATGGTG ACGGGCGCGG GCGGCTCGAT CGGCTCGGAG CTGTGCCGCC AGATCCTGAA GTTCCAGCCC GCGCAACTGA TCGCGTTCGA TCTGTCCGAA TATGCGATGT ACCGGCTCAC GGAAGAGTTG CGCGAGCGCT TCCCCGATCT GCCCGTCGTG CCGATCATCG GCGATGCGAA GGATTCGCTG CTGCTCGATC AGGTGATGTC GCGCTATGCG CCGCACATCG TGTTCCATGC GGCCGCGTAC AAGCACGTGC CGTTGATGGA GGAGCTCAAC GCATGGCAGG CGCTGCGCAA CAACGTGCTC GGCACGTATC GCGTCGCGCG CGCGGCGATT CGCCACGACG TCAGGCACTT CGTGCTGATC TCGACCGACA AGGCCGTGAA CCCGACGAAC GTGATGGGCG CGAGCAAGCG CCTGGCCGAG ATGGCTTGCC AGGCGCTGCA GCAGACGAGC GCGCGCACGC AGTTCGAGAC GGTGCGCTTC GGCAACGTGC TCGGCAGCGC CGGCAGCGTG ATTCCGAAAT TTCAGCAGCA GATCGCCAAG GGCGGCCCGG TGACCGTCAC GCATCCGGAA ATCACGCGCT TCTTCATGAC GATTCCCGAG GCGTCGCAAC TCGTGCTGCA GGCGTCGAGC ATGGGGCAGG GCGGCGAGAT CTTCATTCTC GACATGGGCG AGCCGGTGAA GATCGTCGAT CTCGCGCGCG ATCTGATTCG CCTGTACGGC TTCACCGAGG AGCAGATCCG CATCGAGTTC AGCGGGCTGC GCCCGGGCGA GAAGCTCTAT GAGGAACTGC TCGCCGACGA CGAGACCACC ACCCGCACGC CACACCCGAA GCTGCGCACC GCGAAGGCGC GCGAGGTGCC CGATCATCTG CTCGACGAAC TGCTGCCGTG GCTGATGCAG CACCGCGTGC TGAGCGACGA TGAAGTGCGG CGCGACTTGC GGCGCTGGGT GCCCGAGTAT CAGCCGGCCG TCGGCCCGAC GCTGCAGAGC GTGCCGACCG GCAACGGGCT CGTCGCCGGC ATCGGGCGCG AGGCGGGGCG CTGA
|
Protein sequence | MTRAFCPKAS WLSLGAFLFD LMAVVCAWLI AYVVRFNGAM PPEFLKGAFV ALAWVVPLYG LLFHVFGLYR GLWVFASLPD LLRISKAVVG GGLIVMIGAV MLQPVPIIPR SVLVLSPMLL FLAMGGARAL YRATKEFYLY GGLIGQGKPV LILGAGSAGA SLARELSRSG EWRLAGLLDD DPAKHGREVY GYKVLGSIGE VADWAEAAKA EYAIIAIPSA SVEAQRRFAT LCVRAGVRAM VLPSLTALMP GQGILSQIRQ IDLEDLLGRE AVTIDTPHVE ALLRGRVVMV TGAGGSIGSE LCRQILKFQP AQLIAFDLSE YAMYRLTEEL RERFPDLPVV PIIGDAKDSL LLDQVMSRYA PHIVFHAAAY KHVPLMEELN AWQALRNNVL GTYRVARAAI RHDVRHFVLI STDKAVNPTN VMGASKRLAE MACQALQQTS ARTQFETVRF GNVLGSAGSV IPKFQQQIAK GGPVTVTHPE ITRFFMTIPE ASQLVLQASS MGQGGEIFIL DMGEPVKIVD LARDLIRLYG FTEEQIRIEF SGLRPGEKLY EELLADDETT TRTPHPKLRT AKAREVPDHL LDELLPWLMQ HRVLSDDEVR RDLRRWVPEY QPAVGPTLQS VPTGNGLVAG IGREAGR
|
| |