Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0898 |
Symbol | |
ID | 6147021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 901751 |
End bp | 903181 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615786 |
Product | NAD dependent epimerase/dehydratase family protein |
Protein accession | YP_001742978 |
Protein GI | 170681016 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.668207 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCAAC GCATTTTAGT TCTCGGTGCC AGTGGCTACA TTGGTCAGCA TCTGGTGCGA ACACTCAGCC AGCAGGGGCA TCAGATCCTG GCGGCGGCAC GTCATGTCGA CAGGCTTGCA AAGCTGCAAC TGGCAAATGT CAGTTGCCAT AAAGTCGATC TCAGCTGGCC GGATAACCTT CCGGCCCTGT TGCAGGATAT CGATACGGTC TATTTTCTGG TGCACAGCAT GGGCGAAGGC GGCGATTTTA TCGCTCAGGA GCGCCAGGTG GCTCTCAACG TCCGCGATGC GCTACGTGAA GTCCCAGTTA AGCAATTAAT CTTTCTCAGT TCGTTGCAGG CCCCGCCACA TGAGCAGTCG GACCATCTGC GCGCCCGTCA GGCTACGGCG GATATTCTTC GTGAAGCGGG TGTCCCGGTG ACCGAATTGC GTGCCGGAAT AATCGTTGGT GCAGGTTCAG CGGCGTTCGA AGTCATGCGC GATATGGTCT ACAACCTGCC GGTGTTAACG CCGCCACGCT GGGTTCGTTC ACGCACCACG CCCATCGCGC TGGAAAACTT GCTGCACTAT CTGGTGGCGC TGTTAGACCA TCCAGCCAGC GAACATCGCA TCTTCGAAGC CGCCGGACCA GAAGTGCTCA GTTATCAGCA ACAGTTTGAA CATTTTATGG CGGTGAGCGG TAAGCGCCGC TGGTTGATCC CCATCCCCTT CCCCACCCGC TGGATTTCGG TGTGGTTTCT CAATGTGATT ACTTCCGTAC CGCCCACTAC CGCCAAAGCG TTGATTCAGG GGCTGAAACA CGATCTGCTG GCAGATGACA CCGCGTTACG TGCACTCATC CCACAACGGC TGATTGCTTT CGATGACGCG GTACGTCGCA CCCTGAAAGA GGAAGAAAAG CTGGTCAACT CCAGCGACTG GGGATACGAC GCTCAGGCCT TTGCCCGCTG GCGACCGGAG TACGGTTATT TCGCCAAACA GGCGGGGTTT ACCGTTAAAA CGTCCGCCAG CCTTGCGGCT TTATGGCAGG TGGTGAATCA AATCGGCGGT AGAGAGCGTT ATTTCTTTGG TAATATTTTG TGGCAGACCC GGGCGCTGAT GGACCGCGCG ATCGGTCATA AGCTGGCGAA AGGCCGCCCG GAGCGTGAAT ATTTGCAAAC TGGCGATGCG GTAGACAGCT GGAAAGTGAT TATCGTCGAG CCGGAAAAAC AACTTGCGTT GTTATTTGGC ATGAAAGCGC CGGGGCTGGG ACGACTGTGT TTTACCCTGG AAGATAAAGG CGACTATCGT ACTATCGATG TTCGCGCATT CTGGCATCCG CACGGGATGC CGGGGCTGTT TTACTGGTTA CTGATGATCC CCGCGCATCT GTTTATTTTT CGCGGAATGG CAAAACGAAT CGCCAGACTG GCAGAACAAA GCACAGATTA A
|
Protein sequence | MPQRILVLGA SGYIGQHLVR TLSQQGHQIL AAARHVDRLA KLQLANVSCH KVDLSWPDNL PALLQDIDTV YFLVHSMGEG GDFIAQERQV ALNVRDALRE VPVKQLIFLS SLQAPPHEQS DHLRARQATA DILREAGVPV TELRAGIIVG AGSAAFEVMR DMVYNLPVLT PPRWVRSRTT PIALENLLHY LVALLDHPAS EHRIFEAAGP EVLSYQQQFE HFMAVSGKRR WLIPIPFPTR WISVWFLNVI TSVPPTTAKA LIQGLKHDLL ADDTALRALI PQRLIAFDDA VRRTLKEEEK LVNSSDWGYD AQAFARWRPE YGYFAKQAGF TVKTSASLAA LWQVVNQIGG RERYFFGNIL WQTRALMDRA IGHKLAKGRP EREYLQTGDA VDSWKVIIVE PEKQLALLFG MKAPGLGRLC FTLEDKGDYR TIDVRAFWHP HGMPGLFYWL LMIPAHLFIF RGMAKRIARL AEQSTD
|
| |