Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0942 |
Symbol | |
ID | 5588708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 956242 |
End bp | 957672 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640924652 |
Product | NAD dependent epimerase/dehydratase family protein |
Protein accession | YP_001462067 |
Protein GI | 157156770 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.286438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGCAAC GCATTTTAGT TCTCGGTGCC AGTGGCTACA TTGGTCAGCA TCTGGTGCGC ACACTCAGCC AGCAAGGGCA TCAGATCCTG GCGGCGGCAC GTCATGTCGA CAGGCTTGCA AAGCTGCAAC TGGCAAACGT CAGTTGCCAT AAAGTCGATC TCAGCTGGCC GGATAACCTT CCGGCCCTGT TGCAGGATAT CGATACCGTC TATTTTCTGG TGCACAGCAT GGGCGAAGGC GGCGATTTTA TCGCTCAGGA GCGCCAGGTG GCTCTCAACG TCCGCGATGC GCTACGTGAA GTACCAGTTA AGCAATTAAT CTTTCTCAGT TCGTTGCAGG CCCCGCCACA TGAGCAGTCG GATCATCTGC GTGCTCGTCA GGCTACGGCG GACATTCTTC GTGAAGCGAA TGTACCTGTG ACCGAACTTC GGGCCGGAAT TATCGTTGGC GCAGGTTCAG CGGCGTTCGA AGTCATGCGC GATATGGTCT ACAACCTGCC AGTGTTAACG CCGCCACGCT GGGTACGTTC ACGCACCACG CCCATCGCGC TGGAAAACTT GCTGCACTAT CTGGTGGCGT TGTTAGATCA TCCGGCCAGC GAACACCGCA TCTTCGAAGC CGCCGGACCA GAGGTGCTCA GTTATCAGCA ACAGTTTGAA CATTTTATGG CGGTGAGCGG TAAGCGCCGC TGGTTGATCC CCATCCCCCT CCCCACCCGC TGGATTTCGG TGTGGTTTCT CAATGTGATT ACTTCCGTAC CGCCCACCAC CGCCAGGGCG TTGATTCAGG GGCTGAAACA CGATCTGCTG GCGGATGATA CCGCGCTACG TGCACTCATC CCACAACGGC TGATCGCTTT CGATGACGCG GTACGTAGCA CGTTGAAAGA GGAGGAAAAA CTGGTCAACT CCAGCGACTG GGGCTACGAC GCTCAGGCCT TTGCCCGCTG GCGACCGGAG TACGGTTATT TTGCCAAACA GGCGGGGTTT ACCGTTAAAA CGTCCGCCAG CCTTGCTGCT TTATGGCAGG TAGTGAACCA AATCGGCGGT AAAGAGCGTT ATTTCTTTGG CAATATTTTG TGGCAGACAC GGGCGTTGAT GGACCGCGCG ATCGGTCATA AGCTGGCGAA AGGCCGCCCG GAGCGCGAAT ATTTACAGAC TGGCGATGCG GTAGATAGCT GGAAAGTGAT TGTCGTTGAA CCGGAAAAAC AACTTACGTT GTTATTTGGC ATGAAAGCGC CGGGGCTGGG ACGACTGTGT TTTAGCCTGG AAGATAAAGG CGACTATCGT ACTATCGATG TCCGCGCTTT CTGGCATCCG CACGGTATGC CGGGGCTGTT TTACTGGTTA TTGATGATCC CCGCGCATCT GTTTATTTTT CGCGGAATGG CAAAACAAAT CGCCAGACTG GCAGAACAAA GCACAGATTA A
|
Protein sequence | MPQRILVLGA SGYIGQHLVR TLSQQGHQIL AAARHVDRLA KLQLANVSCH KVDLSWPDNL PALLQDIDTV YFLVHSMGEG GDFIAQERQV ALNVRDALRE VPVKQLIFLS SLQAPPHEQS DHLRARQATA DILREANVPV TELRAGIIVG AGSAAFEVMR DMVYNLPVLT PPRWVRSRTT PIALENLLHY LVALLDHPAS EHRIFEAAGP EVLSYQQQFE HFMAVSGKRR WLIPIPLPTR WISVWFLNVI TSVPPTTARA LIQGLKHDLL ADDTALRALI PQRLIAFDDA VRSTLKEEEK LVNSSDWGYD AQAFARWRPE YGYFAKQAGF TVKTSASLAA LWQVVNQIGG KERYFFGNIL WQTRALMDRA IGHKLAKGRP EREYLQTGDA VDSWKVIVVE PEKQLTLLFG MKAPGLGRLC FSLEDKGDYR TIDVRAFWHP HGMPGLFYWL LMIPAHLFIF RGMAKQIARL AEQSTD
|
| |