Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3443 |
Symbol | |
ID | 5590493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3457067 |
End bp | 3457981 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640927070 |
Product | NAD dependent epimerase/dehydratase family protein |
Protein accession | YP_001464440 |
Protein GI | 157154742 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAA CTGTCGCGGT GACGGGCGCT ACCGGGTTTA TCGGTAAATA TATTATCGAT AACCTGCTCG CCCGCGGCTT TCACGTTCGC GCATTGACGC GTACTGCCCG CGCTCACGTC AATGATAATC TTACTTGGGT GCGCGGTTCG CTGGAAGATA CACATTCGCT TAGCGAACTG GTTGCCGGAG CCAGCGCGGT GGTCCATTGC GCCGGGCAAG TGCGCGGACA CAAAGAAGAG ATTTTTACCC ACTGTAACGT TGACGGCAGC CTGCGCCTGA TGCAAGCAGC AAAAGAGAGC GGCTTTTGCC AACGTTTTCT GTTTATCTCT TCGCTGGCGG CGCGCCATCC CGAGCTCTCC TGGTACGCAA ATTCCAAACA CGTCGCCGAA CAACGTCTGA CTGCAATGGC TGACGAAATT ACGCTGGGCG TTTTTCGCCC GACAGCCGTG TATGGTCCCG GCGATAAAGA GTTAAAACCG CTGTTTGACT GGATGCTGCG CGGCCTGCTG CCACGACTTG GTGCGCCAGA TACACAGCTC TCTTTCCTGC ACGTCACCGA TTTCGCGCAA GCAGTGGGTC AGTGGTTAAG CGCCGAAACT GTACAGACGC AAACCTATGA ATTATGCGAT GGCGTCCCCG GCGGCTATGA CTGGCAACAC GTACGACAGC TTGCCGCCGA CGCCCGTTGT GGTTCCGTGC GAATGGTTGG TATTCCTCTG CCGGTACTCA CCTGCCTTGC GGATATCAGT ACCGCGTTGA GTCGCCTGGC AGGTAAAGAA CCTATGCTGA CCCGCTCGAA AATTCGTGAA TTAACCCACG CCGACTGGTC GGCAAGTAAT AACCGTATTT CTGAAGATAT TAATTGGTTT CCCGGGATTA GCCTGGAACA AGCATTACGC AACGGGCTAT TTTGA
|
Protein sequence | MNQTVAVTGA TGFIGKYIID NLLARGFHVR ALTRTARAHV NDNLTWVRGS LEDTHSLSEL VAGASAVVHC AGQVRGHKEE IFTHCNVDGS LRLMQAAKES GFCQRFLFIS SLAARHPELS WYANSKHVAE QRLTAMADEI TLGVFRPTAV YGPGDKELKP LFDWMLRGLL PRLGAPDTQL SFLHVTDFAQ AVGQWLSAET VQTQTYELCD GVPGGYDWQH VRQLAADARC GSVRMVGIPL PVLTCLADIS TALSRLAGKE PMLTRSKIRE LTHADWSASN NRISEDINWF PGISLEQALR NGLF
|
| |