Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3156 |
Symbol | |
ID | 5593626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3169914 |
End bp | 3170828 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922276 |
Product | NAD dependent epimerase/dehydratase family protein |
Protein accession | YP_001459774 |
Protein GI | 157162456 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAA CTGTCGCGGT GACGGGCGCT ACCGGGTTTA TCGGTAAATA TATTATCGAT AACCTGCTCG CCCGCGGCTT TCACGTTCGC GCATTGACGC GTACTGCCCG CGCTCACGTC AATGATAATC TCACCTGGGT GCGCGGTTCG CTGGAAGATA CTCATTCACT TAGCGAGCTG GTTGCCGGAG CCAGCGCAGT GGTCCATTGC GCCGGGCAAG TGCGCGGGCA CAAAGAAGAG ATTTTCACCC GCTGTAACGT TGACGGCAGC CTGCGCCTGA TGCAAGCAGC AAAAGAGAGC TGCTTTTGCC AACGTTTTCT GTTTATCTCT TCGCTGGCGG CGCGCCATCC CGAGCTCTCC TGGTACGCAA ATTCCAAACA CATCGCCGAA CAACGGCTGA CTGCAATGGC TGACGAAATT ACGCTGGGCG TTTTTCGCCC GACAGCCGTG TATGGTCCCG GCGATAAAGA GTTAAAACCG CTGTTTGACT GGATGCTGCG CGGCCTGCTG CCACGACTTG GTGCACCAGA TACACAGCTC TCTTTCCTGC ACGTCACCGA TTTTGCGCAA GCAGTGGGTC AGTGGTTAAG CGCCGAAACT GTACAGACGC AAACCTATGA ATTATGCGAT GGCGTCGCTG GCGGCTATGA CTGGCAACGC ATACAGCAAC TTGCCGCCAA CGTCCGTTGC GGTTCCGTGC GAATGGTTGG TATTCCTCTG CCGGTACTCA CCTGCCTTGC GGATATCAGT ACTGCGTTGA GTCGCCTGGC GGGTAAAGAA CCTATGCTGA CCCGCTCGAA AATTCGTGAA TTAACCCACG CCGACTGGTC GGCAAGTAAT AACCGTATTT CTGAAGATAT TAATTGGTTT CCCGGGATTA GCCTGGAACA CGCATTACGC AACGGGCTAT TTTGA
|
Protein sequence | MNQTVAVTGA TGFIGKYIID NLLARGFHVR ALTRTARAHV NDNLTWVRGS LEDTHSLSEL VAGASAVVHC AGQVRGHKEE IFTRCNVDGS LRLMQAAKES CFCQRFLFIS SLAARHPELS WYANSKHIAE QRLTAMADEI TLGVFRPTAV YGPGDKELKP LFDWMLRGLL PRLGAPDTQL SFLHVTDFAQ AVGQWLSAET VQTQTYELCD GVAGGYDWQR IQQLAANVRC GSVRMVGIPL PVLTCLADIS TALSRLAGKE PMLTRSKIRE LTHADWSASN NRISEDINWF PGISLEHALR NGLF
|
| |