Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1028 |
Symbol | |
ID | 6967017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1040208 |
End bp | 1041257 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385042 |
Product | NAD dependent epimerase/dehydratase family protein |
Protein accession | YP_002269542 |
Protein GI | 209396994 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.779011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACGT CATTTTCTCA ACAACAGTGG ATTGATATGA AGGTACTGGT TACCGGCGCC ACCAGCGGCT TAGGTCGAAA CGCGGTAGAG TTTTTATGCC AGAAAGGCAT CAGCGTGCGA GCGACCGGTC GCAACGAGGC AATGGGTAAA TTGCTGGAAA AAATGGGCGC AGAGTTTGTT CCGGCGGATC TGACCGAGCT GGTCTCATCA CAAGCTAAAG TGATGCTCGC GGGCATTGAT ACGCTGTGGC ACTGCTCCAG CTTTACCTCT CCCTGGGGGA CACAACAGGC TTTCGATCTG GCTAACGTTC GCGCCACTCG CCGCCTAGGT GAATGGGCTG TCGCCTGGGG TGTACGTAAC TTTATTCATA TCTCTTCTCC CTCCCTGTAC TTCGATTATC ACCACCATCG CGATATTAAA GAAGATTTTC GCCCTCACCG CTTCGCCAAC GAGTTTGCCC GCAGCAAAGC GGCCAGCGAA GAAGTGATCA ATATGCTTTC GCAGGCGAAT CCACAAACGC GCTTTACTAT TCTGCGCCCA CAAAGTCTGT TCGGACCGCA CGATAAAGTC TTTATCCCCC GTCTTGCGCA TATGATGCAC CACTACGGCA GCATTCTGTT ACCGCATGGC GGCAGTGCGC TGGTGGATAT GACCTACTAT GAAAATGCCG TGCACGCCAT GTGGCTGGCA AGCCAGGAAG CCTGCGATAA GCTACCTTCC GGGCGTGTGT ACAACATCAC CAACGGCGAG CATCGCACAC TGCGCAGCAT CGTGCAGAAG CTGATCGACG AGTTGAATAT TGACTGTCGT ATTCGTTCCG TCCCTTACCC GATGCTGGAT ATGATCGCCC GCAGCATGGA GCGTTTAGGC CGCAAGTCAG CAAAAGAGCC GCCGCTGACC CACTACGGCG TCTCGAAGCT TAATTTTGAC TTTACGCTGG ATATTACGCG GGCACAGGAA GAGTTAGGTT ATCAGCCGGT CATCACCCTG GATGAAGGTA TCGAGAAAAC TGCCGCCTGG CTGCGCGACC ACGGAAAACT GCCGCGCTAA
|
Protein sequence | MRTSFSQQQW IDMKVLVTGA TSGLGRNAVE FLCQKGISVR ATGRNEAMGK LLEKMGAEFV PADLTELVSS QAKVMLAGID TLWHCSSFTS PWGTQQAFDL ANVRATRRLG EWAVAWGVRN FIHISSPSLY FDYHHHRDIK EDFRPHRFAN EFARSKAASE EVINMLSQAN PQTRFTILRP QSLFGPHDKV FIPRLAHMMH HYGSILLPHG GSALVDMTYY ENAVHAMWLA SQEACDKLPS GRVYNITNGE HRTLRSIVQK LIDELNIDCR IRSVPYPMLD MIARSMERLG RKSAKEPPLT HYGVSKLNFD FTLDITRAQE ELGYQPVITL DEGIEKTAAW LRDHGKLPR
|
| |