Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0862 |
Symbol | galE |
ID | 6968825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 878272 |
End bp | 879288 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384887 |
Product | UDP-galactose-4-epimerase |
Protein accession | YP_002269387 |
Protein GI | 209397428 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0889309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTTC TGGTCACTGG TGGTAGCGGT TACATTGGAA GTCATACCTG TGTGCAATTA CTGCAAAACG GTCATGATGT CATCATTCTT GATAACCTCT GTAACAGTAA GCGCAGCGTA CTGCCTGTTA TCGAGCGTTT AGGCGGCAAA CATCCGACGT TTGTTGAAGG CGATATCCGT AACGAAGCGT TGATGACCGA GATCCTGCAC GATCACGCTA TCGACACCGT GATCCACTTC GCCGGGCTGA AAGCCGTTGG CGAATCGGTA CAAAAACCGC TGGAATATTA CGACAACAAT GTCAACGGTA CTCTGCGCCT GATTAGCGCC ATGCGCGCCG CTAACGTCAA AAACTTTATT TTTAGCTCCT CCGCCACCGT TTATGGCGAT CAGCCCAAAA TTCCATACGT TGAAAGCTTC CCGACCGGCA CACCGCAAAG CCCTTACGGC AAAAGCAAGC TGATGGTGGA ACAGATCCTC ACCGATCTGC AAAAAGCCCA GCCGGACTGG AGCATTGCCC TGCTGCGCTA CTTCAACCCG GTTGGCGCAC ATCCGTCGGG CGATATGGGC GAAGATCCGC AAGGCATTCC GAATAACCTT ATGCCATACA TCGCCCAGGT TGCTGTAGGC CGTCGCGACT CGCTGGCGAT TTTTGGTAAC GATTATCCGA CCGAAGACGG TACTGGCGTA CGCGATTACA TCCACGTAAT GGATCTGGCG GACGGTCACG TCGTGGCGAT GGAAAAACTG GCGAACAAGC CAGGCGTACA CATCTACAAC CTCGGTGCTG GCGTAGGCAG CAGCGTGCTG GACGTGGTTA ATGCCTTCAG CAGAGCCTGC GGCAAACCGG TTAACTATCA TTTTGCACCG CGTCGCGAGG GCGACCTTCC GGCCTACTGG GCGGACGCCA GCAAAGCCGA CCGTGAACTG AACTGGCGCG TAACGCGCAC ACTCGATGAA ATGGCGCAGG ACACCTGGCA CTGGCAGTCA CGCCATCCAC AGGGATATCC CGATTAA
|
Protein sequence | MRVLVTGGSG YIGSHTCVQL LQNGHDVIIL DNLCNSKRSV LPVIERLGGK HPTFVEGDIR NEALMTEILH DHAIDTVIHF AGLKAVGESV QKPLEYYDNN VNGTLRLISA MRAANVKNFI FSSSATVYGD QPKIPYVESF PTGTPQSPYG KSKLMVEQIL TDLQKAQPDW SIALLRYFNP VGAHPSGDMG EDPQGIPNNL MPYIAQVAVG RRDSLAIFGN DYPTEDGTGV RDYIHVMDLA DGHVVAMEKL ANKPGVHIYN LGAGVGSSVL DVVNAFSRAC GKPVNYHFAP RREGDLPAYW ADASKADREL NWRVTRTLDE MAQDTWHWQS RHPQGYPD
|
| |