Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0782 |
Symbol | galE |
ID | 6144838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 783049 |
End bp | 784065 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615670 |
Product | UDP-galactose-4-epimerase |
Protein accession | YP_001742862 |
Protein GI | 170683375 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTTC TGGTTACCGG TGGTAGCGGT TACATTGGAA GTCATACCTG TGTGCAATTA CTGCAAAACG GTCATGATGT CATCATTCTT GATAACCTCT GTAACAGTAA GCGCAGCGTA CTGCCTGTTA TCGAGCGTTT AGGCGGCAAA CATCCAACGT TTGTTGAAGG CGATATCCGT AACGAAGCGT TGATGACCGA GATCCTGCAC GATCACGCTA TCGACACCGT GATCCACTTC GCCGGGCTGA AAGCCGTTGG CGAATCGGTA CAAAAACCGC TTGAATATTA CGACAACAAT GTCAACGGTA CTCTGCGCCT GATTAGCGCC ATGCGCGCCG CTAACGTCAA AAACTTTATC TTTAGCTCCT CCGCCACCGT TTATGGCGAT CAGCCCAAAA TTCCATATGT TGAAAGCTTC CCGACCGGCA CACCGCAAAG CCCTTACGGC AAAAGCAAGC TGATGGTGGA ACAGATCCTC ACCGATCTGC AAAAAGCCCA GCCGGACTGG AGCATTGCCC TGCTGCGCTA CTTCAACCCA GTTGGCGCGC ATCCGTCGGG CGATATGGGC GAAGATCCGC AAGGCATTCC GAATAACCTG ATGCCATACA TCGCCCAGGT TGCTGTAGGC CGTCGCGACT CGCTGGCGAT TTTTGGTAAC GATTATCCGA CCGAAGACGG TACTGGCGTG CGCGATTACA TCCACGTAAT GGATCTGGCG GACGGTCACG TCGTAGCGAT GGAAAAACTG GCGAACAAGC CAGGCGTACA CATCTACAAC CTCGGCGCTG GCGTGGGCAG CAGCGTGCTG GACGTGGTTA ATGCCTTCAG CAAAGCCTGC GGCAAACCGG TTAATTATCA TTTTGCACCG CGTCGCGAGG GCGACCTTCC GGCCTACTGG GCGGACGCCA GCAAAGCCGA CCGTGAACTG AACTGGCGCG TAACGCGCAC ACTCGATGAA ATGGCGCAGG ACACCTGGCA CTGGCAGTCA CGCCATCCAC AGGGATATCC CGATTAA
|
Protein sequence | MRVLVTGGSG YIGSHTCVQL LQNGHDVIIL DNLCNSKRSV LPVIERLGGK HPTFVEGDIR NEALMTEILH DHAIDTVIHF AGLKAVGESV QKPLEYYDNN VNGTLRLISA MRAANVKNFI FSSSATVYGD QPKIPYVESF PTGTPQSPYG KSKLMVEQIL TDLQKAQPDW SIALLRYFNP VGAHPSGDMG EDPQGIPNNL MPYIAQVAVG RRDSLAIFGN DYPTEDGTGV RDYIHVMDLA DGHVVAMEKL ANKPGVHIYN LGAGVGSSVL DVVNAFSKAC GKPVNYHFAP RREGDLPAYW ADASKADREL NWRVTRTLDE MAQDTWHWQS RHPQGYPD
|
| |