Gene ECH74115_0862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0862 
SymbolgalE 
ID6968825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp878272 
End bp879288 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID643384887 
ProductUDP-galactose-4-epimerase 
Protein accessionYP_002269387 
Protein GI209397428 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0889309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTC TGGTCACTGG TGGTAGCGGT TACATTGGAA GTCATACCTG TGTGCAATTA 
CTGCAAAACG GTCATGATGT CATCATTCTT GATAACCTCT GTAACAGTAA GCGCAGCGTA
CTGCCTGTTA TCGAGCGTTT AGGCGGCAAA CATCCGACGT TTGTTGAAGG CGATATCCGT
AACGAAGCGT TGATGACCGA GATCCTGCAC GATCACGCTA TCGACACCGT GATCCACTTC
GCCGGGCTGA AAGCCGTTGG CGAATCGGTA CAAAAACCGC TGGAATATTA CGACAACAAT
GTCAACGGTA CTCTGCGCCT GATTAGCGCC ATGCGCGCCG CTAACGTCAA AAACTTTATT
TTTAGCTCCT CCGCCACCGT TTATGGCGAT CAGCCCAAAA TTCCATACGT TGAAAGCTTC
CCGACCGGCA CACCGCAAAG CCCTTACGGC AAAAGCAAGC TGATGGTGGA ACAGATCCTC
ACCGATCTGC AAAAAGCCCA GCCGGACTGG AGCATTGCCC TGCTGCGCTA CTTCAACCCG
GTTGGCGCAC ATCCGTCGGG CGATATGGGC GAAGATCCGC AAGGCATTCC GAATAACCTT
ATGCCATACA TCGCCCAGGT TGCTGTAGGC CGTCGCGACT CGCTGGCGAT TTTTGGTAAC
GATTATCCGA CCGAAGACGG TACTGGCGTA CGCGATTACA TCCACGTAAT GGATCTGGCG
GACGGTCACG TCGTGGCGAT GGAAAAACTG GCGAACAAGC CAGGCGTACA CATCTACAAC
CTCGGTGCTG GCGTAGGCAG CAGCGTGCTG GACGTGGTTA ATGCCTTCAG CAGAGCCTGC
GGCAAACCGG TTAACTATCA TTTTGCACCG CGTCGCGAGG GCGACCTTCC GGCCTACTGG
GCGGACGCCA GCAAAGCCGA CCGTGAACTG AACTGGCGCG TAACGCGCAC ACTCGATGAA
ATGGCGCAGG ACACCTGGCA CTGGCAGTCA CGCCATCCAC AGGGATATCC CGATTAA
 
Protein sequence
MRVLVTGGSG YIGSHTCVQL LQNGHDVIIL DNLCNSKRSV LPVIERLGGK HPTFVEGDIR 
NEALMTEILH DHAIDTVIHF AGLKAVGESV QKPLEYYDNN VNGTLRLISA MRAANVKNFI
FSSSATVYGD QPKIPYVESF PTGTPQSPYG KSKLMVEQIL TDLQKAQPDW SIALLRYFNP
VGAHPSGDMG EDPQGIPNNL MPYIAQVAVG RRDSLAIFGN DYPTEDGTGV RDYIHVMDLA
DGHVVAMEKL ANKPGVHIYN LGAGVGSSVL DVVNAFSRAC GKPVNYHFAP RREGDLPAYW
ADASKADREL NWRVTRTLDE MAQDTWHWQS RHPQGYPD