Gene EcSMS35_0782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0782 
SymbolgalE 
ID6144838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp783049 
End bp784065 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID641615670 
ProductUDP-galactose-4-epimerase 
Protein accessionYP_001742862 
Protein GI170683375 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTC TGGTTACCGG TGGTAGCGGT TACATTGGAA GTCATACCTG TGTGCAATTA 
CTGCAAAACG GTCATGATGT CATCATTCTT GATAACCTCT GTAACAGTAA GCGCAGCGTA
CTGCCTGTTA TCGAGCGTTT AGGCGGCAAA CATCCAACGT TTGTTGAAGG CGATATCCGT
AACGAAGCGT TGATGACCGA GATCCTGCAC GATCACGCTA TCGACACCGT GATCCACTTC
GCCGGGCTGA AAGCCGTTGG CGAATCGGTA CAAAAACCGC TTGAATATTA CGACAACAAT
GTCAACGGTA CTCTGCGCCT GATTAGCGCC ATGCGCGCCG CTAACGTCAA AAACTTTATC
TTTAGCTCCT CCGCCACCGT TTATGGCGAT CAGCCCAAAA TTCCATATGT TGAAAGCTTC
CCGACCGGCA CACCGCAAAG CCCTTACGGC AAAAGCAAGC TGATGGTGGA ACAGATCCTC
ACCGATCTGC AAAAAGCCCA GCCGGACTGG AGCATTGCCC TGCTGCGCTA CTTCAACCCA
GTTGGCGCGC ATCCGTCGGG CGATATGGGC GAAGATCCGC AAGGCATTCC GAATAACCTG
ATGCCATACA TCGCCCAGGT TGCTGTAGGC CGTCGCGACT CGCTGGCGAT TTTTGGTAAC
GATTATCCGA CCGAAGACGG TACTGGCGTG CGCGATTACA TCCACGTAAT GGATCTGGCG
GACGGTCACG TCGTAGCGAT GGAAAAACTG GCGAACAAGC CAGGCGTACA CATCTACAAC
CTCGGCGCTG GCGTGGGCAG CAGCGTGCTG GACGTGGTTA ATGCCTTCAG CAAAGCCTGC
GGCAAACCGG TTAATTATCA TTTTGCACCG CGTCGCGAGG GCGACCTTCC GGCCTACTGG
GCGGACGCCA GCAAAGCCGA CCGTGAACTG AACTGGCGCG TAACGCGCAC ACTCGATGAA
ATGGCGCAGG ACACCTGGCA CTGGCAGTCA CGCCATCCAC AGGGATATCC CGATTAA
 
Protein sequence
MRVLVTGGSG YIGSHTCVQL LQNGHDVIIL DNLCNSKRSV LPVIERLGGK HPTFVEGDIR 
NEALMTEILH DHAIDTVIHF AGLKAVGESV QKPLEYYDNN VNGTLRLISA MRAANVKNFI
FSSSATVYGD QPKIPYVESF PTGTPQSPYG KSKLMVEQIL TDLQKAQPDW SIALLRYFNP
VGAHPSGDMG EDPQGIPNNL MPYIAQVAVG RRDSLAIFGN DYPTEDGTGV RDYIHVMDLA
DGHVVAMEKL ANKPGVHIYN LGAGVGSSVL DVVNAFSKAC GKPVNYHFAP RREGDLPAYW
ADASKADREL NWRVTRTLDE MAQDTWHWQS RHPQGYPD