Gene EcolC_2903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2903 
Symbol 
ID6065378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3163512 
End bp3164528 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID641602308 
ProductUDP-galactose-4-epimerase 
Protein accessionYP_001725857 
Protein GI170020903 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.348521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTC TGGTTACCGG TGGTAGCGGT TACATTGGAA GTCATACCTG TGTGCAATTA 
CTGCAAAACG GTCATGATGT CATCATTCTT GATAACCTCT GTAACAGTAA GCGCAGCGTA
CTGCCTGTTA TCGAGCGTTT AGGCGGCAAA CATCCAACGT TTGTTGAAGG CGATATTCGT
AACGAAGCGT TGATGACCGA GATCCTGCAC GATCACGCTA TCGACACCGT GATCCACTTC
GCCGGGCTGA AAGCCGTGGG CGAATCGGTA CAAAAACCGC TGGAATATTA CGACAACAAT
GTCAACGGCA CTCTGCGCCT GATTAGCGCC ATGCGCGCCG CTAACGTCAA AAACTTTATT
TTTAGCTCCT CCGCCACCGT TTATGGCGAT CAGCCCAAAA TTCCATACGT TGAAAGCTTC
CCGACCGGCA CACCGCAAAG CCCTTACGGC AAAAGCAAGC TGATGGTGGA ACAGATCCTC
ACCGATCTGC AAAAAGCCCA GCCGGACTGG AGCATTGCCC TGCTGCGCTA CTTCAACCCG
GTTGGCGCGC ATCCGTCGGG CGATATGGGC GAAGATCCGC AAGGCATTCC GAATAACCTG
ATGCCATACA TCGCCCAGGT TGCTGTAGGC CGTCGCGACT CGCTGGCGAT TTTTGGTAAC
GATTATCCGA CCGAAGATGG TACTGGCGTA CGCGATTACA TCCACGTAAT GGATCTGGCG
GACGGTCACG TCGTGGCGAT GGAAAAACTG GCGAACAAGC CAGGCGTACA CATCTACAAC
CTCGGCGCTG GCGTAGGCAA CAGCGTGCTG GACGTGGTTA ATGCCTTCAG CAAAGCCTGC
GGCAAACCGG TTAATTATCA TTTTGCACCG CGTCGCGAGG GCGACCTTCC GGCCTACTGG
GCGGACGCCA GCAAAGCCGA CCGTGAACTG AACTGGCGCG TAACGCGCAC ACTCGATGAA
ATGGCGCAGG ACACCTGGCA CTGGCAGTCA CGCCATCCAC AGGGATATCC CGATTAA
 
Protein sequence
MRVLVTGGSG YIGSHTCVQL LQNGHDVIIL DNLCNSKRSV LPVIERLGGK HPTFVEGDIR 
NEALMTEILH DHAIDTVIHF AGLKAVGESV QKPLEYYDNN VNGTLRLISA MRAANVKNFI
FSSSATVYGD QPKIPYVESF PTGTPQSPYG KSKLMVEQIL TDLQKAQPDW SIALLRYFNP
VGAHPSGDMG EDPQGIPNNL MPYIAQVAVG RRDSLAIFGN DYPTEDGTGV RDYIHVMDLA
DGHVVAMEKL ANKPGVHIYN LGAGVGNSVL DVVNAFSKAC GKPVNYHFAP RREGDLPAYW
ADASKADREL NWRVTRTLDE MAQDTWHWQS RHPQGYPD