Gene EcSMS35_2265 details

Gene Information

Locus tag: EcSMS35_2265
Symbol: glf
ID: 6144907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2286363
End bp: 2287517
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 44%
IMG OID: 641617140
Product: UDP-galactopyranose mutase
Protein accession: YP_001744313
Protein GI: 170680099
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0562] UDP-galactopyranose mutase
TIGRFAM ID: [TIGR00031] UDP-galactopyranose mutase
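
The start, end, gene length, and protein length reported above are mutually consistent. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates (the record does not state its convention) and assuming the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
start_bp, end_bp = 2286363, 2287517

length_bp = gene_length_bp(start_bp, end_bp)
length_aa = protein_length_aa(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the record's coordinates this gives 1155 bp and 384 aa, matching the Gene Length and Protein Length fields.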


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGTA ACAATATTCT CATCGTCGGC GCTGGTTTTT CAGGTGTGGT CATTGCCCGC 
CAGCTTGCTG AACAAGGTCA CAAGGTTAAA ATCATCGATC AGCGTGACCA CATTGGAGGA
AACTCTTATG ACACTCGCGA TCCTCAGACT GATGTCATGG TTCATGTCTA CGGTCCGCAT
ATTTTCCATA CGGATAATGA AACCGTCTGG AACTATGTCA ATCAGTATGC GGAAATGGTG
CCCTATGTGA ATAGAGTAAA AGCTACTGTT AATGGTCAGG TTTTTTCACT GCCGATTAAT
CTGCATACTA TTAATCAGTT CTTTGCTAAA ACCTGTTCTC CTGATGAAGC TCGCGCGCTC
ATTAGCGAAA AAGGTGATAG TTCCATTCTG GAACCGAAGA CTTTTGAAGA GCAGGCCTTG
CGCTTCATTG GTAAAGAATT ATACGAAGCG TTCTTCAAAG GCTACACCAT AAAACAGTGG
GGGATGCAGC CTTCTGAGCT TCCGGCATCT ATACTCAAAC GTTTGCCTGT GCGTTTCAAT
TATGACGATA ATTATTTCAA TCATAAATTC CAGGGCATGC CGAAGTTAGG CTATACCCAT
ATGATTGAAG CGATCGCCGA TCATGAAAAT ATCACTCTGC AATTACAGCG TGAGTTTGCG
GCTGAAAATC GCGAAAGTTA TGATCATGTA TTTTATAGCG GGCCGTTAGA CGCCTTCTAT
TCATACCAGC ATGGGCGTCT TGGCTACCGT ACTCTGGACT TTGAGCGATT TACCTGGCAA
GGTGATTACC AGGGCTGCGC GGTCATGAAC TATTGCTCAG TTGATGTCCC GTATACGCGC
ATTACCGAAC ATAAATATTT TTCCCCGTGG GAAAACCACG AAGGCTCCGT ATGTTACAAA
GAATATAGTC GCGCCTGCGG TGAAAATGAT ATTCCTTACT ATCCAATCCG CCAGATGGGC
GAGATGGCTC TGCTGGATAA ATATCTATCA CTGGCGGAAG GTGAGAAAAA TATTACTTTC
GTCGGACGCT TAGGGACTTA TCGCTATCTT GATATGGATG TAACGATTGC AGAAGCTTTA
AAAACGGCCG ATAAATACCT GTCTTCATTG TCCAACGACG AAACAATGCC TGTATTTGTG
GCCGATGTAC GATGA
 
Protein sequence
MKRNNILIVG AGFSGVVIAR QLAEQGHKVK IIDQRDHIGG NSYDTRDPQT DVMVHVYGPH 
IFHTDNETVW NYVNQYAEMV PYVNRVKATV NGQVFSLPIN LHTINQFFAK TCSPDEARAL
ISEKGDSSIL EPKTFEEQAL RFIGKELYEA FFKGYTIKQW GMQPSELPAS ILKRLPVRFN
YDDNYFNHKF QGMPKLGYTH MIEAIADHEN ITLQLQREFA AENRESYDHV FYSGPLDAFY
SYQHGRLGYR TLDFERFTWQ GDYQGCAVMN YCSVDVPYTR ITEHKYFSPW ENHEGSVCYK
EYSRACGEND IPYYPIRQMG EMALLDKYLS LAEGEKNITF VGRLGTYRYL DMDVTIAEAL
KTADKYLSSL SNDETMPVFV ADVR
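
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon table from the compact 64-letter amino-acid string used for the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs only in which codons may act as start codons). The helper names `translate` and `gc_percent` are hypothetical. Whitespace is stripped first because the sequences on this page are printed in blocks of ten.

```python
from itertools import product

# Codon order: first base varies slowest, third base fastest, bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last blocks of the gene sequence shown above.
print(translate("ATGAAGCGTA ACAATATTCT CATCGTCGGC"))  # first ten residues
print(translate("GCCGATGTAC GATGA"))                  # last four residues, then stop
```

Applied to the first three blocks of the gene sequence, `translate` returns `MKRNNILIVG`, and applied to the final line it returns `ADVR`, in agreement with the start and end of the protein sequence listed here. Passing the full gene sequence to `gc_percent` would give the value that the GC content field rounds to a whole percentage.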