Gene ECH74115_5814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5814 
Symbol 
ID6968185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5466215 
End bp5467321 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content44% 
IMG OID643389441 
ProductN-acetylneuraminic acid mutarotase 
Protein accessionYP_002273833 
Protein GI209397986 
COG category[S] Function unknown 
COG ID[COG3055] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03547] mutatrotase, YjhT family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.376687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA CAATAACGGC GCTTGCTATC CTAATGGCTT CATTTGCCGC AAACGCGTCT 
GTATTACCGG AAACTCCTGT ACCATTTAAA AGTGGTACCG GAGCAATTGA TAACGACACT
GTCTACATTG GTTTAGGTAG CGCAGGTACG GCATGGTACA AGCTGGAGAC ACAGGCCAAA
GATAAAAAAT GGACAGCGTT AGCTGCATTC CCTGGTGGAC CAAGAGATCA AGCAACCTCG
GCATTTATTG ATGGCAATCT GTATGTGTTT GGCGGCATTG GCAAAAACAG CGAGGGCTTG
ACTCAGGTAT TTAATGACGT ACACAAATAC AACCCCAAAA CCAATAGCTG GGTTAAATTG
ATATCGCACG CGCCGATGGG CATGGCGGGC CATGTGACTT TTGTACACAA CGGCAAGGCT
TATGTTACTG GCGGTGTTAA CCAGAATATC TTCAATGGCT ATTTTGAAGA TCTCAACGAA
GCTGGAAAAG ATTCAACCGC TGTAGATAAA ATCAATGCAC ACTATTTTGA CAAAAAAGCA
GAAGATTATT TCTTTAATAA GTTTCTGTTG TCTTTTGATC CCTCTACACA GCAATGGAGT
TACGCTGGCG AATCTCCCTG GTACGGGACG GCTGGTGCGG CGGTTGTGAA TAAAGGTGAT
AAAACCTGGC TTATTAATGG CGAAGCCAAA CCCGGATTGC GAACGGATGC CGTATTTGAA
CTTGATTTCA CCGGTAATAA TTTAAAATGG AATAGGCTTG CTCCCGTCTC ATCACCAGAT
GGCGTCGCTG GCGGTTTTGC GGGGATAAGC AATGATTCTC TTATATTTGC CGGAGGGGCC
GGATTCAAAG GTTCACGAGA AAATTACCAA AACGGTAAGA ACTATGCGCA TGAAGGCCTG
AAAAAATCAT ATAGCACTGA TATTCATCTT TGGCATAACG GGAAATGGGA TAAATCGGGT
GAATTATCGC AAGGTCGGGC CTACGGAGTA TCATTGCCCT GGAATAATAG TCTATTGATT
ATTGGCGGTG AAACTGCAGG CGGCAAAGCG GTGACGGATT CAGTTTTGAT CTCTGTGAAG
GATAATAAAG TCACAGTACA AAACTAA
 
Protein sequence
MNKTITALAI LMASFAANAS VLPETPVPFK SGTGAIDNDT VYIGLGSAGT AWYKLETQAK 
DKKWTALAAF PGGPRDQATS AFIDGNLYVF GGIGKNSEGL TQVFNDVHKY NPKTNSWVKL
ISHAPMGMAG HVTFVHNGKA YVTGGVNQNI FNGYFEDLNE AGKDSTAVDK INAHYFDKKA
EDYFFNKFLL SFDPSTQQWS YAGESPWYGT AGAAVVNKGD KTWLINGEAK PGLRTDAVFE
LDFTGNNLKW NRLAPVSSPD GVAGGFAGIS NDSLIFAGGA GFKGSRENYQ NGKNYAHEGL
KKSYSTDIHL WHNGKWDKSG ELSQGRAYGV SLPWNNSLLI IGGETAGGKA VTDSVLISVK
DNKVTVQN