Gene ECH74115_2733 details


Gene Information

Locus tag: ECH74115_2733
Symbol:
ID: 6972312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2557668
End bp: 2558483
Gene Length: 816 bp
Protein Length: 271 aa
Translation table: 11
GC content: 52%
IMG OID: 643386592
Product: mannosyl-3-phosphoglycerate phosphatase
Protein accession: YP_002271071
Protein GI: 209397957
COG category: [R] General function prediction only
COG ID: [COG3769] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
            [TIGR01486] mannosyl-3-phosphoglycerate phosphatase family
            [TIGR02463] mannosyl-3-phosphoglycerate phosphatase-related protein
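
The start and end coordinates, gene length, and protein length in this record agree with one another. The sketch below shows the arithmetic in Python. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither convention is stated on this page. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2557668, 2558483)
print(length)                     # 816
print(protein_length_aa(length))  # 271
```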


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000138935
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.573407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCAA TTCAACAACC ACTACTGGTT TTTAGCGATC TTGATGGCAC CCTGCTGGAC 
AGTCATAGTT ATGACTGGCA ACCGGCAGCC CCCTGGCTCA GCCGTTTACG CGAAGCAAAT
GTTCCCGTCA TTCTCTGTAG CAGTAAAACA TCAGCGGAAA TGCTGTACTT GCAAAAAATG
TTGGGGCTAC AAGGTTTACC GCTGATTGCA GAGAATGGCG CAGTGATCCA GCTTGCTGAA
CAATGGCAGG AGATAGACGG TTTTCCACGC ATCATCTCAG GTATTAGCCA TGGCGAAATC
AGCCAGGTTT TAAATACGCT ACGCGAGAAA GAACATTTTA AATTCACGAC TTTTGATGAT
GTCGACGATG CAACCATCGC CGAATGGACG GGATTAAGCC GTAGCCAGGC GGCGCTGACG
CAGCTTCATG AGGCGTCGGT AACGCTAATC TGGCGCGACA GTGACGAGCG TATGGCACAA
TTTACCGCTC GTCTGAACGA ACTGGGCTTA CAGTTTATGC AAGGTGCGCG CTTCTGGCAC
GTCCTGGATG CCTCTGCCGG AAAAGATCAG GCTGCCAACT GGATTATCGC GACCTATCAA
CAATTGTCAG GCAAACGCCC AACCACACTT GGCCTGGGCG ATGGGCCAAA CGATGCGCCC
TTACTGGAGG TAATGGATTA CGCGGTGATT GTGAAAGGGC TAAACCGTGA AGGGGTGCAT
CTGCATGATG AGGATCCGGC CCGCGTCTGG CGAACGCAGC GTGAAGGACC GGAAGGATGG
CGTGAAGGGC TGGACCATTT TTTCTCCGCC CGTTAA
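
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence laid out in space-separated blocks of ten like the one above. The sketch below is one way to do it, not necessarily how the database computed its value. It assumes that only G and C count towards the numerator and that whitespace is layout only; the function name is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is treated as layout and ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy input (not taken from this record): 4 of 8 bases are G or C.
print(gc_percent("GGCC AATT"))  # 50.0
```

To check the record, pass the full gene sequence (all rows joined) to `gc_percent` and round the result.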
 
Protein sequence
MFSIQQPLLV FSDLDGTLLD SHSYDWQPAA PWLSRLREAN VPVILCSSKT SAEMLYLQKM 
LGLQGLPLIA ENGAVIQLAE QWQEIDGFPR IISGISHGEI SQVLNTLREK EHFKFTTFDD
VDDATIAEWT GLSRSQAALT QLHEASVTLI WRDSDERMAQ FTARLNELGL QFMQGARFWH
VLDASAGKDQ AANWIIATYQ QLSGKRPTTL GLGDGPNDAP LLEVMDYAVI VKGLNREGVH
LHDEDPARVW RTQREGPEGW REGLDHFFSA R
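
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard compact layout (bases in the order T, C, A, G). Translation table 11 uses the same amino-acid assignments as the standard code and differs in which start codons are permitted; the sketch does not model alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; unrecognised codons become 'X';
    an incomplete trailing codon is dropped.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTTTCAA TTCAACAACC ACTACTGGTT"))  # MFSIQQPLLV
```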