Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2733 |
Symbol | |
ID | 6972312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2557668 |
End bp | 2558483 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386592 |
Product | mannosyl-3-phosphoglycerate phosphatase |
Protein accession | YP_002271071 |
Protein GI | 209397957 |
COG category | [R] General function prediction only |
COG ID | [COG3769] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01484] HAD-superfamily hydrolase, subfamily IIB [TIGR01486] mannosyl-3-phosphoglycerate phosphatase family [TIGR02463] mannosyl-3-phosphoglycerate phosphatase-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000138935 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.573407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCAA TTCAACAACC ACTACTGGTT TTTAGCGATC TTGATGGCAC CCTGCTGGAC AGTCATAGTT ATGACTGGCA ACCGGCAGCC CCCTGGCTCA GCCGTTTACG CGAAGCAAAT GTTCCCGTCA TTCTCTGTAG CAGTAAAACA TCAGCGGAAA TGCTGTACTT GCAAAAAATG TTGGGGCTAC AAGGTTTACC GCTGATTGCA GAGAATGGCG CAGTGATCCA GCTTGCTGAA CAATGGCAGG AGATAGACGG TTTTCCACGC ATCATCTCAG GTATTAGCCA TGGCGAAATC AGCCAGGTTT TAAATACGCT ACGCGAGAAA GAACATTTTA AATTCACGAC TTTTGATGAT GTCGACGATG CAACCATCGC CGAATGGACG GGATTAAGCC GTAGCCAGGC GGCGCTGACG CAGCTTCATG AGGCGTCGGT AACGCTAATC TGGCGCGACA GTGACGAGCG TATGGCACAA TTTACCGCTC GTCTGAACGA ACTGGGCTTA CAGTTTATGC AAGGTGCGCG CTTCTGGCAC GTCCTGGATG CCTCTGCCGG AAAAGATCAG GCTGCCAACT GGATTATCGC GACCTATCAA CAATTGTCAG GCAAACGCCC AACCACACTT GGCCTGGGCG ATGGGCCAAA CGATGCGCCC TTACTGGAGG TAATGGATTA CGCGGTGATT GTGAAAGGGC TAAACCGTGA AGGGGTGCAT CTGCATGATG AGGATCCGGC CCGCGTCTGG CGAACGCAGC GTGAAGGACC GGAAGGATGG CGTGAAGGGC TGGACCATTT TTTCTCCGCC CGTTAA
|
Protein sequence | MFSIQQPLLV FSDLDGTLLD SHSYDWQPAA PWLSRLREAN VPVILCSSKT SAEMLYLQKM LGLQGLPLIA ENGAVIQLAE QWQEIDGFPR IISGISHGEI SQVLNTLREK EHFKFTTFDD VDDATIAEWT GLSRSQAALT QLHEASVTLI WRDSDERMAQ FTARLNELGL QFMQGARFWH VLDASAGKDQ AANWIIATYQ QLSGKRPTTL GLGDGPNDAP LLEVMDYAVI VKGLNREGVH LHDEDPARVW RTQREGPEGW REGLDHFFSA R
|
| |