Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0964 |
Symbol | |
ID | 6972162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 978683 |
End bp | 980266 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643384985 |
Product | sulfatase family protein |
Protein accession | YP_002269485 |
Protein GI | 209398680 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00476272 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTAA CCCTCAAAGA ATCGCTTGTT ACCCGTAGCC GGGTATTTAG CCCGTGGACT GCGTTCTACT TTTTACAGTC GCTATTAATT AACCTCGGCT TAGGTTACCC CTTCAGTTTG CTCTACACCG CTGCGTTTAC GGCTATTTTG CTTTTGCTAT GGCGAACATT GCCTCGCGTA CAAAAAGTTC TGGTCGGTGT CAGTTCGCTG GTGGCGGCTT GTTATTTCCC TTTTGCTCAG GCCTACGGCG CGCCTAATTT CAATACATTG CTGGCATTGC ACTCCACCAA TATGGAAGAG TCGACCGAAA TCCTGACGAT TTTTCCGTGG TACAGCTACC TGGTCGGCTT ATTTATTTTT GCGCTCGGCG TAATAGCAAT CAGGCGAAAA AAAGAGAATG AAAAAGCGCG CTGGAATACC TTCGACAGCC TGTGTCTGGT ATTCAGTGTG GCGACATTTT TTGTTGCTCC CGTGCAAAAC CTGGCCTGGG GTGGCGTATT TAAACTGAAA GATACTGGCT ATCCGGTATT TCGTTTTGCT AAGGATGTCA TCGTCAATAA TAACGAGGTG ATTGAAGAGC AAGAACGGAT GGCAAAACTT TCCGGAATGA AAGATACCTG GACGGTCACT GCCGTTAAGC CGAAGTATCA GACCTATGTG GTGGTGATCG GTGAAAGCGC GCGTCGCGAT GCCCTCGGTG CCTTTGGCGG TCACTGGGAC AATACCCCGT TTGCCAGCAG CGTTAACGGT TTGATATTTG CTGACTACAT TGCCGCCAGT GGCTCCACGC AGAAATCGCT TGGCTTAACG CTCAATCGCG TAGTCGATGG CAAACCACAG TTTCAGGATA ACTTTGTCAC CCTGGCAAAT CGCGCGGGCT TCCAGACCTG GTGGTTTTCC AACCAGGGCC AAATCGGCGA ATACGATACT GCTATCGCCA GCATCGCCAA ACGTGCAGAT GAAGTGTACT TCCTGAAAGA AGGTAATTTT GAAGCAGATA AAAACACGAA AGACGAAGCG TTACTGGATA TGACCGCTCA AGTGCTGGCG CAAGAGCACT CGCAACCGCA GCTGATTGTT CTGCATCTGA TGGGCTCGCA TCCGCAGGCC TGCGACCGGA CAAAAGGCAA ATACGAAACC TTTGTGCAAT CGAAAGAAAC GTCGTGCTAT CTCTATACTA TGACGCAAAC GGACGATTTA CTGCGCAAGC TGTACGATCA GTTACGCAAC AGCGGCAGCA GCTTCTCGCT GGTTTACTTT TCTGACCACG GTCTGGCCTT TAAAGAGCGC GGCAAAGACG TGCAATACCT TGCCCATGAT GATAAGTATC AGCAAAATTT CCAGGTGCCT TTTATGGTCA TTTCCAGCGA CGATAAAGCG CATCGGGTGA TTAAAGCCCG CCGCTCAGCC AATGACTTCT TAGGCTTTTT CTCGCAGTGG ACGGGAATTA AAGCGAAGGA AATTAACATC AAATACCCGT TTATATCTGA GAAGAAAGCC GGGCCGATAT ACATCACCAA CTTCCAGTTA CAGAAGGTGG ATTACAACCA TCTCGGAACC GATATTTTCG ACCCGAAGCC TTAA
|
Protein sequence | MNLTLKESLV TRSRVFSPWT AFYFLQSLLI NLGLGYPFSL LYTAAFTAIL LLLWRTLPRV QKVLVGVSSL VAACYFPFAQ AYGAPNFNTL LALHSTNMEE STEILTIFPW YSYLVGLFIF ALGVIAIRRK KENEKARWNT FDSLCLVFSV ATFFVAPVQN LAWGGVFKLK DTGYPVFRFA KDVIVNNNEV IEEQERMAKL SGMKDTWTVT AVKPKYQTYV VVIGESARRD ALGAFGGHWD NTPFASSVNG LIFADYIAAS GSTQKSLGLT LNRVVDGKPQ FQDNFVTLAN RAGFQTWWFS NQGQIGEYDT AIASIAKRAD EVYFLKEGNF EADKNTKDEA LLDMTAQVLA QEHSQPQLIV LHLMGSHPQA CDRTKGKYET FVQSKETSCY LYTMTQTDDL LRKLYDQLRN SGSSFSLVYF SDHGLAFKER GKDVQYLAHD DKYQQNFQVP FMVISSDDKA HRVIKARRSA NDFLGFFSQW TGIKAKEINI KYPFISEKKA GPIYITNFQL QKVDYNHLGT DIFDPKP
|
| |