Gene Information |
Locus tag | EcE24377A_0885 |
Symbol | |
ID | 5587604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 899047 |
End bp | 900630 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640924595 |
Product | sulfatase family protein |
Protein accession | YP_001462010 |
Protein GI | 157154892 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0481411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTAA CCCTCAAAGA ATCGCTTGTT ACCCGTAGCC GGGTATTTAG CCCGTGGACT GCGTTCTACT TTTTACAGTC GCTATTAATT AACCTCGGCT TAGGTTACCC CTTCAGTTTG CTCTACACCG CTGCGTTTAT GGCTATTTTG CTTTTGCTAT GGCGAACATT GCCTCGCGTA CAAAAAGTTC TGGTCGGTGT CAGTTCGCTG GTGGCGGCTT GTTATTTCCC TTTTGCTCAG GCCTACGGCG CGCCTAATTT CAATACATTG CTGGCATTGC ACTCCACCAA TATGGAAGAG TCGACCGAAA TCCTGACGAT TTTTCCGTGG TACAGCTACC TGGTCGGCTT ATTTATTTTT GCGCTCGGCG TAATAGCAAT CAGGCGAAAA AAAGAGAATG AAAAAGCGCG CTGGAATACC TTCGACAGCC TGTGTCTGGT ATTCAGTGTG GCGACATTTT TTGTTGCTCC CGTGCAAAAC CTGGCCTGGG GTGGCGTATT TAAACTGAAA GATACTGGCT ATCCGGTATT TCGTTTTGCT AAGAATGTCA TCGTCAATAA TAACGAGGTG ATTGAAGAGC AAGAACGGAT GGCAAAACTT TCCGGAATGA AAGATACCTG GACGGTCACT GCCGTTAAGC CGAAGTATCA GACCTATGTG GTGGTGATCG GTGAAAGCGC GCGTCGCGAT GCCCTCGGTG CCTTTGGCGG TCACTGGGAC AATACCCCGT TTGCCAGCAG CGTTAACGGT TTGATATTTG CTGACTACAT TGCCGCCAGT GGCTCCACGC AGAAATCGCT TGGCTTAACG CTCAATCGCG TAGTCGATGG CAAACCACAG TTTCAGGATA ACTTTGTCAC CCTGGCAAAT CGCGCGGGCT TCCAGACCTG GTGGTTTTCC AACCAGGGTC AAATCGGCGA ATACGATACC GCTATCGCCA GCATCGCCAA ACGAGCAGAT GAAGTGTACT TCCTGAAAGA AGGTAATTTT GAAGCAGATA AAAATACCAA AGACGAAGCG TTACTGGATA TGACCGCTCA AGTGCTGGCG CAAGAGCACT CGCAACCGCA GCTGATTGTT CTACATCTGA TGGGCTCACA TCCGCAGGCC TGCGACAGGA CACAAGGCAA ATACGAAACC TTTGTGCAGT CGAAAGAAAC GTCGTGCTAT CTCTATACCA TGACGCAAAC GGACGATTTA CTGCGCAAGC TGTACGATCA GTTACGCAAC AGCGGCAGCA GCTTCTCGCT GGTTTACTTT TCTGACCACG GTCTGGCCTT TAAAGAGCGC GGCAAAGACG TGCAATACCT TGCCCATGAT GATAAGTATC AGCAAAATTT CCAGGTGCCT TTTATGGTCA TTTCCAGCGA CGATAAAGCG CATCGGGTGA TTAAAGCCCG CCGCTCAGCC AATGACTTCT TAGGTTTTTT CTCCCAGTGG ACGGGAATTA AAGCGAAGGA AATTAACATC AAATACCCGT TTATATCTGA GAAGAAAGCC GGGTCGATAT ACATCACCAA CTTCCAGTTA CAGAAGGTGG ATTACAACCA TCTCGGAACC GATATTTTCG ACCCGAAGCC TTAA
|
Protein sequence | MNLTLKESLV TRSRVFSPWT AFYFLQSLLI NLGLGYPFSL LYTAAFMAIL LLLWRTLPRV QKVLVGVSSL VAACYFPFAQ AYGAPNFNTL LALHSTNMEE STEILTIFPW YSYLVGLFIF ALGVIAIRRK KENEKARWNT FDSLCLVFSV ATFFVAPVQN LAWGGVFKLK DTGYPVFRFA KNVIVNNNEV IEEQERMAKL SGMKDTWTVT AVKPKYQTYV VVIGESARRD ALGAFGGHWD NTPFASSVNG LIFADYIAAS GSTQKSLGLT LNRVVDGKPQ FQDNFVTLAN RAGFQTWWFS NQGQIGEYDT AIASIAKRAD EVYFLKEGNF EADKNTKDEA LLDMTAQVLA QEHSQPQLIV LHLMGSHPQA CDRTQGKYET FVQSKETSCY LYTMTQTDDL LRKLYDQLRN SGSSFSLVYF SDHGLAFKER GKDVQYLAHD DKYQQNFQVP FMVISSDDKA HRVIKARRSA NDFLGFFSQW TGIKAKEINI KYPFISEKKA GSIYITNFQL QKVDYNHLGT DIFDPKP
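The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon (the listed gene sequence ends in TAA). Neither convention is stated in the record, though the listed values are consistent with both. The function names `gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    included in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in cleaned)
    return 100.0 * gc_count / len(cleaned)


# Values copied from the record above.
length_bp = gene_length(899047, 900630)
print(length_bp)                           # 1584, matches "Gene Length"
print(expected_protein_length(length_bp))  # 527, matches "Protein Length"

# First two 10-base blocks of the gene sequence, as a small demonstration.
print(gc_percent("ATGAATTTAA CCCTCAAAGA"))  # 30.0
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the listed GC content, which the record appears to report rounded to a whole percent.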