Gene EcE24377A_0885 details


Gene Information

Locus tag: EcE24377A_0885
Symbol:
ID: 5587604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 899047
End bp: 900630
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 48%
IMG OID: 640924595
Product: sulfatase family protein
Protein accession: YP_001462010
Protein GI: 157154892
COG category: [R] General function prediction only
COG ID: [COG2194] Predicted membrane-associated, metal-dependent hydrolase
TIGRFAM ID:
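
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be checked directly. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state its convention) and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(899047, 900630)
print(length_bp, protein_length(length_bp))  # 1584 527
```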


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0481411
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTAA CCCTCAAAGA ATCGCTTGTT ACCCGTAGCC GGGTATTTAG CCCGTGGACT 
GCGTTCTACT TTTTACAGTC GCTATTAATT AACCTCGGCT TAGGTTACCC CTTCAGTTTG
CTCTACACCG CTGCGTTTAT GGCTATTTTG CTTTTGCTAT GGCGAACATT GCCTCGCGTA
CAAAAAGTTC TGGTCGGTGT CAGTTCGCTG GTGGCGGCTT GTTATTTCCC TTTTGCTCAG
GCCTACGGCG CGCCTAATTT CAATACATTG CTGGCATTGC ACTCCACCAA TATGGAAGAG
TCGACCGAAA TCCTGACGAT TTTTCCGTGG TACAGCTACC TGGTCGGCTT ATTTATTTTT
GCGCTCGGCG TAATAGCAAT CAGGCGAAAA AAAGAGAATG AAAAAGCGCG CTGGAATACC
TTCGACAGCC TGTGTCTGGT ATTCAGTGTG GCGACATTTT TTGTTGCTCC CGTGCAAAAC
CTGGCCTGGG GTGGCGTATT TAAACTGAAA GATACTGGCT ATCCGGTATT TCGTTTTGCT
AAGAATGTCA TCGTCAATAA TAACGAGGTG ATTGAAGAGC AAGAACGGAT GGCAAAACTT
TCCGGAATGA AAGATACCTG GACGGTCACT GCCGTTAAGC CGAAGTATCA GACCTATGTG
GTGGTGATCG GTGAAAGCGC GCGTCGCGAT GCCCTCGGTG CCTTTGGCGG TCACTGGGAC
AATACCCCGT TTGCCAGCAG CGTTAACGGT TTGATATTTG CTGACTACAT TGCCGCCAGT
GGCTCCACGC AGAAATCGCT TGGCTTAACG CTCAATCGCG TAGTCGATGG CAAACCACAG
TTTCAGGATA ACTTTGTCAC CCTGGCAAAT CGCGCGGGCT TCCAGACCTG GTGGTTTTCC
AACCAGGGTC AAATCGGCGA ATACGATACC GCTATCGCCA GCATCGCCAA ACGAGCAGAT
GAAGTGTACT TCCTGAAAGA AGGTAATTTT GAAGCAGATA AAAATACCAA AGACGAAGCG
TTACTGGATA TGACCGCTCA AGTGCTGGCG CAAGAGCACT CGCAACCGCA GCTGATTGTT
CTACATCTGA TGGGCTCACA TCCGCAGGCC TGCGACAGGA CACAAGGCAA ATACGAAACC
TTTGTGCAGT CGAAAGAAAC GTCGTGCTAT CTCTATACCA TGACGCAAAC GGACGATTTA
CTGCGCAAGC TGTACGATCA GTTACGCAAC AGCGGCAGCA GCTTCTCGCT GGTTTACTTT
TCTGACCACG GTCTGGCCTT TAAAGAGCGC GGCAAAGACG TGCAATACCT TGCCCATGAT
GATAAGTATC AGCAAAATTT CCAGGTGCCT TTTATGGTCA TTTCCAGCGA CGATAAAGCG
CATCGGGTGA TTAAAGCCCG CCGCTCAGCC AATGACTTCT TAGGTTTTTT CTCCCAGTGG
ACGGGAATTA AAGCGAAGGA AATTAACATC AAATACCCGT TTATATCTGA GAAGAAAGCC
GGGTCGATAT ACATCACCAA CTTCCAGTTA CAGAAGGTGG ATTACAACCA TCTCGGAACC
GATATTTTCG ACCCGAAGCC TTAA
 
Protein sequence
MNLTLKESLV TRSRVFSPWT AFYFLQSLLI NLGLGYPFSL LYTAAFMAIL LLLWRTLPRV 
QKVLVGVSSL VAACYFPFAQ AYGAPNFNTL LALHSTNMEE STEILTIFPW YSYLVGLFIF
ALGVIAIRRK KENEKARWNT FDSLCLVFSV ATFFVAPVQN LAWGGVFKLK DTGYPVFRFA
KNVIVNNNEV IEEQERMAKL SGMKDTWTVT AVKPKYQTYV VVIGESARRD ALGAFGGHWD
NTPFASSVNG LIFADYIAAS GSTQKSLGLT LNRVVDGKPQ FQDNFVTLAN RAGFQTWWFS
NQGQIGEYDT AIASIAKRAD EVYFLKEGNF EADKNTKDEA LLDMTAQVLA QEHSQPQLIV
LHLMGSHPQA CDRTQGKYET FVQSKETSCY LYTMTQTDDL LRKLYDQLRN SGSSFSLVYF
SDHGLAFKER GKDVQYLAHD DKYQQNFQVP FMVISSDDKA HRVIKARRSA NDFLGFFSQW
TGIKAKEINI KYPFISEKKA GSIYITNFQL QKVDYNHLGT DIFDPKP
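
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned, a GC fraction computed, and a coding sequence translated. It assumes the codon assignments of translation table 11, which match the standard code for elongation codons (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = clean("ATGAATTTAA CCCTCAAAGA ATCGCTTGTT")
print(translate(first_blocks))  # MNLTLKESLV
```

Running `translate` on the first 30 bases reproduces the first block of the protein sequence, and applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.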