Gene EcSMS35_1884 details


Gene Information

Locus tag: EcSMS35_1884
Symbol:
ID: 6147190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1904356
End bp: 1906149
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 32%
IMG OID: 641616759
Product: sulfatase family protein
Protein accession: YP_001743937
Protein GI: 170683822
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
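
The length fields above can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 1904356
end_bp = 1906149

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1794
print(protein_length)  # 597
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.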


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00298505
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTATCG GTTTGAGCAA AGTCGTTACA AGATATAAAA ACTTAAAGTT GAAATCATAT 
GTTTTCGACA TTGTCAACGT TATGTTTTTA TTGTACTTAT TTTGTTTTCT AAATCAGTCA
ATAGTTATCA TAGGAAATAT TGAACCGCCT AAAGAGTTAA TTGGTGCAAT ATTGAATTTC
ATCTGGCGTT CATCCCCCAT TTTGTTTATT TATTTGTTAT TTAGAATAAT AAACATTCCA
ATTTTTCTAT GTTCTTATTT ATCAGGCATC GCAATTATAA CATTGATGTT TGTGAACAGT
ACTAAAATGG CGTTAACCGG TGAGCCTTTA TCATTTAATG ACATTGTTTC TGGATTCAAT
TTGAGTGTCG CAGGAAAATA CCTCAACTTA TCATCCATTA CTTACTCTGT CGCTATTCTA
ATTGGCGCAA TAATCTCAAT CCTACTTAGC AAAAAAACGG TCACAACAAG AAAAAATTAT
ATATCGTTAT TTCTTTTAAT TATCATTACG ATCCCCTTTT CATTCTACCC CTATATTGGA
ACCATTTTTA GCCCTAACAA TAGGATTTCT GAAATCATAA ACTCCAGCGC GAACAACTTA
AATATAAAAT ATATTTACTG GAATTGGTCT CAAAATATTG GCGTTAATGG CTTGCCAATG
CATATAATAC AAACAAGTGT CAGAAAATCT GTACCACATG CAACTGATGC AGAACAATTA
AGATATTTAG ATGAAAAAAG ATCTTTAGTA TCCACTCATA TAAAAAATAA AACCATTATT
TATGTTTTAT GTGAATCATG TTGGTATGAT AAAAAAAACT TCCGAGAAAA TTTCAAACCT
GTTATTGATG AAGGTTACAG TGAATTACGG GCCATCTCAC CAGTTTACGG TGGAGGTACC
GCAAATGCCG AATTTGAAAT GCTGACAGGA CTACCAAGTA ATTCAGATAA AATATCAGGA
ATCATCTATC AAGAATATTC AGATGCATTC AAACCAAAAG TTGACGCATT ACCTAATGCC
CTGAAAAACA AAGGCTATAT TACTTTCGCA GCACATAATT ACCAAGCAGA CTTTTGGTAT
AGGAATAAAA TTTACCCTAA ATTTGGATTT GATAGATTTG ATAGCATTAT TAATATGGGA
GATTTACCAC CAGAGTATAG CAGTATTAAA AAACCATGGC AGTGGCAACC TGATGACTAC
CTGCTTTACA ATGCAGCATT AAAAGCCATC CAAAATGCAG GAAACAAAGA TATTTTTGTA
CATTTGATCA CTATGTCAAC TCATGGTCCT TATCAACACA TTAATGATTA TGGTGAAGGT
GTTTATACCT ATGAGATTAA TGAGGCAGTC AAAAGATTCG TTCAATTTTC ACAGCAAGTA
GAAAAAATAG ATCCAAATGC TGTAATCGTC TTTTACGGTG ACCATAAACC ACCACTAAAT
AAATATTTCG TCGAAAATGG GGTGCTACCA AGTAATATTT TCAACAAAAT CGGAGAAGAA
AAAGATGAAG ATTTCGTCTT TAAAATGAGC ACAACCCCAC TCGATTTTGG TGATGTTCCT
GTACTTATTA AATCAAATGA TAAAGACGCC ATTAAAAAAT TCATAACCGA CGCCAATGGA
AAACCTTTTT TCTGTGTATC TTCATTAGTT GATAAACATT TCATTCAATC TGGATTGGTA
TCATTTAATC ACAACGCAAA GCATGTCTGT GAAAACAGTG ATAACTTTAA TTATGATAAG
TTAATTAACA TGACCCCGTC GTGGATTTAT TCAATGTCAT TATTTCACGA TTAG
 
Protein sequence
MSIGLSKVVT RYKNLKLKSY VFDIVNVMFL LYLFCFLNQS IVIIGNIEPP KELIGAILNF 
IWRSSPILFI YLLFRIINIP IFLCSYLSGI AIITLMFVNS TKMALTGEPL SFNDIVSGFN
LSVAGKYLNL SSITYSVAIL IGAIISILLS KKTVTTRKNY ISLFLLIIIT IPFSFYPYIG
TIFSPNNRIS EIINSSANNL NIKYIYWNWS QNIGVNGLPM HIIQTSVRKS VPHATDAEQL
RYLDEKRSLV STHIKNKTII YVLCESCWYD KKNFRENFKP VIDEGYSELR AISPVYGGGT
ANAEFEMLTG LPSNSDKISG IIYQEYSDAF KPKVDALPNA LKNKGYITFA AHNYQADFWY
RNKIYPKFGF DRFDSIINMG DLPPEYSSIK KPWQWQPDDY LLYNAALKAI QNAGNKDIFV
HLITMSTHGP YQHINDYGEG VYTYEINEAV KRFVQFSQQV EKIDPNAVIV FYGDHKPPLN
KYFVENGVLP SNIFNKIGEE KDEDFVFKMS TTPLDFGDVP VLIKSNDKDA IKKFITDANG
KPFFCVSSLV DKHFIQSGLV SFNHNAKHVC ENSDNFNYDK LINMTPSWIY SMSLFHD
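
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the sequence is read in frame from its first base, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ only in which codons may act as start codons). The names `translate`, `gc_percent`, and `clean` are hypothetical helpers introduced here.

```python
# Codon table built in the usual TCAG ordering. The amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGAGTATCG GTTTGAGCAA AGTCGTTACA AGATATAAAA ACTTAAAGTT GAAATCATAT"
last_line = "TTAATTAACA TGACCCCGTC GTGGATTTAT TCAATGTCAT TATTTCACGA TTAG"

print(translate(first_line))  # MSIGLSKVVTRYKNLKLKSY
print(translate(last_line))   # LINMTPSWIYSMSLFHD
```

The two printed fragments correspond to the first 20 and last 17 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field of the record.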