Gene Information |
Locus tag | EcSMS35_1884 |
Symbol | |
ID | 6147190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1904356 |
End bp | 1906149 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641616759 |
Product | sulfatase family protein |
Protein accession | YP_001743937 |
Protein GI | 170683822 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
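
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated into an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record above.
gene_length, protein_length = cds_lengths(1904356, 1906149)
print(gene_length, protein_length)  # 1794 597
```

Under these assumptions the result agrees with the Gene Length (1794 bp) and Protein Length (597 aa) fields.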
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00298505 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTATCG GTTTGAGCAA AGTCGTTACA AGATATAAAA ACTTAAAGTT GAAATCATAT GTTTTCGACA TTGTCAACGT TATGTTTTTA TTGTACTTAT TTTGTTTTCT AAATCAGTCA ATAGTTATCA TAGGAAATAT TGAACCGCCT AAAGAGTTAA TTGGTGCAAT ATTGAATTTC ATCTGGCGTT CATCCCCCAT TTTGTTTATT TATTTGTTAT TTAGAATAAT AAACATTCCA ATTTTTCTAT GTTCTTATTT ATCAGGCATC GCAATTATAA CATTGATGTT TGTGAACAGT ACTAAAATGG CGTTAACCGG TGAGCCTTTA TCATTTAATG ACATTGTTTC TGGATTCAAT TTGAGTGTCG CAGGAAAATA CCTCAACTTA TCATCCATTA CTTACTCTGT CGCTATTCTA ATTGGCGCAA TAATCTCAAT CCTACTTAGC AAAAAAACGG TCACAACAAG AAAAAATTAT ATATCGTTAT TTCTTTTAAT TATCATTACG ATCCCCTTTT CATTCTACCC CTATATTGGA ACCATTTTTA GCCCTAACAA TAGGATTTCT GAAATCATAA ACTCCAGCGC GAACAACTTA AATATAAAAT ATATTTACTG GAATTGGTCT CAAAATATTG GCGTTAATGG CTTGCCAATG CATATAATAC AAACAAGTGT CAGAAAATCT GTACCACATG CAACTGATGC AGAACAATTA AGATATTTAG ATGAAAAAAG ATCTTTAGTA TCCACTCATA TAAAAAATAA AACCATTATT TATGTTTTAT GTGAATCATG TTGGTATGAT AAAAAAAACT TCCGAGAAAA TTTCAAACCT GTTATTGATG AAGGTTACAG TGAATTACGG GCCATCTCAC CAGTTTACGG TGGAGGTACC GCAAATGCCG AATTTGAAAT GCTGACAGGA CTACCAAGTA ATTCAGATAA AATATCAGGA ATCATCTATC AAGAATATTC AGATGCATTC AAACCAAAAG TTGACGCATT ACCTAATGCC CTGAAAAACA AAGGCTATAT TACTTTCGCA GCACATAATT ACCAAGCAGA CTTTTGGTAT AGGAATAAAA TTTACCCTAA ATTTGGATTT GATAGATTTG ATAGCATTAT TAATATGGGA GATTTACCAC CAGAGTATAG CAGTATTAAA AAACCATGGC AGTGGCAACC TGATGACTAC CTGCTTTACA ATGCAGCATT AAAAGCCATC CAAAATGCAG GAAACAAAGA TATTTTTGTA CATTTGATCA CTATGTCAAC TCATGGTCCT TATCAACACA TTAATGATTA TGGTGAAGGT GTTTATACCT ATGAGATTAA TGAGGCAGTC AAAAGATTCG TTCAATTTTC ACAGCAAGTA GAAAAAATAG ATCCAAATGC TGTAATCGTC TTTTACGGTG ACCATAAACC ACCACTAAAT AAATATTTCG TCGAAAATGG GGTGCTACCA AGTAATATTT TCAACAAAAT CGGAGAAGAA AAAGATGAAG ATTTCGTCTT TAAAATGAGC ACAACCCCAC TCGATTTTGG TGATGTTCCT GTACTTATTA AATCAAATGA TAAAGACGCC ATTAAAAAAT TCATAACCGA CGCCAATGGA AAACCTTTTT TCTGTGTATC TTCATTAGTT GATAAACATT TCATTCAATC TGGATTGGTA TCATTTAATC ACAACGCAAA GCATGTCTGT GAAAACAGTG ATAACTTTAA TTATGATAAG TTAATTAACA TGACCCCGTC GTGGATTTAT TCAATGTCAT TATTTCACGA TTAG
|
Protein sequence | MSIGLSKVVT RYKNLKLKSY VFDIVNVMFL LYLFCFLNQS IVIIGNIEPP KELIGAILNF IWRSSPILFI YLLFRIINIP IFLCSYLSGI AIITLMFVNS TKMALTGEPL SFNDIVSGFN LSVAGKYLNL SSITYSVAIL IGAIISILLS KKTVTTRKNY ISLFLLIIIT IPFSFYPYIG TIFSPNNRIS EIINSSANNL NIKYIYWNWS QNIGVNGLPM HIIQTSVRKS VPHATDAEQL RYLDEKRSLV STHIKNKTII YVLCESCWYD KKNFRENFKP VIDEGYSELR AISPVYGGGT ANAEFEMLTG LPSNSDKISG IIYQEYSDAF KPKVDALPNA LKNKGYITFA AHNYQADFWY RNKIYPKFGF DRFDSIINMG DLPPEYSSIK KPWQWQPDDY LLYNAALKAI QNAGNKDIFV HLITMSTHGP YQHINDYGEG VYTYEINEAV KRFVQFSQQV EKIDPNAVIV FYGDHKPPLN KYFVENGVLP SNIFNKIGEE KDEDFVFKMS TTPLDFGDVP VLIKSNDKDA IKKFITDANG KPFFCVSSLV DKHFIQSGLV SFNHNAKHVC ENSDNFNYDK LINMTPSWIY SMSLFHD
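
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below strips the spacing, computes a GC fraction, and translates codons using the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it allows). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases and last 9 bases of the gene sequence are used here for illustration, so the GC value printed is for that fragment, not for the whole gene.

```python
# Amino-acid assignments in TCAG codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final nine bases of the gene sequence.
first_30 = "ATGAGTATCG GTTTGAGCAA AGTCGTTACA"
last_9 = "CACGA TTAG"

print(translate(first_30))    # MSIGLSKVVT
print(translate(last_9))      # HD*
print(gc_fraction(first_30))  # 0.4
```

The translated fragments match the start (MSIGLSKVVT) and end (HD) of the protein sequence in the record, with the final TAG read as the stop codon.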