Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2338 |
Symbol | |
ID | 6142763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2370036 |
End bp | 2371796 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617212 |
Product | sulfatase family protein |
Protein accession | YP_001744385 |
Protein GI | 170681591 |
COG category | [R] General function prediction only |
COG ID | [COG3083] Predicted hydrolase of alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0980627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0832613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC TGGTTTGCAC TGTTCAATAT TCTGCTTTCG CTCGTCATTG GCAGCCGTTA CCTGTTTATC GCCGACTGGC CGACAACGCT TGCTGGTCGC ATTTATTCCT ACGTAAGCAT TATCGGTCAT TTCAGCTTCC TGGTGTTCGC CACCTACTTG CTGATCCTCT TCCCGCTGAC CTTTATCGTC GGCTCCCAGA GGCTGATGAG GTTTTTGTCC GTCATTCTGG CAACGGCGGG AATGACGCTA TTACTGATCG ATAGCGAAGT CTTTACTCGT TTCCATCTCC ATCTTAATCC CATCGTCTGG CAACTGGTTA TCAACCCAGA CGAAAATGAG ATGGCGCGCG ACTGGCAGCT GATGTTCATC AGCGTGCCGG TTATTTTATT GCTTGAACTG GTGTTTGCGA CGTGGAGCTG GCAAAAGCTG CGCAGCCTGA CGCGTCGTCG ACGCTTCGCG CGCCCGCTGG CCGCATTCTT ATTTATCGCC TTTATCGCCT CGCATGTGGT GTATATCTGG GCCGATGCCA ACTTCTATCG CCCGATCACC ATGCAACGCG CTAACCTGCC GCTTTCGTAC CCGATGACGG CGCGACGTTT TCTTGAGAAG CATGGTCTGC TTGATGCGCA GGAGTATCAA CGCCGTCTCA TTGAGCAAGG TAATCCTGAC GCCGTTTCCG TTCAGTATCC GTTAAGCGAA CTGCGCTATC GCGATATGGG CACCGGGCAG AATGTGCTGT TGATTACTGT CGATGGCCTG AACTACTCAC GCTTCGAGAA GCAGATGCCT GCGCTGGCAG GTTTTGCTGA GCAAAATATT TCGTTCACGC GCCATATGAG CTCCGGCAAC ACTACAGACA ACGGCATCTT TGGCCTGTTC TATGGCATCT CGCCGAGCTA TATGGATGGC ATTCTGTCGA CCCGTACGCC TGCGGCATTA ATTACTGCGC TTAATCAACA AGGCTATCAG CTGGGATTAT TCTCATCAGA TGGCTTTACC AGCCCGCTGT ATCGCCAGGC ATTGTTGTCA GATTTCTCGA TGCCGAGCGT ACGCACCCAA TCCGACGAGC AGACCGCCAC GCAGTGGATC AACTGGCTGG GCCGCTACGC ACAAGAAGAT AACCGCTGGT TCTCGTGGGT TTCTTTCAAT GGCACTAACA TTGACGACAG CAATCAGCAG GCATTTGCAC GGAAATATAG CCGGGCGGCA GGCAATGTCG ACGACCAGAT CAACCGCGTG CTCAATGCAC TGCGTGATTC TGGCAAACTG GACAATACGG TGGTGATTAT CACTGCCGGT CGGGGTATTC CGCTGAGCGA AGAGGAAGAA ACCTTTGACT GGTCCCACGG TCATCTGCAG GTGCCATTAG TGATTCACTG GCCAGGCACG CCGGCGCAGC GTATTAATGC TCTGACTGAT CATACCGATC TGATGACGAC GCTGATGCAA CGCCTGCTAC ATGTCAGCAC ACCTGCCAGC GAATATTCGC AAGGTCAGGA TTTGTTCAAC CCTCAACGCC GTCATTACTG GGTTACCGCC GCGGATAACG ATACGCTGGC AATTACCACC CCGAAAAAGA CGCTGGTGCT GAACAATAAC GGTAAATACC GTACTTACAA CTTACGTGGT GAAAGAGTGA AAGATGAAAA ACCACAGTTA AGTTTGTTAT TGCAAGTACT GACAGACGAG AAGCGTTTTA TCGCTAACTG A
|
Protein sequence | MVTHRQRYRE KVSQMVSWGH WFALFNILLS LVIGSRYLFI ADWPTTLAGR IYSYVSIIGH FSFLVFATYL LILFPLTFIV GSQRLMRFLS VILATAGMTL LLIDSEVFTR FHLHLNPIVW QLVINPDENE MARDWQLMFI SVPVILLLEL VFATWSWQKL RSLTRRRRFA RPLAAFLFIA FIASHVVYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLIEQGNPD AVSVQYPLSE LRYRDMGTGQ NVLLITVDGL NYSRFEKQMP ALAGFAEQNI SFTRHMSSGN TTDNGIFGLF YGISPSYMDG ILSTRTPAAL ITALNQQGYQ LGLFSSDGFT SPLYRQALLS DFSMPSVRTQ SDEQTATQWI NWLGRYAQED NRWFSWVSFN GTNIDDSNQQ AFARKYSRAA GNVDDQINRV LNALRDSGKL DNTVVIITAG RGIPLSEEEE TFDWSHGHLQ VPLVIHWPGT PAQRINALTD HTDLMTTLMQ RLLHVSTPAS EYSQGQDLFN PQRRHYWVTA ADNDTLAITT PKKTLVLNNN GKYRTYNLRG ERVKDEKPQL SLLLQVLTDE KRFIAN
|
| |