Gene Information |
Locus tag | EcSMS35_1675 |
Symbol | |
ID | 6144597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1669676 |
End bp | 1671295 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641616551 |
Product | sulfatase |
Protein accession | YP_001743729 |
Protein GI | 170679774 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
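
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption, though it agrees with the listed Gene Length) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1669676, 1671295)
print(length)                  # 1620
print(protein_length(length))  # 539
```

Both values match the Gene Length (1620 bp) and Protein Length (539 aa) rows of this record.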
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCAT TTGCTGCTCA TGCGGCAGAT GATGTAAAGC TGAAAGCAAC CAAAACAAAC GTTGCTTTCT CAGACTTTAC GCCGACTGAA TACAGTACCA AAGGAAAGCC AAATATTATC GTACTGACCA TGGATGATCT TGGTTATGGA CAACTTCCTT TTGATAAGGG ATCTTTTGAC CCAAAAACAA TGGAAAATCG TGAAGTTGTC GATACCTACA AAATAGGGAT AGATAAAGCC ATTGAAGCTG CGCAAAAATC AACGCCGACG CTCCTTTCAT TAATGGATGA AGGCGTGCGT TTTACTAACG GCTATGTGGC GCACGGTGTT TCCGGCCCCT CCCGCGCTGC AATAATGACC GGTCGAGCGC CCGCCCGCTT TGGTGTCTAT TCCAATACCG ATGCTCAGGA TGGTATTCCG CTAACAGAAA CTTTCTTGCC TGAATTATTC CAGAATCATG GTTATTACAC TGCAGCAGTA GGTAAATGGC ACTTGTCAAA AATCAGTAAT GTGCCGGTAC CGGAAGATAA ACAAACGCGT GACTATCATG ACAACTTCAC CACATTTTCT GCGGAAGAAT GGCAACCTCA AAACCGTGGC TTTGATTACT TTATGGGATT CCACGCTGCA GGAACGGCAT ATTACAACTC CCCTTCACTG TTCAAAAACC GTGAACGCGT CCCCGCAAAA GGTTATATCA GCGATCAGTT AACCGATGAG GCAATTGGCG TTGTTGATCG TGCCAAAACA CTTGACCAGC CTTTTATGCT TTACCTGGCT TATAATGCTC CGCACCTGCC AAATGATAAT CCTGCACCGG AGCAATATCA GAAGCAATTT AATACCGGTA GTCAAACAGC AGATAACTAC TACGCTTCCG TTTATTCTGT TGATCAGGGT GTAAAACGCA TTCTCGAACA ACTGAAGAAA AACGGACAGT ATGACAATAC TATTATTCTC TTTACCTCCG ATAATGGTGC GGTTATCGAT GGTCCTCTGC CGCTGAACGG GGCGCAAAAA GGCTATAAGA GTCAGACCTA TCCTGGCGGT ACTCACACCC CAATGTTTAT GTGGTGGAAA GGAAAACTTC AACCCGGTAA TTATGACAAG CTGATTTCCG CAATGGATTT CTACCCGACA GCTCTTGATG CAGCCGATAT CAGCATTCCA AAAGACCTTA AGCTGGATGG CGTTTCCTTG CTGCCGTGGT TGCAAGATAA GAAACAAGGC GAGCCACATA AAAATCTGAC CTGGATAACC TCTTATTCTC ACTGGTTTGA CGAGGAAAAT ATTCCATTCT GGGATAATTA CCACAAATTT GTCCGCCATC AGTCAGACGA TTACCCGCAT AACCCCAACA CCGAGGATTT AAGCCAATTC TCTTATACGG TAAGAAATAA CGATTATTCG CTTGTCTATA CAGTAGAAAA CAATCAGTTA GGTCTATACA AACTGACGGA TCTACAGCAA AAAGATAACC TTGCCGCCGC CAATCCGCAG GTCGTTAAAG AGATGCAAGG CGTGGTAAGA GAGTTTATCG ACAGCAGCCA ACCACCACTT AGCGAGGTAA ATCAGGAGAA GTTTAACAAT ATCAAGAAAG CACTAAGCGA AGCGAAATAA
|
Protein sequence | MAAFAAHAAD DVKLKATKTN VAFSDFTPTE YSTKGKPNII VLTMDDLGYG QLPFDKGSFD PKTMENREVV DTYKIGIDKA IEAAQKSTPT LLSLMDEGVR FTNGYVAHGV SGPSRAAIMT GRAPARFGVY SNTDAQDGIP LTETFLPELF QNHGYYTAAV GKWHLSKISN VPVPEDKQTR DYHDNFTTFS AEEWQPQNRG FDYFMGFHAA GTAYYNSPSL FKNRERVPAK GYISDQLTDE AIGVVDRAKT LDQPFMLYLA YNAPHLPNDN PAPEQYQKQF NTGSQTADNY YASVYSVDQG VKRILEQLKK NGQYDNTIIL FTSDNGAVID GPLPLNGAQK GYKSQTYPGG THTPMFMWWK GKLQPGNYDK LISAMDFYPT ALDAADISIP KDLKLDGVSL LPWLQDKKQG EPHKNLTWIT SYSHWFDEEN IPFWDNYHKF VRHQSDDYPH NPNTEDLSQF SYTVRNNDYS LVYTVENNQL GLYKLTDLQQ KDNLAAANPQ VVKEMQGVVR EFIDSSQPPL SEVNQEKFNN IKKALSEAK
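
The sequences above are displayed in space-separated groups of 10 characters, so the spacing has to be removed before they are used elsewhere. The following is a minimal sketch of that clean-up plus a simple GC percentage calculation, of the kind that could be used to recompute the GC content field. The helper names `ungroup` and `gc_percent` are hypothetical, and the example input is only the first two groups of the gene sequence, not the full gene.

```python
def ungroup(seq: str) -> str:
    """Remove the display spacing (and any line breaks) from a sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = ungroup(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# First two 10-base groups of the gene sequence, for illustration only.
example = "ATGGCTGCAT TTGCTGCTCA"
print(len(ungroup(example)))  # 20
print(gc_percent(example))    # 50.0
```

Applying `ungroup` to the full Gene sequence and Protein sequence values should give strings whose lengths can be compared against the Gene Length and Protein Length rows.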