Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4754 |
Symbol | |
ID | 6146721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4853175 |
End bp | 4854674 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641619568 |
Product | sulfatase family protein |
Protein accession | YP_001746675 |
Protein GI | 170681302 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCA GAAAACTGTT TTGTCCGGGA AACACACCCC GGATTTTATT GTTTTTATTC TTTTTTGTTG TTTCTGCAAT AACCACAATT GCATGCGGAT ACACTGAGAA GAATGCCACA GGAAATGTGC TGCTTCTGTT TCTCCTTCTT CTCCTTGCAC ACAGAAATAC CCTCACATCC ATTACAGCGC TGTTATTTCT GTTCTGTTGT GCACTGTATG CGCCTGCGGG TATGACGTAC GGTAAAATCA ACAACAGTTT TATTGTCGCG TTGCTGCAGA CCACGGCTGA TGAGGCTGCA GAGTTTACCG GGATGATTCC TGTTTATCAT TTTCTGGTCA GTGCCGCGAT TCTGGTATTC ATGGTGATTT TCTGGCGGAC ACACCACCGC GGTCACCGTA ACTGGCTGGC ACTGCTGCTA TTCGTATTAT GCTCTGTAAA CAGCTGGCCG TTGCGGATGG TTAAAGGAAC TGTTGTGGGG ACAACTGACA CATTGCGTGA AATGCAGCGT TATAAACAAC TGAGTCAGCA CGGGGCTGAT AACTGGAAAA TCCTGCCGGG TGTGCCGTTG TATGACACGA TTGTTATCGT TACTGGTGAG AGTGTGCGCA GGGATTATAT GTCGGTATAT GGCTATCCCG TGCCAACCAC GCCGTGGCTG AATACAGCTC CCGGTTTATT TATTGACGGC TATACATCGG CAGCAGCCAG TACCGTACCT TCCCTGAGCC GGACACTGAT TTATGACTAT GAGCAGAACC CTGATTCCGG CAACAACGTG GTGGCGCTGG CAGCAAAAGC AGGATACAGC ACATGGTGGA TATCCAATCA GGGAAAACTG GGAGAGCATG ACACACGCAT CTCTGTTATT GCTTCTGATG CGGAGCATAC CGTTTTCCTC AAGAAAGGCA GCTTCGCTTC CCGTAAAACG GATGACATGT TGTTGTTACA GGAAACAGAA CGTGCGCTGG CGGATAAATC CTCGCCGAAG GTGATTTTCC TGCACATGAT TGGCTCTCAT CCGAATCCGT GTGACCGACT TAACTCCTGG CCGAATTATT ACCTGGAGCA GTATCCCCGA AAGATTGCCT GTTACCTCGC CAGCATCAGT AAACTGGATA ACTTTCTCGG TCAGCTTGAT GGTATCCTTC GCCGGCATTC CCGTCACTTT GCAATGCTTT ACTTTTCTGA CCATGGGCTG TCGGTCAGCG ACAGTGCTAA TCCTGTTCAT CATGATGGTC ATGTGCAGGG GGGCTACAGC GTTCCCCTGA TTATTACCGC CAGTGACATC ACGTCTCATC AGTCCGTCAG CAGGAAAATC AGTGCCCGTA ATTTCGCAGG CATTTTTCAG TGGCTGACCG GTATTCGTAC CGAAAATATA ACGCCATTCA ATCCGCTGAC AGACGAAGAT AATGAACCCG TTATGGTTTT TAACGGAGAG AAAAATGTGC CGGCAGACAG TCTGAAACCG CAGCCACTTA TTCTTCCGGA CCACAGGTAA
|
Protein sequence | MNIRKLFCPG NTPRILLFLF FFVVSAITTI ACGYTEKNAT GNVLLLFLLL LLAHRNTLTS ITALLFLFCC ALYAPAGMTY GKINNSFIVA LLQTTADEAA EFTGMIPVYH FLVSAAILVF MVIFWRTHHR GHRNWLALLL FVLCSVNSWP LRMVKGTVVG TTDTLREMQR YKQLSQHGAD NWKILPGVPL YDTIVIVTGE SVRRDYMSVY GYPVPTTPWL NTAPGLFIDG YTSAAASTVP SLSRTLIYDY EQNPDSGNNV VALAAKAGYS TWWISNQGKL GEHDTRISVI ASDAEHTVFL KKGSFASRKT DDMLLLQETE RALADKSSPK VIFLHMIGSH PNPCDRLNSW PNYYLEQYPR KIACYLASIS KLDNFLGQLD GILRRHSRHF AMLYFSDHGL SVSDSANPVH HDGHVQGGYS VPLIITASDI TSHQSVSRKI SARNFAGIFQ WLTGIRTENI TPFNPLTDED NEPVMVFNGE KNVPADSLKP QPLILPDHR
|
| |