Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4386 |
Symbol | |
ID | 6143196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4473823 |
End bp | 4475373 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641619207 |
Product | serine/threonine protein phosphatase family protein |
Protein accession | YP_001746331 |
Protein GI | 170680658 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACTG TATTTACGAC TTTATCAGTC ATATTCAGCG TGATGTTTTC TCACAGTATA CTAGCTCAGG ACGTTACTAT TTATTATACG AATGATATTC ACGCACATGT CAATCCGGCA AAAATTCCCG CTGTTGATAA GAACAGACTT GTAGGTGGTA TGGCTAATAT TGCAGGCATC GTTAATGAAG CGAAGAAAAA AAACAAAGAT GTCTTTTTCT TTGATGCAGG CGACTATTTT ACCGGTCCGT ATATCAGTAC CCTGACCAAA GGTGAAGCTA TTATTGATAT CATGAACACC ATGCCCTTTG ATGCGGTGTC CGTTGGTAAT CACGAGTTCG ATCACGGCGT GCCCAATATG GTATCTCAGT TATCAAAAGC AAAATTCCCC ATTCTGCTGG GAAATATTTA CTACACCGAT ACAAATAAAC CAGTTTGGGA TCATCCGTGG ACCATTATCG AAAAAGATGG TTTAAAGATT GGTGTGATTG GATTGCATGG TGCATTCGCT TTCTATGATA CAGTCGCGGC AAAAGCACGT GAAGGCGTAG AAGCCAGAGA CGAAATTAAA TATTTAAATA AAGCACTTGC AGAATTAAAA GGTAAAGTCG ACATTACCGT TTTGCTAATT CACGAAGGTG TTCCGGCGCG TCAATCAAGC TTTGGTAGTA AAGATGTGGA GCGGCTACTA CAGGCAGATA TTGAGACAGC TAAAAAAGTG AATGGTGTTG ATGTCTTAAT TACCGGTCAC GCACATGTCG GAACACCGCA ACCAATAAAA GTTAATAACA CATTAATTGT ATCTACCGAT GCATACGGCA CCGACATCGG AAAACTTGTG CTTGATTACA ATCCGAAAAC AAAGAAAATT GATAGTTATA ATGGTGAGTT AATCACCATC TTTGCAGATC AATTTAAACC TGACACCATC GTTCAAAATA CTATCGATAA ATGGAGCGTA AAGCTTAACA AAATAACACA GGAAGTTGTC GGCCATTCTC CCGTAGTTTT AACACGTGAG TATGGCAGTT CTTCTTCTAC CGGTAATCTT ATTCTGGATG CAATGATGGA AAAAACACCT GATGCCATTG CCGGATTTCA AAATAGCGGT GGGATGCGAG CTGATTTTCC TAAAGGTGAT ATCACACTGG GAGATGTTAT TAGCACATTC CCCTTTAATA ATGACCTCAT CGAGATGGAT TTGACGGGCC GCGATCTCAA ATCGTTGATG ACGCATGCAA CCAATCTAAC TAACGGTGTG TTACAGGTTT CAAAAAGCGT TGCGGTTGTC TATGACAGCA AAAAACCACT CAACCAACGG TTAATCTCTT TCACCATTAA CGGCAAACCC GTGGAAGATA ATCAAACATA TCGTATTGCC ACGCACTCCT TTTGTGCCAG TGGTGGTGAC GGTTTTGAAG CATTTTTGAA TGGAAAAAAT GTGAAGACGA TACCGGGAAC AACCTCGGCG GAATCTATCA TCGATTATTT CAAAAATCAT AAACCTGTCA CCCCAGACTT AACTAAACGA GTCATGGACG TCGCCAAATA A
|
Protein sequence | MRTVFTTLSV IFSVMFSHSI LAQDVTIYYT NDIHAHVNPA KIPAVDKNRL VGGMANIAGI VNEAKKKNKD VFFFDAGDYF TGPYISTLTK GEAIIDIMNT MPFDAVSVGN HEFDHGVPNM VSQLSKAKFP ILLGNIYYTD TNKPVWDHPW TIIEKDGLKI GVIGLHGAFA FYDTVAAKAR EGVEARDEIK YLNKALAELK GKVDITVLLI HEGVPARQSS FGSKDVERLL QADIETAKKV NGVDVLITGH AHVGTPQPIK VNNTLIVSTD AYGTDIGKLV LDYNPKTKKI DSYNGELITI FADQFKPDTI VQNTIDKWSV KLNKITQEVV GHSPVVLTRE YGSSSSTGNL ILDAMMEKTP DAIAGFQNSG GMRADFPKGD ITLGDVISTF PFNNDLIEMD LTGRDLKSLM THATNLTNGV LQVSKSVAVV YDSKKPLNQR LISFTINGKP VEDNQTYRIA THSFCASGGD GFEAFLNGKN VKTIPGTTSA ESIIDYFKNH KPVTPDLTKR VMDVAK
|
| |