Gene EcSMS35_4386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4386 
Symbol 
ID6143196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4473823 
End bp4475373 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content41% 
IMG OID641619207 
Productserine/threonine protein phosphatase family protein 
Protein accessionYP_001746331 
Protein GI170680658 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACTG TATTTACGAC TTTATCAGTC ATATTCAGCG TGATGTTTTC TCACAGTATA 
CTAGCTCAGG ACGTTACTAT TTATTATACG AATGATATTC ACGCACATGT CAATCCGGCA
AAAATTCCCG CTGTTGATAA GAACAGACTT GTAGGTGGTA TGGCTAATAT TGCAGGCATC
GTTAATGAAG CGAAGAAAAA AAACAAAGAT GTCTTTTTCT TTGATGCAGG CGACTATTTT
ACCGGTCCGT ATATCAGTAC CCTGACCAAA GGTGAAGCTA TTATTGATAT CATGAACACC
ATGCCCTTTG ATGCGGTGTC CGTTGGTAAT CACGAGTTCG ATCACGGCGT GCCCAATATG
GTATCTCAGT TATCAAAAGC AAAATTCCCC ATTCTGCTGG GAAATATTTA CTACACCGAT
ACAAATAAAC CAGTTTGGGA TCATCCGTGG ACCATTATCG AAAAAGATGG TTTAAAGATT
GGTGTGATTG GATTGCATGG TGCATTCGCT TTCTATGATA CAGTCGCGGC AAAAGCACGT
GAAGGCGTAG AAGCCAGAGA CGAAATTAAA TATTTAAATA AAGCACTTGC AGAATTAAAA
GGTAAAGTCG ACATTACCGT TTTGCTAATT CACGAAGGTG TTCCGGCGCG TCAATCAAGC
TTTGGTAGTA AAGATGTGGA GCGGCTACTA CAGGCAGATA TTGAGACAGC TAAAAAAGTG
AATGGTGTTG ATGTCTTAAT TACCGGTCAC GCACATGTCG GAACACCGCA ACCAATAAAA
GTTAATAACA CATTAATTGT ATCTACCGAT GCATACGGCA CCGACATCGG AAAACTTGTG
CTTGATTACA ATCCGAAAAC AAAGAAAATT GATAGTTATA ATGGTGAGTT AATCACCATC
TTTGCAGATC AATTTAAACC TGACACCATC GTTCAAAATA CTATCGATAA ATGGAGCGTA
AAGCTTAACA AAATAACACA GGAAGTTGTC GGCCATTCTC CCGTAGTTTT AACACGTGAG
TATGGCAGTT CTTCTTCTAC CGGTAATCTT ATTCTGGATG CAATGATGGA AAAAACACCT
GATGCCATTG CCGGATTTCA AAATAGCGGT GGGATGCGAG CTGATTTTCC TAAAGGTGAT
ATCACACTGG GAGATGTTAT TAGCACATTC CCCTTTAATA ATGACCTCAT CGAGATGGAT
TTGACGGGCC GCGATCTCAA ATCGTTGATG ACGCATGCAA CCAATCTAAC TAACGGTGTG
TTACAGGTTT CAAAAAGCGT TGCGGTTGTC TATGACAGCA AAAAACCACT CAACCAACGG
TTAATCTCTT TCACCATTAA CGGCAAACCC GTGGAAGATA ATCAAACATA TCGTATTGCC
ACGCACTCCT TTTGTGCCAG TGGTGGTGAC GGTTTTGAAG CATTTTTGAA TGGAAAAAAT
GTGAAGACGA TACCGGGAAC AACCTCGGCG GAATCTATCA TCGATTATTT CAAAAATCAT
AAACCTGTCA CCCCAGACTT AACTAAACGA GTCATGGACG TCGCCAAATA A
 
Protein sequence
MRTVFTTLSV IFSVMFSHSI LAQDVTIYYT NDIHAHVNPA KIPAVDKNRL VGGMANIAGI 
VNEAKKKNKD VFFFDAGDYF TGPYISTLTK GEAIIDIMNT MPFDAVSVGN HEFDHGVPNM
VSQLSKAKFP ILLGNIYYTD TNKPVWDHPW TIIEKDGLKI GVIGLHGAFA FYDTVAAKAR
EGVEARDEIK YLNKALAELK GKVDITVLLI HEGVPARQSS FGSKDVERLL QADIETAKKV
NGVDVLITGH AHVGTPQPIK VNNTLIVSTD AYGTDIGKLV LDYNPKTKKI DSYNGELITI
FADQFKPDTI VQNTIDKWSV KLNKITQEVV GHSPVVLTRE YGSSSSTGNL ILDAMMEKTP
DAIAGFQNSG GMRADFPKGD ITLGDVISTF PFNNDLIEMD LTGRDLKSLM THATNLTNGV
LQVSKSVAVV YDSKKPLNQR LISFTINGKP VEDNQTYRIA THSFCASGGD GFEAFLNGKN
VKTIPGTTSA ESIIDYFKNH KPVTPDLTKR VMDVAK