Gene EcSMS35_4385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4385 
Symbol 
ID6146414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4472163 
End bp4473782 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content44% 
IMG OID641619206 
Productserine/threonine protein phosphatase family protein 
Protein accessionYP_001746330 
Protein GI170683900 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.970271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAG TGCAGCGTAC TTTACTTGCC GCAACATTGC TGACTATTTT TTCGGTCTCA 
GTCATGGCGC AGGATGTCAC GATTATTTAT ACAAATGATC TTCACGCTCA TGTGGATTCT
TATAAAGTGC CGTATATCGC AGACGGAAAA CGCGACATTG GCGGCTTTGC CAATATATCT
ACTTTAGTAA AACAGGAGAA AGCGAAAAAT AAAGCCACCT TTTATTTTGA TGCTGGAGAC
TATTTCACCG GCCCTTATAT CAGCAGTCTG ACAAAGGGCG AAGCAATAAT TGACATTATG
AATACCATGC CTTTTGACGC AGTATCAATC GGTAATCATG AGTTTGACCA TGGCTGGGAT
AATGCCCTGC GACAATTAAG CAAAGCGAAT TTTCCTGTTT TATTGGGCAA TGTTTATCAT
AAAGAGAGTG AGAACCCCTT CTGGAACAAG CCCTACACCA TCCTGGAAAA AGATGGCATC
AAGATTGGTA TTATTGGGCT ACACGGCGTA TTTGCATTTA ATGATACGGT CTCGGAGCTC
TCTCTCCAGG GACTCGATAA TGACAATAAT AACCGCTTTG ATAAATCTTC CGAGACCCTT
AAGAATCAGG GGATTGAAGC ACGTGATGAA GTGAAGTATT TGCAGCATTA TATCGACGAG
TTACGTGATA AGGTTGACCT CACTGTGGCG CTGGTTCATG AAGGCGTTCC GGCGCGTCAG
TCCAGCATTG GCAATACAGA TGTTAGACGC GCTCTTGATA AAGATATTCA AACCGCTAGC
AAGGTGAAAG GATTGGATAT TCTGATTACC GGGCATGCCC ACGTCGGAAC GCCAGAACCA
ATCAAAGTCG GCAACACCTT AATCCTTTCA ACTGATAGTG GAGGTATCGA TATTGGTAAA
TTAGTCCTTG ATGTCAACCC TACAGCTCGT ACCCATAAAA TGAAGAGCTT TGAGCTGAAA
ACAGTTTATG CCGATGAATG GATACCCGAT CCAACGACAC AAAAAGTTAT CAATGGCTGG
AATAAAAAAC TGGCGGATAT TGTACGCCAA CCAGTGGGTG AATCATCGAT TGCATTAACT
CGCGCCTATG GTGAGTCATC GCAACTAGGC AACCTGTTTA CCGATGCAAT GCTTGTCGCT
GCTCCGACTG CACAGATTGC ATTGATAAAC TCCGGCAGCT TACGTGCGGA TATAAATGCA
GGCACGATTA CCTTTGGCGA CATTACCAGT ACATTCCCCT TCAAGAATGA ACTTACTGAA
ATGGATCTCA GCGGTAAGGA TCTGCGTAAC CTACTGGAAC ACGGGGCATC GCTTACCAAT
GGCATCCTGC AAATGTCAAA AGGCGCAGAA ATGCGCTATA CCCCGCAGAA ACCAGTAGGT
CAGCGCATAG TATCTTTTAA AATTAATGGT GAAGAGATCG TTGATACGAA TATTTACCAT
GTTGCAACAA CGACTTTCCT TGCACTGGGG GGCGATGGTT TTCTGGCATT CAAAGAAGGG
AAAAATGTTC AGGTCCGCGC CGGAAATAAC ATGTCCGATG TCGTGATCGA TTATTTGAAG
AAAGGCCACA AGATTACGCC TGCACAGGTA AATGAAATGC GGGTGGACGT TAGCAAATAA
 
Protein sequence
MKTVQRTLLA ATLLTIFSVS VMAQDVTIIY TNDLHAHVDS YKVPYIADGK RDIGGFANIS 
TLVKQEKAKN KATFYFDAGD YFTGPYISSL TKGEAIIDIM NTMPFDAVSI GNHEFDHGWD
NALRQLSKAN FPVLLGNVYH KESENPFWNK PYTILEKDGI KIGIIGLHGV FAFNDTVSEL
SLQGLDNDNN NRFDKSSETL KNQGIEARDE VKYLQHYIDE LRDKVDLTVA LVHEGVPARQ
SSIGNTDVRR ALDKDIQTAS KVKGLDILIT GHAHVGTPEP IKVGNTLILS TDSGGIDIGK
LVLDVNPTAR THKMKSFELK TVYADEWIPD PTTQKVINGW NKKLADIVRQ PVGESSIALT
RAYGESSQLG NLFTDAMLVA APTAQIALIN SGSLRADINA GTITFGDITS TFPFKNELTE
MDLSGKDLRN LLEHGASLTN GILQMSKGAE MRYTPQKPVG QRIVSFKING EEIVDTNIYH
VATTTFLALG GDGFLAFKEG KNVQVRAGNN MSDVVIDYLK KGHKITPAQV NEMRVDVSK