Gene EcSMS35_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4388 
Symbol 
ID6142929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4476866 
End bp4478419 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content48% 
IMG OID641619209 
Productserine/threonine protein phosphatase family protein 
Protein accessionYP_001746333 
Protein GI170681676 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.693263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA AAATGCTTGC TGCGGGTATC GCATTAACAC TGCCGTTTTG GGCTTGCGCC 
AAAGATGTCA CCATCATTTA TACCAACGAC CTCCATGCAC ATGTAGAGCC TTATAAAGTG
CCGTGGATTG CTGACGGTAA ACGGGATATT GGCGGCTGGG CAAATATCAC TACGCTTGTT
AAACAAGAAA AAGCTAAAAA CAAAGCGACC TGGTTTTTTG ATGCGGGTGA CTATTTTACC
GGACCGTATA TCAGCAGCCT GACTAAAGGC AAGGCGATTA TCGATATTAT GAATACCATG
CCATTCGATG CGGTCACTAT AGGTAATCAT GAATTTGATC ACGGCTGGGA CAATACGTTA
TTACAGTTGA GTCAGGCAAA ATTCCCGATT GTGCAGGGTA ATGTTTTTTA TCAGAACAGC
AGTAAATCAT TCTGGGATAA GCCCTATACC ATCATTGAAA AAGACGGCGT GAAAATTGGC
GTGATTGGTT TGCACGGTGT ATTTGCCTTT AATGATACGG TATCTGCGGC AACGCGAGTG
GGTATTGAGG CGCGTGATGA AATTAAATGG CTACAACGTT ATATCGATGA ACTCAAAGGC
AAGGTTGATC TAACCGTCGC CCTGATCCAC GAAGGTGTTC CGGCCCGCCA GTCCAGTATG
GGAGGCACGG ATGTGCGTCG CGCACTGGAT AAAGATATTC AGACGGCAAG TCAGGTGAAA
GGGTTGGATA TTTTGATCAC CGGGCATGCA CATGTGGGTA CGCCGGAACC GATTAAAGTC
GGCAATACGT TAATCCTCTC CACTGACAGC GGCGGGATTG ATGTCGGTAA ACTGGTTCTC
GACTACAAAG AGAAGCCGCA CACTTTTACG GTGAAAAACT TCGAGCTTAA AACCCTTTAC
GCCGATGAGT GGAAGCCCGA TCCGCAAACG AAACAGGTGA TTGATAGTTG GAACAAAAAG
CTGGATGAAG TCGTGCAACA AACGGTGGCG CAATCGCCGG TTGAACTAAA ACGTGCCTAT
GGTGAATCAG CTTCTCTCGG GAACCTGGCG GCAGACGCTT TGCTGGCAGC GGCGGGTAAA
AATACCCAGT TGGCGTTAAC CAACTCTGGC GGGATTCGCA ATGAGATCCC GGCGGGTGCA
ATTACGATGG GTGGCGTCAT CAGTACCTTC CCGTTCCCCA ACGAACTGGT GACGATGGAT
CTCACGGGTA AACAATTACG CAGTTTGATG GAACACGGCG CAAGTTTGAG TAATGGCGTT
TTACAGGTAT CGAAAGGCCT GGAAATGAAG TACGACAGCA GTAAGCCGGT TGGTCAGCGG
GTAATCACGC TGACTCTGAA TGGCAAACCC ATTGAAGATG CGACGATTTA CCACATTGCT
ACTCAGAGTT TCCTTGCTGA TGGTGGAGAT GGTTTTACCG CCTTTACCGA AGGGAAAGCG
CGTAACACAA CGGGCGGTTA TTACGTTTAT CACGCCGTGG TTGATTACTT CAAAGCGGGT
AACACCATCA CGGATGAACA GATCAACGGT ATGCGCGTGA AAGATATCAA GTAA
 
Protein sequence
MKIKMLAAGI ALTLPFWACA KDVTIIYTND LHAHVEPYKV PWIADGKRDI GGWANITTLV 
KQEKAKNKAT WFFDAGDYFT GPYISSLTKG KAIIDIMNTM PFDAVTIGNH EFDHGWDNTL
LQLSQAKFPI VQGNVFYQNS SKSFWDKPYT IIEKDGVKIG VIGLHGVFAF NDTVSAATRV
GIEARDEIKW LQRYIDELKG KVDLTVALIH EGVPARQSSM GGTDVRRALD KDIQTASQVK
GLDILITGHA HVGTPEPIKV GNTLILSTDS GGIDVGKLVL DYKEKPHTFT VKNFELKTLY
ADEWKPDPQT KQVIDSWNKK LDEVVQQTVA QSPVELKRAY GESASLGNLA ADALLAAAGK
NTQLALTNSG GIRNEIPAGA ITMGGVISTF PFPNELVTMD LTGKQLRSLM EHGASLSNGV
LQVSKGLEMK YDSSKPVGQR VITLTLNGKP IEDATIYHIA TQSFLADGGD GFTAFTEGKA
RNTTGGYYVY HAVVDYFKAG NTITDEQING MRVKDIK