Gene EcSMS35_4937 details


Gene Information

Locus tag: EcSMS35_4937
Symbol: serB
ID: 6144234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5051782
End bp: 5052750
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 56%
IMG OID: 641619740
Product: phosphoserine phosphatase
Protein accession: YP_001746844
Protein GI: 170681720
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0560] Phosphoserine phosphatase
TIGRFAM ID: [TIGR00338] phosphoserine phosphatase SerB
            [TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like
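
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5051782, 5052750)
print(length)                  # 969
print(protein_length(length))  # 322
```

Both results agree with the Gene Length (969 bp) and Protein Length (322 aa) fields above.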


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.859759
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAACA TTACCTGGTG CGACCTGCCT GAAGATGTCT CTTTATGGCC TGGTCTGCCT 
CTTTCATTAA GTGGTGATGA AGTGATGCCA CTGGATTACC ACGCAGGTCG TAGCGGCTGG
CTGCTGTATG GTCGTGGGCT GGATAAACAA CGTCTGACCC AATACCAGAG CAAACTGGGC
GCGGCTATGG TGATTGTTGC CGCCTGGTGC GTGGAAGATT ATCAGGTGAT TCGTCTGGCA
GGTTCACTCA CCGCGCGGGC TACGCGCCTG GCCCACGAAG CGCAACTGGA TGTCGCGCCG
TTGGGGAAAA TCCCGCACCT GCGCTCTCCG GGTTTGCTGG TGATGGACAT GGATTCCACC
GCCATTCAGA TTGAATGTAT TGACGAAATT GCCAAACTGG CCGGAACCGG CGAGATGGTG
GCGGAGGTAA CCGAACGGGC GATGCGCGGC GAACTCGATT TTACCGCCAG CCTGCGCAGC
CGTGTGGCGA CGCTGAAAGG CGCTGACGCC AATATCCTGC AACAGGTGCG TGAAAATCTG
CCGCTGATGC CAGGCTTAAC GCAACTGGTG CTCAAGCTGG AAACGCTGGG CTGGAAAGTG
GCGATTGCTT CCGGCGGCTT TACTTTCTTT GCTGAATACC TGCGCGACAA GCTGCGCCTG
ACAGCCGTGG TAGCCAATGA ACTGGAGATC ATGGACGGTA AATTTACCGG CAATGTGATC
GGCGACATCG TAGACGCGCA GTACAAAGCG AAAACTCTGA CTCGCCTCGC GCAGGAGTAT
GAAATCCCGC TGGCGCAGAC CGTGGCGATT GGCGATGGAG CCAATGACCT GCCGATGATC
AAAGCGGCAG GGCTGGGGAT TGCCTACCAT GCCAAGCCAA AAGTGAATGA AAAAACGGAA
GTCACCATCC GTCACGCTGA CCTGATGGGG GTATTCTGCA TCCTCTCTGG CAGCCTGAAT
CAGAAGTAA
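
The GC content field can be checked against a sequence like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the blanks and line breaks in the listing are formatting only; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, copied with their spacing.
print(gc_percent("ATGCCTAACA TTACCTGGTG"))  # 45.0
```

Passing the full 969 bp gene sequence to the same function gives the gene-wide value to compare with the GC content field.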
 
Protein sequence
MPNITWCDLP EDVSLWPGLP LSLSGDEVMP LDYHAGRSGW LLYGRGLDKQ RLTQYQSKLG 
AAMVIVAAWC VEDYQVIRLA GSLTARATRL AHEAQLDVAP LGKIPHLRSP GLLVMDMDST
AIQIECIDEI AKLAGTGEMV AEVTERAMRG ELDFTASLRS RVATLKGADA NILQQVRENL
PLMPGLTQLV LKLETLGWKV AIASGGFTFF AEYLRDKLRL TAVVANELEI MDGKFTGNVI
GDIVDAQYKA KTLTRLAQEY EIPLAQTVAI GDGANDLPMI KAAGLGIAYH AKPKVNEKTE
VTIRHADLMG VFCILSGSLN QK
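
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, which this sketch does not model; `translate` is a hypothetical helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCCTAACA TTACCTGGTG CGACCTGCCT"))  # MPNITWCDLP
print(translate("CAGAAGTAA"))                         # QK
```

The two outputs match the first ten and last two residues of the protein sequence listed above.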