Gene ECH74115_5902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5902 
SymbolserB 
ID6970436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5555318 
End bp5556286 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content57% 
IMG OID643389517 
Productphosphoserine phosphatase 
Protein accessionYP_002273908 
Protein GI209398522 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0560] Phosphoserine phosphatase 
TIGRFAM ID[TIGR00338] phosphoserine phosphatase SerB
[TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAACA TTACCTGGTG CGACCTGCCT GAAGATGTCT CTTTATGGCC GGGTCTGCCT 
CTTTCATTAA GTGGTGATGA AGTGATGCCA CTGGATTACC ACGCAGGTCG TAGCGGCTGG
CTGCTGTATG GTCGTGGGCT GGATAAACAG CGTCTGACCC AATACCAGAG CAAACTGGGT
GCGGCGATGG TGATTGTTGC CGCCTGGTGC GTGGAAGATT ATCAGGTGAT TCGTCTGGCA
GGTTCACTCA CCGCACGGGC TACACGCCTG GCCCACGAAG CGCAGCTGGA TGTCGCGCCG
CTGGGAAAAA TCCCGCACCT GCGCACGCCG GGTTTGCTGG TGATGGACAT GGATTCCACC
GCCATCCAGA TTGAATGTAT TGATGAAATT GCCAAACTGG CCGGAACGGG CGAGATGGTG
GCGGAAGTAA CCGAACGGGC GATGCGCGGC GAACTCGATT TTACCGCCAG CCTGCGCAGC
CGCGTGGCGA CGCTGAAAGG CGCTGACGCC AATATTCTGC AACAGGTGCG TGAAAATCTG
CCGCTGATGC CAGGCTTAAC GCAACTGGTG CTCAAGCTGG AAACGCTGGG CTGGAAAGTG
GCGATTGCCT CCGGCGGCTT TACTTTCTTT GCTGAATACC TGCGCGACAA GCTGCGCCTG
ACAGCCGTGG TAGCCAATGA ACTGGAGATC ATGGACGGTA AATTTACCGG CAATGTGATC
GGCGACATCG TAGACGCGCA GTACAAAGCG AAAACTCTGA CTCGCCTCGC GCAGGAGTAT
GAAATCCCGC TGGCGCAGAC CGTGGCGATT GGCGATGGAG CCAATGACCT GCCGATGATC
AAAGCGGCAG GGCTGGGGAT TGCCTACCAT GCCAAGCCAA AAGTGAATGA AAAGGCGGAA
GTCACCATCC GTCACGCTGA CCTGATGGGG GTATTCTGCA TCCTCTCAGG CAGCCTGAAT
CAGAAGTAA
 
Protein sequence
MPNITWCDLP EDVSLWPGLP LSLSGDEVMP LDYHAGRSGW LLYGRGLDKQ RLTQYQSKLG 
AAMVIVAAWC VEDYQVIRLA GSLTARATRL AHEAQLDVAP LGKIPHLRTP GLLVMDMDST
AIQIECIDEI AKLAGTGEMV AEVTERAMRG ELDFTASLRS RVATLKGADA NILQQVRENL
PLMPGLTQLV LKLETLGWKV AIASGGFTFF AEYLRDKLRL TAVVANELEI MDGKFTGNVI
GDIVDAQYKA KTLTRLAQEY EIPLAQTVAI GDGANDLPMI KAAGLGIAYH AKPKVNEKAE
VTIRHADLMG VFCILSGSLN QK