Gene EcolC_3668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3668 
SymbolserB 
ID6067415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4018033 
End bp4019001 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content57% 
IMG OID641603083 
Productphosphoserine phosphatase 
Protein accessionYP_001726606 
Protein GI170021652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0560] Phosphoserine phosphatase 
TIGRFAM ID[TIGR00338] phosphoserine phosphatase SerB
[TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.610208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAACA TTACCTGGTG CGACCTGCCT GAAGATGTCT CTTTATGGCC GGGTCTGCCT 
CTTTCATTAA GTGGTGATGA AGTGATGCCA CTGGATTACC ACGCAGGTCG TAGCGGCTGG
CTGCTGTATG GTCGTGGGCT GGATAAACAG CGTCTGACCC AATACCAGAG CAAACTGGGT
GCGGCGATGG TGATTGTTGC CGCCTGGTGC GTGGAAGATT ATCAGGTGAT TCGTCTGGCA
GGTTCACTCA CCGCACGGGC TACGCGCCTG GCCCATGAAG CGCAACTGGA TGTCGCCCCG
CTGGGGAAAA TCCCGCACCT GCGCACGCCG GGTTTGCTGG TGATGGACAT GGATTCCACC
GCCATCCAGA TTGAATGTAT TGATGAAATT GCCAAACTGG CCGGAACGGG CGAGATGGTG
GCGGAAGTAA CCGAACGGGC GATGCGCGGC GAACTCGATT TTACCGCCAG CCTGCGCAGC
CGCGTGGCGA CGCTGAAAGG CGCTGACGCC AATATTCTGC AACAGGTGCG TGAAAATCTG
CCGCTGATGC CAGGCTTAAC GCAACTGGTG CTCAAGCTGG AAACGCTGGG CTGGAAAGTG
GCGATTGCCT CCGGCGGCTT TACTTTCTTT GCTGAATATC TGCGCGACAA GTTGCGCCTG
ACCGCCGTGG TAGCCAATGA ACTGGAGATC ATGGACGGTA AATTTACCGG CAATGTGATC
GGCGACATCG TAGACGCGCA GTACAAAGCG AAAACTCTGA CTCGCCTCGC GCAGGAGTAT
GAAATCCCGC TGGCGCAGAC CGTGGCGATT GGCGATGGAG CCAATGACCT GCCGATGATC
AAAGCGGCAG GGCTGGGGAT TGCCTACCAT GCCAAGCCAA AAGTGAATGA AAAGGCGGAA
GTCACCATCC GTCACGCTGA CCTGATGGGG GTATTCTGCA TCCTCTCAGG CAGCCTGAAT
CAGAAGTAA
 
Protein sequence
MPNITWCDLP EDVSLWPGLP LSLSGDEVMP LDYHAGRSGW LLYGRGLDKQ RLTQYQSKLG 
AAMVIVAAWC VEDYQVIRLA GSLTARATRL AHEAQLDVAP LGKIPHLRTP GLLVMDMDST
AIQIECIDEI AKLAGTGEMV AEVTERAMRG ELDFTASLRS RVATLKGADA NILQQVRENL
PLMPGLTQLV LKLETLGWKV AIASGGFTFF AEYLRDKLRL TAVVANELEI MDGKFTGNVI
GDIVDAQYKA KTLTRLAQEY EIPLAQTVAI GDGANDLPMI KAAGLGIAYH AKPKVNEKAE
VTIRHADLMG VFCILSGSLN QK