Gene Mlg_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2041 
Symbol 
ID4268157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2311166 
End bp2312380 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content66% 
IMG OID638126797 
Productphosphoserine phosphatase 
Protein accessionYP_742873 
Protein GI114321190 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0560] Phosphoserine phosphatase 
TIGRFAM ID[TIGR00338] phosphoserine phosphatase SerB
[TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like
[TIGR01490] HAD-superfamily subfamily IB hydrolase, TIGR01490
[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.143406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA TCATCCTCAT CAACGTCTCC GGCCAGGACC AACCCGGGCT TACCTCCTCG 
CTGATGGGCA TCCTGGCCGA ATACAACGTG GGCATCCTGG ATATCGGCCA GGCGATGATC
CATGACACCC TCTCCCTCGG CATCCTCATC GAGGTGCCGG AGGAGGCCAA TGTCTCGCCG
GTGCTCAAGG ATGTGCTCTT CCACCTGCAT GAGAAGGGGA TGCAGGTGCG CTTCACCCCG
GTCAGCGCCG AGGACTACAG CGCCTGGGTG GCCGGGGCCG GCCGCGCCCG TTACATCATC
ACCCTGCTGG GCCGGCGGAT CACCGCCGAG CAGATCGCCC GCATCGCCAG TGTGATCAGC
GCCCAGGGCC AGAACATCGA GGATGTCATC CGGCTCTCCG GCCGCCGCCC CCTGCACAAG
GCGGATGAAC GCTCCCGCGC CTGTATCGAG CTGACGGTGC GCGGCCAGCC GGTGGACCTG
GATACCATGA AGCGGGACTT TCTGGAGATC TCCGGCCAAC TGGGCATCGA TATCTCCTTC
CAGGAGGACA ACGTCTATCG TCGCAACCGG CGCCTGGTGG CCTTCGACAT GGATTCCACC
CTGATCCAGC AGGAGGTCAT TGACGAGATG GCCAAGGCGG CGGGCGTGGG CGATCAGGTT
TCGGCGGTGA CCGCGGCCGC CATGCGCGGC GAGATCGATT TCAAGGAGAG CCTGCGCCAG
CGGGTGGCCT GTCTGGAGGG GCTGCCGGAG TCCACCCTGC GGAGCGTGGC CGACCGCCTG
ACCCTCACCG AGGGGGCGGA ACGCCTGGTG CGCACCCTGA AAAGCTTTGG CTACCGCACG
GCGATCATCT CCGGCGGCTT CACCTACTTT GGGCGCATGC TGCAGGAGCG CCTGGCGATC
GACTACGTGT TCGCCAACGA GTTGGAGATC GAGAATGGCC TGCTGACCGG CCGGGTCACC
GGCCCGATCG TGGACGGCCC GCGCAAGGCC GAGCTGTTGC GCGAGATCGC TCAGCGCGAG
CAGATCCGGC TGGAGCAGGT GATCGCCATC GGCGACGGCG CCAACGACCT GCCCATGCTG
CGTCTGGCCG GCCTGGGCAT CGCCTTCCAC GCCAAACCGG TGGTACGGGA GAGCGCCCGC
CAGTCCATCT CCACGCTGGG CCTGGACGCC ACCCTCTACC TGATGGGTAT CAAGGATACC
GAGACACCGG CCTGA
 
Protein sequence
MSEIILINVS GQDQPGLTSS LMGILAEYNV GILDIGQAMI HDTLSLGILI EVPEEANVSP 
VLKDVLFHLH EKGMQVRFTP VSAEDYSAWV AGAGRARYII TLLGRRITAE QIARIASVIS
AQGQNIEDVI RLSGRRPLHK ADERSRACIE LTVRGQPVDL DTMKRDFLEI SGQLGIDISF
QEDNVYRRNR RLVAFDMDST LIQQEVIDEM AKAAGVGDQV SAVTAAAMRG EIDFKESLRQ
RVACLEGLPE STLRSVADRL TLTEGAERLV RTLKSFGYRT AIISGGFTYF GRMLQERLAI
DYVFANELEI ENGLLTGRVT GPIVDGPRKA ELLREIAQRE QIRLEQVIAI GDGANDLPML
RLAGLGIAFH AKPVVRESAR QSISTLGLDA TLYLMGIKDT ETPA