Gene Mlg_0503 details


Gene Information

Locus tag: Mlg_0503
Symbol:
ID: 4268439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 550095
End bp: 551183
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 62%
IMG OID: 638125244
Product: appr-1-p processing domain-containing protein
Protein accession: YP_741347
Protein GI: 114319664
COG category: [R] General function prediction only
COG ID: [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1
TIGRFAM ID:
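
The length fields in this record are consistent with its coordinates. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(550095, 551183)
print(length)                  # 1089
print(protein_length(length))  # 362
```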


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGAGT ACACCCGAGG CAACCTTCTG GACGCCGATG TGGAGGCGTT GGTCAATACC 
GTCAACACCG TGGGTGTGAT GGGCAAGGGG GTCGCCCTGA TGTTCAAAGA GGCCTTCCCC
GAGAACTTTC GTGCCTACCA GGCGGCGTGC AAGAACCGGG AGGTGGTGCC GGGTCGTATG
TTCGTGCACG AACGGAGTGC ACTGCTCGGG CCGCGTTGGA TCATCAATTT CCCCACTAAG
CAGCATTGGC GCGGTAAGAC GCGGATGGAG TGGATCGACT CCGGCCTGCG GGACCTCGAA
CGGGTGATCC GCGTGAATGG GATTCGGTCC ATCGCGCTTC CGCCGCTGGG GTGCGGCAAT
GGTGGGCTGC CATGGGCACA GGTGCGGCCC CGGATCGAGT CGGCGCTGCG CGACCTGCAG
GACGTGCGGG TGGTGGTCTT CGAGCCGACC CGTCAATACC AAAATGTGGC CAAGCGCTCC
GGGGTGGAGA AGCTGACACC GGCCCGTGCC TTGATTGCCG AGTTGGTGCG TCGGTATTGG
GTGCTGGGGA TGGAGTGCTC CTTGCTGGAG GTGCAGAAGC TGGCCTGGCT AATCGAGCGG
CGCATCATCG ACCACGGCCT GGAGAACCCA CTGGATCTGC AATTCAAGGC TCTTCGCTAT
GGGCCGTATT CGGATCGGTT GCGCCACCTG CTCAATGGGC TTGATGGCAG CTATCTGCGC
AGTGACAAAC GCATCAACGA CGCGGGCCCT GAAGAAGTGG TCTGGTTCGA TGAGGCGCGG
CGCGACAAGC TGGGCATTTA CCTGCGCAGC GCGGAGGTCC GCCCCTATCT TGGAGTGCTG
CAGGAGGTCG ACGACCTTAT CGATGGCTTT CAGTCCCCCC TCGGTCTCGA GGTGCTGGCC
ACCCTGGATT GGCTTATCTG GCAGGAGGGT GTCGCGCCCA CAATAGCGGA CGTTAAAGAA
GGGCTGCGGC GCTGGCCAGA CGACATTGCC GGTCAGCGCA AGCTACGGCT GTTTTCGGAT
CAGCTCATTG AGTTGGCGTT GGCGCGTTTG ACCAGCCGGA CTCCGGATCT TCAGGTCATC
GCCACGTGA
 
Protein sequence
MIEYTRGNLL DADVEALVNT VNTVGVMGKG VALMFKEAFP ENFRAYQAAC KNREVVPGRM 
FVHERSALLG PRWIINFPTK QHWRGKTRME WIDSGLRDLE RVIRVNGIRS IALPPLGCGN
GGLPWAQVRP RIESALRDLQ DVRVVVFEPT RQYQNVAKRS GVEKLTPARA LIAELVRRYW
VLGMECSLLE VQKLAWLIER RIIDHGLENP LDLQFKALRY GPYSDRLRHL LNGLDGSYLR
SDKRINDAGP EEVVWFDEAR RDKLGIYLRS AEVRPYLGVL QEVDDLIDGF QSPLGLEVLA
TLDWLIWQEG VAPTIADVKE GLRRWPDDIA GQRKLRLFSD QLIELALARL TSRTPDLQVI
AT
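
Both sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the gene sequence for comparison with the protein sequence. It assumes that translation table 11 assigns the same residue to each codon as the standard genetic code (the two tables differ in which start codons they permit), and the helper names `clean` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "ATGATTGAGT ACACCCGAGG CAACCTTCTG GACGCCGATG TGGAGGCGTT GGTCAATACC"
print(translate(clean(first_line)))  # MIEYTRGNLLDADVEALVNT
```

Running the full gene sequence through `translate(clean(...))` and comparing the result against the cleaned protein sequence is a quick way to detect copy or extraction errors in either block.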