Gene Mlg_0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0272 
Symbol 
ID4270490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp313963 
End bp314850 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content74% 
IMG OID638124997 
ProductHemK family modification methylase 
Protein accessionYP_741117 
Protein GI114319434 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCGGG AAGCGCACCC CGCCACGGGC TCCACCCCGC CGCAGCCCAC CCTGGCCGAG 
CTGCGCCGGT CCGCCCGCAC CCGACTGGAG GCGGCGGGCA GCGACTCGCC GGCCGCGGAC
GCCGATGCCC TGCTGGCGCA CGCGCTGGGG CGCGATCGCG CTTTTTTCCT GGCCCATCCC
GAGCACCGTC CGCCGGCATC CAGCCTGGCC CGCTTCCGGC AACTCCTTGC CCGCCGTTTG
GCCGGCGAGC CGGTGGCGCA CCTGACGGGC CGCCGCGGTT TCTGGTCGCT GGAGCTGAAG
GTGACGGCCG AGACGCTGAT CCCCCGCCCG GAGACGGAGC TGCTGGTTGA GGCGGCGCTG
GCCCGGGTTG ACGGCGACCG GCAGCTCCGG GTGGCGGACC TGGGCACGGG CACCGGGGCC
ATTGCGCTGG CGCTGGCGGA TGAGTGTCCG GCGTGGCGGG TGACGGCGGT GGAGGCCAGC
GCCGGGGCCC TGGTGGTGGC CCGGGAGAAC GCCCGCCGGT TAGGGCTGGC GGATCGGGTG
CAGGTGGTGG CGGGGTCCTG GTTCGGCCCA CTGGCCGGTG AGCGTTTTGA TCTGGTGGTG
AGCAATCCGC CCTATGTGGG CGTCCACGAG CCTGAGCTGT ATGAGGGCGA TGTGCGCTTC
GAGCCGCGGT CGGCGCTGGC GGCCGGACGG GACGGGCTGG GTGACCTGCG GCGGATCGTC
GGCGAGGCGC CGGGGCATCT GGTGGCCGGC GGTTGGTTGA TGGTGGAGCA CGGTTTTCAG
CAGGGGGAGG CGGTGCGCCG ACTGTTCCTG GAGGCCGGGT TCGGCGGGGT GGAGACCTTG
CGGGACCTGG CCGGGCATGA GCGGGTGACG GTCGGGCGGC TGGACTGA
 
Protein sequence
MTREAHPATG STPPQPTLAE LRRSARTRLE AAGSDSPAAD ADALLAHALG RDRAFFLAHP 
EHRPPASSLA RFRQLLARRL AGEPVAHLTG RRGFWSLELK VTAETLIPRP ETELLVEAAL
ARVDGDRQLR VADLGTGTGA IALALADECP AWRVTAVEAS AGALVVAREN ARRLGLADRV
QVVAGSWFGP LAGERFDLVV SNPPYVGVHE PELYEGDVRF EPRSALAAGR DGLGDLRRIV
GEAPGHLVAG GWLMVEHGFQ QGEAVRRLFL EAGFGGVETL RDLAGHERVT VGRLD