Gene Mlg_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1952 
SymbolrlmL 
ID4268121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2220354 
End bp2222492 
Gene Length2139 bp 
Protein Length712 aa 
Translation table11 
GC content70% 
IMG OID638126707 
Product23S rRNA m(2)G2445 methyltransferase 
Protein accessionYP_742784 
Protein GI114321101 
COG category[L] Replication, recombination and repair
[R] General function prediction only 
COG ID[COG0116] Predicted N6-adenine-specific DNA methylase
[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.64464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.031867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGAAC ACCATTCCCT GTTTGCGGCC TGCCCCCGGG GCCTGGAGCC GGCGCTGGTC 
GAGGAGTTGG CCCGGCTCGG GGTCGCCCAG CCCCGGGCGC AGCGCGCCGG GGTCGCCTGG
GAGGGGGATC TGGAGAGTGC GCTCCGGGCC TGCCTATGGT CCCGACTCGC CAGCCGGATC
CTGCTCGGGT TGAGCACCGT CCCGGTGGAG GATGCCGACG GCCTTTATCA AGCAGCTCGC
GACCTGCCCT GGGAGGACCA TCTGTTGCCG GGTGACACGT TGGCGGTGGA TTTTCATGGC
AGCAATGCCG CCATCCGCCA TACCCGGTTC GGGGCGCAGC GGGTGAAGGA TGGGGTGGTG
GATCGCATGC GGGCCGTTGG CCACCCCCGG CCCTCGGTGG ACACGGCCGC CCCGGATCTG
CGCATCAATG CGGTACTGGC GGGCGGGCGG CTGCGTCTGG GGATCGACCT TTCCGGCGAG
AGCCTACACC GGCGTGGTTA CCGCCAGGGC GGGGGCCGTG CCCCCCTGAA AGAGAACCTG
GCCGCCGGAC TCCTCTGGCT GGCGGGGTGG CCGCGCATTG CCGAGGACGG CGGTGGTCTG
CTCGACCCCA TGTGTGGCTC CGGGACGCTG GTGCTGGAGG GGGCCCTGAT GGCGCTGGAC
CGTGCTCCGG GGCTGGGGCG CGAGCGCTGG GGCTTCAGCC GCTGGGCAGG GCACGTCCCG
GTCTACTGGA AGCGTCTGCG CGACGAGGCC GAGGAGCGCG CCGCCGCCGG GGGCCGGCGA
CGGCTGGCGC TGGTGGGCTA CGATCAGGAC CCGCAGGCGA TCCGCGCGGC GTTGGACAAT
CGCGAGCGGG CGGGGCTGCG AGACCGCGTC CACTTCGAGC GCCGGTCGCT GGAGGCGGCG
GAACCGGTCG GGGAGCGGCC GGGTCTGGTG GTGGTCAATC CTCCATATGG GGAGCGGTTG
GGCCAGCGCC AGGCACTGGT CACCCTCTAC GCCAGCCTGG GGGCCCGCCT GCGAGGGGCC
TTCGGTGGCT GGCAGGGTGC CGTCTTTAGC GGTGCGCCGG AGCTGCTCGA TTACCTGGGG
CTTTCCATCG CCCGCCGGCA CGCCTTGTAT AACGGTGCCC TGGAGACCCA ACTGGCCGTC
TTCGCCCTGC GTGAGCAGGG CGCGGACCGG CGGAGTCAGG GGGCCACGGC CCTGGACAAC
CGGCTGCAGA AGAATCACCG CCACCTCCGG CGCTGGTTGC GACGCGAGGG CGTTAAGGCC
TACCGGTTGT ACGACGGTGA TCTGCCGGAG TATGCGCTGG CGGTGGATGT CTACGAAACG
GAATCGGGGC GCCATGCCCA TGTCCAGGAG TATCAGGCCC CGCGCAGCGT CGACCCACGC
AGCGCCCGGC GGCGGCTGCG TGAGGCGCTG GAGGTGATCG CGGCGCACCT GGAGGTGGGG
CCGGAGCGGG TTCACCTTAA GGTGCGCCGC CGGCAGAAGG GGGCGGACCA GTACCGCCCG
GTGGACCAGA CCGGCGAGCG GTGGGTGGTG CAGGAGGGCC CGGCCCGGTT CTACATCAAC
CTGAGCGATT ATCTCGATAC CGGTCTGTTC CTGGATCATC GCATCACCCG GCTGCGACTC
GGGGAGCAGG CCAGGGGGCG GCGGTTCCTC AATCTGTTCG CCTATACCGG AACGGCGACG
GTGCATGCGG CGTTAGGCGG TGCCAGAGAG ACCATCACCG TGGACCTGTC GGCCACCTAC
CTGGGTTGGG CGCGGGATAA TCTCCTGCTT AACGGCATCG AGCCGGGAGC GCGCCATCGG
CTGGAGCGGG CGGACTGCCT GGCCTGGCTG GCCGGGCAGG CGGAAAGCCG GCCCGGGCGT
TACGACCTGA TCTTCATGGA CCCGCCGACC TTTTCCAACT CCAAGCGCAT GTGCGAGAGC
TTCGACGTGC AGCGCGATCA CCCCCGGCTG ATCCGCCAGG CCATGCAGTT GCTGGCCCCG
GATGGGTTGT TGGTCTTTTC CTGTAACCGG CGGGGGTTTT CCCTGGATGA GGCGGTGGCG
TCCGACTACG CCTGCCGGGA GATCACCCGG GAGACGATCC CCCCAGATTA CGCCCGTAAT
CCCCACGTGC ATTATTGCTG GGAACTCCGG CATCGGTGA
 
Protein sequence
MNEHHSLFAA CPRGLEPALV EELARLGVAQ PRAQRAGVAW EGDLESALRA CLWSRLASRI 
LLGLSTVPVE DADGLYQAAR DLPWEDHLLP GDTLAVDFHG SNAAIRHTRF GAQRVKDGVV
DRMRAVGHPR PSVDTAAPDL RINAVLAGGR LRLGIDLSGE SLHRRGYRQG GGRAPLKENL
AAGLLWLAGW PRIAEDGGGL LDPMCGSGTL VLEGALMALD RAPGLGRERW GFSRWAGHVP
VYWKRLRDEA EERAAAGGRR RLALVGYDQD PQAIRAALDN RERAGLRDRV HFERRSLEAA
EPVGERPGLV VVNPPYGERL GQRQALVTLY ASLGARLRGA FGGWQGAVFS GAPELLDYLG
LSIARRHALY NGALETQLAV FALREQGADR RSQGATALDN RLQKNHRHLR RWLRREGVKA
YRLYDGDLPE YALAVDVYET ESGRHAHVQE YQAPRSVDPR SARRRLREAL EVIAAHLEVG
PERVHLKVRR RQKGADQYRP VDQTGERWVV QEGPARFYIN LSDYLDTGLF LDHRITRLRL
GEQARGRRFL NLFAYTGTAT VHAALGGARE TITVDLSATY LGWARDNLLL NGIEPGARHR
LERADCLAWL AGQAESRPGR YDLIFMDPPT FSNSKRMCES FDVQRDHPRL IRQAMQLLAP
DGLLVFSCNR RGFSLDEAVA SDYACREITR ETIPPDYARN PHVHYCWELR HR