Gene Mlg_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1539 
Symbol 
ID4270544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1755159 
End bp1758392 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content56% 
IMG OID638126297 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_742378 
Protein GI114320695 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.648492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA ATCCAGCCTT CGACAAACTG ATCCGCCTGC TCAAGGAGCT CTTTCAGCTC 
GACCAGCCAG ACCTGGACTT CGGGCTCTAC CGCATCATGC ACGCCCGCGC CGACGAGATC
AGCCAGTTTC TCGACCGCGA CCTGCTGCCC CAGGTAAAGG ACGCTTTCAG CCACTACAAG
ACGGCCGACA AGGCCGGGCT GGAGAAAGAG CTGCAACAAG CCATCGAGCA GGCCAACGGC
CTTGGCGTGG ACCCAGAGAC CACCGCCAAG GTGAAGGAGC TACGCCAGAA GATCGCCGAG
CAGGGCGTGG ATGTCACAGG TCTGGAGCAG GAAGTCTACG ACCACCTCTA CAAATTCTTC
CGCCGCCATT ACCACGAAGG CGACTTCCTC GCCAAGCGCG TCTACAAGCC CGGCGTCTAC
GCCATTCCCT ACGAAGGCGA GGAAGTCAAA CTGCACTGGG CCAACAAGGA CCAGTACTAC
ATCAAGACCA GCGAATACCT GCGCGACTAC GCCTTTATCC TGAAGCCCGG CGCCGACGAT
CCCATGCGCG TGCACTTCCG TCTGGTCGAC GCCGCCGAGG GCGAGCACGG CAACGTCAAG
GAAGCCGAGG GCAAGAACCG GGTGTTTATC CTCGCCGGCG AAGACTTTAT CGCCGAGGAG
AATGGCGAAG CTGGCCGTGA GCTGATCATC CGCTTCGAGT ACCGGCCGGC AACGATGGAG
GACTGGAGCG AGGATGCCAA GGCCAACGCC ACCGCCGCAG CGAAGGAGAA ACCGCCCAAC
CAGAAAGACC TGCGCGAGGA TGCTGTACGT CGGGTGCTGG CGATGCAGGA CGACAGCCTC
AAGCCCTGGC TGGCGGAGCT GGCCAAGAAC CATATCAAAG CCGATGGCGA GCAGGCCGAC
TACAGCCGCC TGGCCGCCCA CCTGAACCGC TACACCGCCC GCAACACGTT TGACTACTTC
ATCCACAAGG ACCTGGGCGG CTTCCTGCGC CGGGAGCTGG ACTTCTACAT CAAGAACGAA
GTCATGCACC TGGACGACAT CGAAAGCGAA ACGGCGCCGC GCGTGGAGCA GTACCTGTCC
AAGATTAAAG TGATCCGTCA GATTGCCGGC AAGATCATTG ATCTTCTGGC GCAACTGGAG
AATTTTCAGA AGAAGCTCTG GCTGAAGAAG AAGTTTGTCA CCGAGACCTC GTATTGCATC
CGCATCGGCT GCATTCCAGA GGCGTTCCAT CCGGAGATTG CCGCCAACGA GGCCCAGCGT
CAGGAATGGG TTGAGCTACA TGCCATTGAT GAGCTTGCAG CGGATTTGAC CACGGTGGCC
TACAGTGAGC CGCTGACTGC GGAGTTTTTG AGGGCGCATC CGACATTGAT GGTGGATACG
CGGCACTTTG ATGACGCTTT CAGTCAGCGG TTGCTGGAGG CGGTGGGTGA TATAGACGAT
CAGACTGACG GCGTTCTTTT CAATAGCGAA AACTTTCAGG CGTTAGCTGT TGCCAACATG
AAATACTGGG GTTCAGTGCA CGTGTCGTAT ATTGACCCTC CATACAATAC TGAGCTGGAT
AGGCAATCGG GAAAATTCAT CTACAAGGAT AACTATGCCC GCTCCACCTG GGCCTCTCTA
ATGGCGGATC GACTTCAATC TGGCGCGTCC TTTCTTAGAG AAGATGGGAC TTTCATCTGC
AGTATTGACG ACAACGAATA TCCGACACTT CGTGAAATCC TAAACTCAGT TTATGGAGGC
GACAACTTCA TCGGGACGAT AGCTTGGAAG TCACGAGATT CTGTATCAAG CGATCACAAA
ATTTCACTGA ACCACAATTA TCATGTTGCA TACGCTAAAG ACTTGGTGGC TAATAAATTC
GGAGGTTTTC CTCTGAATCC AGGTGACTAT AGCAATCCGG ACAATGACCC TCGTGGTCCG
TGGAAGCCGG TGCCGATCGA CGCCAATAAG CCTGGGGGCG AAACAAAGTA CCCTATTGAA
AATCCCAACA CCGGGGACGA ACATTATCCG CCGAACGGTC GGAGTTGGGC GTTTAACCGG
TCGCGCTATG ACGAGTTGCT TTCGGACAAC CGTATTACAT TTGGAATTAG GGGGACGGGA
GCGCCGAAGC GCAAGCTTTT TTTGAAGGAA AGGACCGAGA AAGGCGATGT AAATACGCCA
GTTTCTATCT GGCCAGACGC TGAGACAACT CAAGGCGGTA CTCGCCAAGT AATGAGTTTA
TTCGGCAACA AAGTGTTTTC GTACCCAAAG CCTGTGGGGC TTATGCGCGA CCTTATTAGA
ATCTCTCACT TGAATTCAAA TTGCGTCGTG GCGGATTATT TTGCCGGGTC AGGCACTACG
GGGCATGCAA TTGTCAACCT TAACCGAGCC GATGGTAGTA GGCGTAAATT TTTATTGATG
GAGATGGGTG ATTATTTCGA TGCGGTGCTT CTTCCACGCT TGAAGAAAGT CACGTTTGCG
CCCGATTGGG CAGACGGGAA GCCTGAACGC CTCGCAACAG AAGAAGAGGC GGAATGCAGC
CCCCGAATCA TAAAGGTCAT CCGGCTCGAA TCCTACGAGG ACGCCCTTAA CAACCTGGAG
CCGCGCCGCA GTGAAACACA AAGCGACCTG TTAGCTAGCC AGCAGGCTCA AGGTGCCGAC
GGCCTGCGCG AGCAGTACCT GCTGCGCTAC TGGCTGGATG TGGAGACTAG GGGCCAACAA
TCACTGCTCA ATATCGACGC CTTTACCGAC CCCACCGCTT ACCGGCTCAA GGTCAAGCGC
CCCGGCAGCG AGGAAACCCG CGAGGTCAAT GTGGACCTGC TGGAGACCTT CAACTGGCTG
ATCGGCCTGA CCGTGGAAAC CATCGCCGCG CCCCAGAGGG TGGCGGCCCA GTTCAAGCGC
GATGACGATC CGGATCTGCC CAAGGAAAAC CCGCGCCGCC TGCTGCTCGA CGGCCGCATC
CGCGAAGCCG AAGAAGGCCC CTGGTGGTTC CGCACCGTCA CCGGCACCAC GCCGGACGGA
CGTAAAACCC TGGTGATCTG GCGCAAGCTC ACGGGCGACC CCGAGCAGGA CAATCTGGTA
CTGGACGAAT GGTTCAAGAA GCAGGGCTAT TCCAGCAAGG ACAGCGAGTT CGACCTGATC
TACGTCAACG GCGACAACAA CCTGGAGAAC CTGCGCCAGC CCGACGACAC CTGGAAAGTC
CGCCTCATCG AGGAAGACTT CCACCGGCTG ATGTTCGAGG AGGCCGAATC ATGA
 
Protein sequence
MSQNPAFDKL IRLLKELFQL DQPDLDFGLY RIMHARADEI SQFLDRDLLP QVKDAFSHYK 
TADKAGLEKE LQQAIEQANG LGVDPETTAK VKELRQKIAE QGVDVTGLEQ EVYDHLYKFF
RRHYHEGDFL AKRVYKPGVY AIPYEGEEVK LHWANKDQYY IKTSEYLRDY AFILKPGADD
PMRVHFRLVD AAEGEHGNVK EAEGKNRVFI LAGEDFIAEE NGEAGRELII RFEYRPATME
DWSEDAKANA TAAAKEKPPN QKDLREDAVR RVLAMQDDSL KPWLAELAKN HIKADGEQAD
YSRLAAHLNR YTARNTFDYF IHKDLGGFLR RELDFYIKNE VMHLDDIESE TAPRVEQYLS
KIKVIRQIAG KIIDLLAQLE NFQKKLWLKK KFVTETSYCI RIGCIPEAFH PEIAANEAQR
QEWVELHAID ELAADLTTVA YSEPLTAEFL RAHPTLMVDT RHFDDAFSQR LLEAVGDIDD
QTDGVLFNSE NFQALAVANM KYWGSVHVSY IDPPYNTELD RQSGKFIYKD NYARSTWASL
MADRLQSGAS FLREDGTFIC SIDDNEYPTL REILNSVYGG DNFIGTIAWK SRDSVSSDHK
ISLNHNYHVA YAKDLVANKF GGFPLNPGDY SNPDNDPRGP WKPVPIDANK PGGETKYPIE
NPNTGDEHYP PNGRSWAFNR SRYDELLSDN RITFGIRGTG APKRKLFLKE RTEKGDVNTP
VSIWPDAETT QGGTRQVMSL FGNKVFSYPK PVGLMRDLIR ISHLNSNCVV ADYFAGSGTT
GHAIVNLNRA DGSRRKFLLM EMGDYFDAVL LPRLKKVTFA PDWADGKPER LATEEEAECS
PRIIKVIRLE SYEDALNNLE PRRSETQSDL LASQQAQGAD GLREQYLLRY WLDVETRGQQ
SLLNIDAFTD PTAYRLKVKR PGSEETREVN VDLLETFNWL IGLTVETIAA PQRVAAQFKR
DDDPDLPKEN PRRLLLDGRI REAEEGPWWF RTVTGTTPDG RKTLVIWRKL TGDPEQDNLV
LDEWFKKQGY SSKDSEFDLI YVNGDNNLEN LRQPDDTWKV RLIEEDFHRL MFEEAES