Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1539 |
Symbol | |
ID | 4270544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1755159 |
End bp | 1758392 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638126297 |
Product | DNA methylase N-4/N-6 domain-containing protein |
Protein accession | YP_742378 |
Protein GI | 114320695 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2189] Adenine specific DNA methylase Mod |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.648492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA ATCCAGCCTT CGACAAACTG ATCCGCCTGC TCAAGGAGCT CTTTCAGCTC GACCAGCCAG ACCTGGACTT CGGGCTCTAC CGCATCATGC ACGCCCGCGC CGACGAGATC AGCCAGTTTC TCGACCGCGA CCTGCTGCCC CAGGTAAAGG ACGCTTTCAG CCACTACAAG ACGGCCGACA AGGCCGGGCT GGAGAAAGAG CTGCAACAAG CCATCGAGCA GGCCAACGGC CTTGGCGTGG ACCCAGAGAC CACCGCCAAG GTGAAGGAGC TACGCCAGAA GATCGCCGAG CAGGGCGTGG ATGTCACAGG TCTGGAGCAG GAAGTCTACG ACCACCTCTA CAAATTCTTC CGCCGCCATT ACCACGAAGG CGACTTCCTC GCCAAGCGCG TCTACAAGCC CGGCGTCTAC GCCATTCCCT ACGAAGGCGA GGAAGTCAAA CTGCACTGGG CCAACAAGGA CCAGTACTAC ATCAAGACCA GCGAATACCT GCGCGACTAC GCCTTTATCC TGAAGCCCGG CGCCGACGAT CCCATGCGCG TGCACTTCCG TCTGGTCGAC GCCGCCGAGG GCGAGCACGG CAACGTCAAG GAAGCCGAGG GCAAGAACCG GGTGTTTATC CTCGCCGGCG AAGACTTTAT CGCCGAGGAG AATGGCGAAG CTGGCCGTGA GCTGATCATC CGCTTCGAGT ACCGGCCGGC AACGATGGAG GACTGGAGCG AGGATGCCAA GGCCAACGCC ACCGCCGCAG CGAAGGAGAA ACCGCCCAAC CAGAAAGACC TGCGCGAGGA TGCTGTACGT CGGGTGCTGG CGATGCAGGA CGACAGCCTC AAGCCCTGGC TGGCGGAGCT GGCCAAGAAC CATATCAAAG CCGATGGCGA GCAGGCCGAC TACAGCCGCC TGGCCGCCCA CCTGAACCGC TACACCGCCC GCAACACGTT TGACTACTTC ATCCACAAGG ACCTGGGCGG CTTCCTGCGC CGGGAGCTGG ACTTCTACAT CAAGAACGAA GTCATGCACC TGGACGACAT CGAAAGCGAA ACGGCGCCGC GCGTGGAGCA GTACCTGTCC AAGATTAAAG TGATCCGTCA GATTGCCGGC AAGATCATTG ATCTTCTGGC GCAACTGGAG AATTTTCAGA AGAAGCTCTG GCTGAAGAAG AAGTTTGTCA CCGAGACCTC GTATTGCATC CGCATCGGCT GCATTCCAGA GGCGTTCCAT CCGGAGATTG CCGCCAACGA GGCCCAGCGT CAGGAATGGG TTGAGCTACA TGCCATTGAT GAGCTTGCAG CGGATTTGAC CACGGTGGCC TACAGTGAGC CGCTGACTGC GGAGTTTTTG AGGGCGCATC CGACATTGAT GGTGGATACG CGGCACTTTG ATGACGCTTT CAGTCAGCGG TTGCTGGAGG CGGTGGGTGA TATAGACGAT CAGACTGACG GCGTTCTTTT CAATAGCGAA AACTTTCAGG CGTTAGCTGT TGCCAACATG AAATACTGGG GTTCAGTGCA CGTGTCGTAT ATTGACCCTC CATACAATAC TGAGCTGGAT AGGCAATCGG GAAAATTCAT CTACAAGGAT AACTATGCCC GCTCCACCTG GGCCTCTCTA ATGGCGGATC GACTTCAATC TGGCGCGTCC TTTCTTAGAG AAGATGGGAC TTTCATCTGC AGTATTGACG ACAACGAATA TCCGACACTT CGTGAAATCC TAAACTCAGT TTATGGAGGC GACAACTTCA TCGGGACGAT AGCTTGGAAG TCACGAGATT CTGTATCAAG CGATCACAAA ATTTCACTGA ACCACAATTA TCATGTTGCA TACGCTAAAG ACTTGGTGGC TAATAAATTC GGAGGTTTTC CTCTGAATCC AGGTGACTAT AGCAATCCGG ACAATGACCC TCGTGGTCCG TGGAAGCCGG TGCCGATCGA CGCCAATAAG CCTGGGGGCG AAACAAAGTA CCCTATTGAA AATCCCAACA CCGGGGACGA ACATTATCCG CCGAACGGTC GGAGTTGGGC GTTTAACCGG TCGCGCTATG ACGAGTTGCT TTCGGACAAC CGTATTACAT TTGGAATTAG GGGGACGGGA GCGCCGAAGC GCAAGCTTTT TTTGAAGGAA AGGACCGAGA AAGGCGATGT AAATACGCCA GTTTCTATCT GGCCAGACGC TGAGACAACT CAAGGCGGTA CTCGCCAAGT AATGAGTTTA TTCGGCAACA AAGTGTTTTC GTACCCAAAG CCTGTGGGGC TTATGCGCGA CCTTATTAGA ATCTCTCACT TGAATTCAAA TTGCGTCGTG GCGGATTATT TTGCCGGGTC AGGCACTACG GGGCATGCAA TTGTCAACCT TAACCGAGCC GATGGTAGTA GGCGTAAATT TTTATTGATG GAGATGGGTG ATTATTTCGA TGCGGTGCTT CTTCCACGCT TGAAGAAAGT CACGTTTGCG CCCGATTGGG CAGACGGGAA GCCTGAACGC CTCGCAACAG AAGAAGAGGC GGAATGCAGC CCCCGAATCA TAAAGGTCAT CCGGCTCGAA TCCTACGAGG ACGCCCTTAA CAACCTGGAG CCGCGCCGCA GTGAAACACA AAGCGACCTG TTAGCTAGCC AGCAGGCTCA AGGTGCCGAC GGCCTGCGCG AGCAGTACCT GCTGCGCTAC TGGCTGGATG TGGAGACTAG GGGCCAACAA TCACTGCTCA ATATCGACGC CTTTACCGAC CCCACCGCTT ACCGGCTCAA GGTCAAGCGC CCCGGCAGCG AGGAAACCCG CGAGGTCAAT GTGGACCTGC TGGAGACCTT CAACTGGCTG ATCGGCCTGA CCGTGGAAAC CATCGCCGCG CCCCAGAGGG TGGCGGCCCA GTTCAAGCGC GATGACGATC CGGATCTGCC CAAGGAAAAC CCGCGCCGCC TGCTGCTCGA CGGCCGCATC CGCGAAGCCG AAGAAGGCCC CTGGTGGTTC CGCACCGTCA CCGGCACCAC GCCGGACGGA CGTAAAACCC TGGTGATCTG GCGCAAGCTC ACGGGCGACC CCGAGCAGGA CAATCTGGTA CTGGACGAAT GGTTCAAGAA GCAGGGCTAT TCCAGCAAGG ACAGCGAGTT CGACCTGATC TACGTCAACG GCGACAACAA CCTGGAGAAC CTGCGCCAGC CCGACGACAC CTGGAAAGTC CGCCTCATCG AGGAAGACTT CCACCGGCTG ATGTTCGAGG AGGCCGAATC ATGA
|
Protein sequence | MSQNPAFDKL IRLLKELFQL DQPDLDFGLY RIMHARADEI SQFLDRDLLP QVKDAFSHYK TADKAGLEKE LQQAIEQANG LGVDPETTAK VKELRQKIAE QGVDVTGLEQ EVYDHLYKFF RRHYHEGDFL AKRVYKPGVY AIPYEGEEVK LHWANKDQYY IKTSEYLRDY AFILKPGADD PMRVHFRLVD AAEGEHGNVK EAEGKNRVFI LAGEDFIAEE NGEAGRELII RFEYRPATME DWSEDAKANA TAAAKEKPPN QKDLREDAVR RVLAMQDDSL KPWLAELAKN HIKADGEQAD YSRLAAHLNR YTARNTFDYF IHKDLGGFLR RELDFYIKNE VMHLDDIESE TAPRVEQYLS KIKVIRQIAG KIIDLLAQLE NFQKKLWLKK KFVTETSYCI RIGCIPEAFH PEIAANEAQR QEWVELHAID ELAADLTTVA YSEPLTAEFL RAHPTLMVDT RHFDDAFSQR LLEAVGDIDD QTDGVLFNSE NFQALAVANM KYWGSVHVSY IDPPYNTELD RQSGKFIYKD NYARSTWASL MADRLQSGAS FLREDGTFIC SIDDNEYPTL REILNSVYGG DNFIGTIAWK SRDSVSSDHK ISLNHNYHVA YAKDLVANKF GGFPLNPGDY SNPDNDPRGP WKPVPIDANK PGGETKYPIE NPNTGDEHYP PNGRSWAFNR SRYDELLSDN RITFGIRGTG APKRKLFLKE RTEKGDVNTP VSIWPDAETT QGGTRQVMSL FGNKVFSYPK PVGLMRDLIR ISHLNSNCVV ADYFAGSGTT GHAIVNLNRA DGSRRKFLLM EMGDYFDAVL LPRLKKVTFA PDWADGKPER LATEEEAECS PRIIKVIRLE SYEDALNNLE PRRSETQSDL LASQQAQGAD GLREQYLLRY WLDVETRGQQ SLLNIDAFTD PTAYRLKVKR PGSEETREVN VDLLETFNWL IGLTVETIAA PQRVAAQFKR DDDPDLPKEN PRRLLLDGRI REAEEGPWWF RTVTGTTPDG RKTLVIWRKL TGDPEQDNLV LDEWFKKQGY SSKDSEFDLI YVNGDNNLEN LRQPDDTWKV RLIEEDFHRL MFEEAES
|
| |