Gene Information |
Locus tag | Moth_2281 |
Symbol | |
ID | 3831392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2391373 |
End bp | 2393628 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637830201 |
Product | adenine-specific DNA methylase |
Protein accession | YP_431111 |
Protein GI | 83591102 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
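The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the listed gene length supports) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values copied from the record for Moth_2281.
length = gene_length_bp(2391373, 2393628)
protein = expected_protein_length_aa(length)
print(length, protein)  # 2256 751
```

Both results agree with the Gene Length (2256 bp) and Protein Length (751 aa) fields of the record.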
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0186187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00483181 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCGTCGAA TATATTCCGG TAATGAAAAA TATTACCGGA GCGCTTATAT GACAATCGAG CGTAACTTTG ACATAGCCTT TGTGGCCGAC CTGGCCCTGC ATGAAAAGCA AATCCAGCAG AATTATCGAC CCATTATTGC CGTACACAAA TGGTTTGCCC GCCGGCCAGG CACATTGTTT CGCAGTCTAC TACTGGCAGA GTTTGCACAG GGCAATTTAG CCGACAATTA CTACCGTTCC CATAATTTTC AAGGTCTTAA GGTGGCCGAT CCTTTTATGG GCGGGGGAAC ACCCCTTATT GAGGCCAACC GTCTCGGCTG CCATATTCTG GGTTATGATA TTAATCCTAT GGCTTACTGG ATTGTCCGCG AGGAGATTGA GCATCTGAAC TTGGAAGCTT ACCAGCAGGC AGCCCGGGAA GTTGGATTTT TTTTAGAAGA AAAGGTAGGT CCTTTTTACC GGACCAGGTG TCCCATATGC GGGCGCCAGG ATGCCCTGGT AAAATACTTC CTTTGGGTAA AGGTCCATCG CTGTCATAAT TGCGGCCGGG AATTTGATCT CTTTCCAGGG TATGTTCTGG CGCAAAAGGG GCGTCACCCC AAGGATGTCA TAATCTGTTC CACCTGCGGT AGCCTTAACG AAGTCGGGGA TAAAAGAAAT CCCGGCCACT GCCATAATTG CGGCGAAGAA TTAAAAACAA AAGGCCCGGC TGGTCGTAAC CAGTGCCCTT GTCCTCATTG TGGCGTTAGA AATTCTTATC CTGACCCGGA AAGTGGCCCT CCCGGGCACC GGATGGTGGC CATTGAATAT CACTGCTCCT ATTGCAAGCC TGAGCACCGG GGTCGCTTTT TTAAAAAACC GGATGCTGAT GATCTGGCCA AATTCGCAAC CGCTGTTGGC ACATGGGAGG CGCTCCAGCC GCAGTTTGTT CCCGAAGAAA AAATACCCGC TGGCGATGAG ACAAACCGGC TCCACCGGTG GGGTTACCGC TATTACCGGG AGATGTTTAA CGAACGCCAG CTCCTTGGTT TGGAGTTGCT CGCCCGGAAA ATAAGCCAGC AGCCGGATGA ACGCATTAAA AACGCCCTGG CCACCAATCT TTCCGATCTG CTGCGCTATC AAAACATGCT TTGTCGTTAC GACCCCTATG CCCTGAAATC CCTGGATATT TTCTCTGTTC ACGGTTTCCC CGTCGGCTTA ATCCAGTGCG AATCCAACAT GTTGGGGATA CCTGGGGGAA AAACGGGTCT AAATATTGGC AGTGGCGGCT GGACCAATAT TGTCGACAAA TATTTAAAGG CCAAACACTA TTGCCAGTGG CCCTTTGAAA TAAGGCATGT GAATGGCCGT AAACGGCAGT TATGGATCAA AGGAGAATGG ATAGGGGAGC GCCGCCAAGG GATGACGCAA CAGCGGGAGG TAGATTTAAG GTGCGCCAGT GCCACCACGG CCTTTCTGAA ACCATCTTCC CTGGATGCCG TTCTTACCGA CCCGCCTTAT TTTGCCAACG TCCAGTATGC CGAACTCATG GATTTTTGTT ATGTATGGCT GCGGCGCCTG GTGGGGGCTA GTAACCCGGT ATTTACCCCA CGGACAACCC GTAATCCTGA AGAACTCACA GGTAATACTA CCATGTCCAG GGGGATCGAT GATTTTACCG GGGGGTTGAG CCGGGTTTTT TCTAATATGG CCGCAGCCTT AAAACCGGGC GCCCCCTTTG TCTTTACTTA TCACCATAAC AGGCTTGAAG CTTATTACCC TGTTGCCGTG GCCCTGCTAG ATGCCGGTCT GGCCTGTACA 
GCCACCTTGC CCTGTCCGGC GGAAATGGCA GCCTCCATCC ATATCAACGG TACCGGTTCT TCAATTATAG ATACGGTTTT CGTTTGCCGG ACTACAGGTG TAGTTTCACG GCGCCTGCTG GTGAAAGAAC CGGAGCAAAT TGCGGCCTTA ATCATAAAGG AACTGGAGGA ACTGGAAAAG GGCGGAGTGC CTGTGACGAG GGGAGATACT CGTTGCATTA TTTACGGCCA TTTAATCCGG CTGGCCGTAT GGTATTTACG GGCAACATGG GATAAAAATC TAAGTTGGGA TAAAAAGTTC GCCTTGATTG CCAGGATGAT TGATGAACTG GGCGGCGCCG GTGCTATCGA GACATATTTA GAGGAGAACG GAGTACAGCT GAAGACGAGG CGCGAAACCA TCGTGTGTGA AGGTGAATCT GAATATGGAG CTGGCGGTGA TGAAGTATCC TTTTGA
Protein sequence | MRRIYSGNEK YYRSAYMTIE RNFDIAFVAD LALHEKQIQQ NYRPIIAVHK WFARRPGTLF RSLLLAEFAQ GNLADNYYRS HNFQGLKVAD PFMGGGTPLI EANRLGCHIL GYDINPMAYW IVREEIEHLN LEAYQQAARE VGFFLEEKVG PFYRTRCPIC GRQDALVKYF LWVKVHRCHN CGREFDLFPG YVLAQKGRHP KDVIICSTCG SLNEVGDKRN PGHCHNCGEE LKTKGPAGRN QCPCPHCGVR NSYPDPESGP PGHRMVAIEY HCSYCKPEHR GRFFKKPDAD DLAKFATAVG TWEALQPQFV PEEKIPAGDE TNRLHRWGYR YYREMFNERQ LLGLELLARK ISQQPDERIK NALATNLSDL LRYQNMLCRY DPYALKSLDI FSVHGFPVGL IQCESNMLGI PGGKTGLNIG SGGWTNIVDK YLKAKHYCQW PFEIRHVNGR KRQLWIKGEW IGERRQGMTQ QREVDLRCAS ATTAFLKPSS LDAVLTDPPY FANVQYAELM DFCYVWLRRL VGASNPVFTP RTTRNPEELT GNTTMSRGID DFTGGLSRVF SNMAAALKPG APFVFTYHHN RLEAYYPVAV ALLDAGLACT ATLPCPAEMA ASIHINGTGS SIIDTVFVCR TTGVVSRRLL VKEPEQIAAL IIKELEELEK GGVPVTRGDT RCIIYGHLIR LAVWYLRATW DKNLSWDKKF ALIARMIDEL GGAGAIETYL EENGVQLKTR RETIVCEGES EYGAGGDEVS F
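The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, together with a GC fraction calculation of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(displayed: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First three display blocks of the gene sequence, as shown in the record.
sample = "ATGCGTCGAA TATATTCCGG TAATGAAAAA"
dna = clean_sequence(sample)
print(len(dna))
print(round(gc_fraction(dna), 3))
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field, which gives a quick check that no blocks were lost when copying the record.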