Gene Moth_2281 details


Gene Information

Locus tag: Moth_2281
Symbol:
ID: 3831392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2391373
End bp: 2393628
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 51%
IMG OID: 637830201
Product: adenine-specific DNA methylase
Protein accession: YP_431111
Protein GI: 83591102
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
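
The coordinates and lengths in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2391373
end_bp = 2393628
reported_gene_length = 2256      # bp
reported_protein_length = 751    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the span, the gene length, and the protein length in the table agree with one another.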


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0186187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00483181
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTCGAA TATATTCCGG TAATGAAAAA TATTACCGGA GCGCTTATAT GACAATCGAG 
CGTAACTTTG ACATAGCCTT TGTGGCCGAC CTGGCCCTGC ATGAAAAGCA AATCCAGCAG
AATTATCGAC CCATTATTGC CGTACACAAA TGGTTTGCCC GCCGGCCAGG CACATTGTTT
CGCAGTCTAC TACTGGCAGA GTTTGCACAG GGCAATTTAG CCGACAATTA CTACCGTTCC
CATAATTTTC AAGGTCTTAA GGTGGCCGAT CCTTTTATGG GCGGGGGAAC ACCCCTTATT
GAGGCCAACC GTCTCGGCTG CCATATTCTG GGTTATGATA TTAATCCTAT GGCTTACTGG
ATTGTCCGCG AGGAGATTGA GCATCTGAAC TTGGAAGCTT ACCAGCAGGC AGCCCGGGAA
GTTGGATTTT TTTTAGAAGA AAAGGTAGGT CCTTTTTACC GGACCAGGTG TCCCATATGC
GGGCGCCAGG ATGCCCTGGT AAAATACTTC CTTTGGGTAA AGGTCCATCG CTGTCATAAT
TGCGGCCGGG AATTTGATCT CTTTCCAGGG TATGTTCTGG CGCAAAAGGG GCGTCACCCC
AAGGATGTCA TAATCTGTTC CACCTGCGGT AGCCTTAACG AAGTCGGGGA TAAAAGAAAT
CCCGGCCACT GCCATAATTG CGGCGAAGAA TTAAAAACAA AAGGCCCGGC TGGTCGTAAC
CAGTGCCCTT GTCCTCATTG TGGCGTTAGA AATTCTTATC CTGACCCGGA AAGTGGCCCT
CCCGGGCACC GGATGGTGGC CATTGAATAT CACTGCTCCT ATTGCAAGCC TGAGCACCGG
GGTCGCTTTT TTAAAAAACC GGATGCTGAT GATCTGGCCA AATTCGCAAC CGCTGTTGGC
ACATGGGAGG CGCTCCAGCC GCAGTTTGTT CCCGAAGAAA AAATACCCGC TGGCGATGAG
ACAAACCGGC TCCACCGGTG GGGTTACCGC TATTACCGGG AGATGTTTAA CGAACGCCAG
CTCCTTGGTT TGGAGTTGCT CGCCCGGAAA ATAAGCCAGC AGCCGGATGA ACGCATTAAA
AACGCCCTGG CCACCAATCT TTCCGATCTG CTGCGCTATC AAAACATGCT TTGTCGTTAC
GACCCCTATG CCCTGAAATC CCTGGATATT TTCTCTGTTC ACGGTTTCCC CGTCGGCTTA
ATCCAGTGCG AATCCAACAT GTTGGGGATA CCTGGGGGAA AAACGGGTCT AAATATTGGC
AGTGGCGGCT GGACCAATAT TGTCGACAAA TATTTAAAGG CCAAACACTA TTGCCAGTGG
CCCTTTGAAA TAAGGCATGT GAATGGCCGT AAACGGCAGT TATGGATCAA AGGAGAATGG
ATAGGGGAGC GCCGCCAAGG GATGACGCAA CAGCGGGAGG TAGATTTAAG GTGCGCCAGT
GCCACCACGG CCTTTCTGAA ACCATCTTCC CTGGATGCCG TTCTTACCGA CCCGCCTTAT
TTTGCCAACG TCCAGTATGC CGAACTCATG GATTTTTGTT ATGTATGGCT GCGGCGCCTG
GTGGGGGCTA GTAACCCGGT ATTTACCCCA CGGACAACCC GTAATCCTGA AGAACTCACA
GGTAATACTA CCATGTCCAG GGGGATCGAT GATTTTACCG GGGGGTTGAG CCGGGTTTTT
TCTAATATGG CCGCAGCCTT AAAACCGGGC GCCCCCTTTG TCTTTACTTA TCACCATAAC
AGGCTTGAAG CTTATTACCC TGTTGCCGTG GCCCTGCTAG ATGCCGGTCT GGCCTGTACA
GCCACCTTGC CCTGTCCGGC GGAAATGGCA GCCTCCATCC ATATCAACGG TACCGGTTCT
TCAATTATAG ATACGGTTTT CGTTTGCCGG ACTACAGGTG TAGTTTCACG GCGCCTGCTG
GTGAAAGAAC CGGAGCAAAT TGCGGCCTTA ATCATAAAGG AACTGGAGGA ACTGGAAAAG
GGCGGAGTGC CTGTGACGAG GGGAGATACT CGTTGCATTA TTTACGGCCA TTTAATCCGG
CTGGCCGTAT GGTATTTACG GGCAACATGG GATAAAAATC TAAGTTGGGA TAAAAAGTTC
GCCTTGATTG CCAGGATGAT TGATGAACTG GGCGGCGCCG GTGCTATCGA GACATATTTA
GAGGAGAACG GAGTACAGCT GAAGACGAGG CGCGAAACCA TCGTGTGTGA AGGTGAATCT
GAATATGGAG CTGGCGGTGA TGAAGTATCC TTTTGA
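
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before it can be translated or measured. The sketch below is one way to do that in plain Python. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `join_blocks`, `translate`, and `gc_fraction` are hypothetical, and only the first three blocks and the last line of the listing are used as sample input.

```python
# Codon table built in the conventional TCAG order.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def join_blocks(text):
    """Remove the spaces and line breaks of the blocked listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Sample input copied from the listing: first three blocks and the last line.
first_blocks = "ATGCGTCGAA TATATTCCGG TAATGAAAAA"
last_line = "GAATATGGAG CTGGCGGTGA TGAAGTATCC TTTTGA"

print(translate(join_blocks(first_blocks)))
print(translate(join_blocks(last_line)))
```

The two printed fragments can be compared with the start and end of the protein sequence in this record. Passing the whole joined gene sequence to `gc_fraction` would give a value to compare with the GC content reported in the Gene Information table.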
 
Protein sequence
MRRIYSGNEK YYRSAYMTIE RNFDIAFVAD LALHEKQIQQ NYRPIIAVHK WFARRPGTLF 
RSLLLAEFAQ GNLADNYYRS HNFQGLKVAD PFMGGGTPLI EANRLGCHIL GYDINPMAYW
IVREEIEHLN LEAYQQAARE VGFFLEEKVG PFYRTRCPIC GRQDALVKYF LWVKVHRCHN
CGREFDLFPG YVLAQKGRHP KDVIICSTCG SLNEVGDKRN PGHCHNCGEE LKTKGPAGRN
QCPCPHCGVR NSYPDPESGP PGHRMVAIEY HCSYCKPEHR GRFFKKPDAD DLAKFATAVG
TWEALQPQFV PEEKIPAGDE TNRLHRWGYR YYREMFNERQ LLGLELLARK ISQQPDERIK
NALATNLSDL LRYQNMLCRY DPYALKSLDI FSVHGFPVGL IQCESNMLGI PGGKTGLNIG
SGGWTNIVDK YLKAKHYCQW PFEIRHVNGR KRQLWIKGEW IGERRQGMTQ QREVDLRCAS
ATTAFLKPSS LDAVLTDPPY FANVQYAELM DFCYVWLRRL VGASNPVFTP RTTRNPEELT
GNTTMSRGID DFTGGLSRVF SNMAAALKPG APFVFTYHHN RLEAYYPVAV ALLDAGLACT
ATLPCPAEMA ASIHINGTGS SIIDTVFVCR TTGVVSRRLL VKEPEQIAAL IIKELEELEK
GGVPVTRGDT RCIIYGHLIR LAVWYLRATW DKNLSWDKKF ALIARMIDEL GGAGAIETYL
EENGVQLKTR RETIVCEGES EYGAGGDEVS F
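
The protein listing can be checked against the reported protein length by counting residues. This is a small sketch, assuming the layout seen above: 12 full lines of six 10-residue blocks followed by one shorter final line. The constant names are illustrative only.

```python
# Layout of the protein listing as printed in this record (assumed regular).
BLOCK_SIZE = 10            # residues per block
BLOCKS_PER_FULL_LINE = 6   # blocks on every line except the last
FULL_LINES = 12            # lines before the final, shorter line

# Final line copied from the listing.
last_line = "EENGVQLKTR RETIVCEGES EYGAGGDEVS F"
last_line_residues = len("".join(last_line.split()))

total_residues = FULL_LINES * BLOCKS_PER_FULL_LINE * BLOCK_SIZE + last_line_residues
print(total_residues)
```

The total can then be compared with the Protein Length field in the Gene Information table.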