Gene Moth_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0639 
Symbol 
ID3832035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp665290 
End bp667848 
Gene Length2559 bp 
Protein Length852 aa 
Translation table11 
GC content55% 
IMG OID637828580 
ProductDNA methylase N-4/N-6 
Protein accessionYP_429510 
Protein GI83589501 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.028356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTACG AAACCGGGTC ACTGTTTGAA GCGAAAAATG ATCCAAAGCT GGAGCCGGAG 
GACGATTTCG CCCGCTCCTG CCCGCCCCTT TCCGAGCTGC TGGATACGGT GCGCCACGTG
GAGGGCTTCC CCCTGGGCCG GGACGAAGAC ATCCTGGCCC TCTCCGACCC GCCCTACTAC
ACCGCTTGCC CCAACCCCTA CCTGGGCGAG TTCATCAGGC GCTACGGCAA ACCCTACGAT
GAGGCCACCG ATACCTACCA GCGCCCGCCC TTGGTGGCCG ACGTGACCGA GGGGAAGAAC
GACCCTGTTT ACAACGCCCA CTCCTACCAC ACCAAAGTGC CCCACAAGGC CATTATGAAA
TACATCGAGC ACTACACCGA ACCGGGCGAT ATCGTTTTCG ACGGCTTCTG CGGCAGCGGT
ATGACCGGCG TGGCGGCGCA GCTTCTGGGC CGGCGGGCCA TCCTCTGCGA CCTCTCGCCG
GCCGCCACCT TTATCGCCTA TAATTACAAC ACCCCCGTGG ACGTGGCCGC GTTCGAGCGG
GAGGCGAAAA GGATCCTCGC CGAGGTAGAG AAGGAATGCG GCTGGATGTA TGAGACCCTC
CACACCGACG GCCGCACCAA TGGGCGGATC AACTATACTG TCTGGTCCGA TGTCTTCATC
TGTCCCTACT GCGGCAGCGA ATACGTCTTC TGGGAGGCGG CGGTAGACAA GGAACGGGGC
AAAGTGCTTA ATGAATACCC CTGTCCCTCC TGCGGCGCCA AAGTAGCCAA GCGTGAGTGC
AAGCGGGCAT GGGTGGCCTT CTACGACCAG GTCATAGGCC AGGAAGTGAC CCAGGCCAAA
CAGGTGCCGG TTCTGATCAA CTATACGGTG GGCAAGAAGC GGTACGAAAA GAAGCCGGAC
CAGTACGACC TGGACCTGAT CCGCAGGATC GAGGAGAGCG CCATCCCCTA CTGGTTCCCG
ACGGATCGGA TGCTAGAGGG GGATAAGACT GTAGAACCCA TACGCCTAGG TATTACCCAT
GTGCACCATT TTTATACTAA AAGGAATTTA TGGGTGCTGG CAGCACTAAA CAGTAAAATT
TCTGCAGCAA AGGGAACCGG TGTCGCCGCT GTTCTCAGAT TTCTAATCTC TTCTTACAAC
CATACTCATA GCACTAAAAT GACGAGGATA ATATTTAAGG ATACCGGCAA ACCGGTGCTA
ACAAGTTGCC AGTCGGGTAC GTTGTATATA AGTTCTCTTC CTGTTGAAAA AAACATCTTA
CAGGGCTTAA CAAAAATGAA GCTGGCGCTT ATAAGTAGAG CGTTAAAGGC CTTGAGTTAT
GAGCAAGCAA TCACTACTTC CTCTTCCACC GGGTTTTCAT TAATAAGTAT TTATAATAAC
TGCATAGATT ACATTTTTAC TGATCCGCCC TTCGGTAGCA ACCTGATGTA CTCTGAGCTC
AACTTCCTCT GGGAGGCTTG GTTGAGGGTA TTCACCAACA ACAGGCCTGA AGCCATTATC
AATGAAACCC AGGGTAAGGG CCTGCCCGAG TATAAGGAGC TCATGACCGC CTGCTTCAAG
GAGATGTACC GCCTCCTCAA GCCCAACCGC TGGATGACTG TAGTCTTCCA CAATTCCAGG
GCTGCGGTGT GGAACGCCAT CCAGGAGGCC ATCACCAGGG CCGGCTTCGT CATTGCTCAA
GTGACGGTTA TGGATAGGAA GCAGGGTAGC TTTAACCAAG TAACTGCTGC TGGTGCAGTA
GAAAAAGATT TGATCATAAA TGCCTACAAG CCAAAGAAAC AGATGGAGGA AAACTTCCTG
CGCCGGGCCG GGGCGGGGCT GGAGCGGGAC TTCGTGGCCG ACCTTTTGGA GCACCTTCCC
GTGGTGCCCA ACGTGGGCCG CACCGAGAAG ATGCTCTATT CCAAGCTGCT GGCCTATTAC
GTACAGCGCG GTTTTGAAAT TCGCCTGGAT GCCAGGCAGT TCTATGTCCT GCTTAAAGAG
AATTTCAAGC TCATTGACGG TTACTGGTTC ACCGACGAGC AGGTTTTACA ATACGAGGAA
TGGAAGAGGA AACAGAGACT GGATGGCATC AAGGAGGTGC AGTCCGGCCA GCAGATCTTC
TTTGTGAGCG ACGAGCGCTC GGCTCTGGCC TGGCTTTATA ACTTCCTGGA AACACCCAGG
ACCTATACCG ACATTTACAC CGCCTATAGC CGGGCTCTTG TGCAAAGCGA CGATGCCATA
CCGGAGCTGA AGGAGCTTTT GGATAACAAC TTTATCCTGG AGAACGGCCG CTACCGCCGC
CCGCAGACGG AGCAGGAAAG AGAGGCCATC GAGGCCCAGC GGGAAAGGGA ATTGGGCCGG
GCCTTTGAGC GGCTCCTCGC CGAGGCCAGG AGTGGGGTTA AGAGGCTGAA GGGAGTACGC
AAGGAGGCGC TCATCTTCGG CTTTACCAAG GCCTACCGGG AGAAGCGTTA CCAGGACATC
CTGGCCGTGG CCAGGAAACT GGACCAGGAC TTCTTGGAGA CAAACGGCGA GATCAACGAC
TTTGTAGAAA TCGCCAGGCT GAAAACGGGA GAGGAATAA
 
Protein sequence
MDYETGSLFE AKNDPKLEPE DDFARSCPPL SELLDTVRHV EGFPLGRDED ILALSDPPYY 
TACPNPYLGE FIRRYGKPYD EATDTYQRPP LVADVTEGKN DPVYNAHSYH TKVPHKAIMK
YIEHYTEPGD IVFDGFCGSG MTGVAAQLLG RRAILCDLSP AATFIAYNYN TPVDVAAFER
EAKRILAEVE KECGWMYETL HTDGRTNGRI NYTVWSDVFI CPYCGSEYVF WEAAVDKERG
KVLNEYPCPS CGAKVAKREC KRAWVAFYDQ VIGQEVTQAK QVPVLINYTV GKKRYEKKPD
QYDLDLIRRI EESAIPYWFP TDRMLEGDKT VEPIRLGITH VHHFYTKRNL WVLAALNSKI
SAAKGTGVAA VLRFLISSYN HTHSTKMTRI IFKDTGKPVL TSCQSGTLYI SSLPVEKNIL
QGLTKMKLAL ISRALKALSY EQAITTSSST GFSLISIYNN CIDYIFTDPP FGSNLMYSEL
NFLWEAWLRV FTNNRPEAII NETQGKGLPE YKELMTACFK EMYRLLKPNR WMTVVFHNSR
AAVWNAIQEA ITRAGFVIAQ VTVMDRKQGS FNQVTAAGAV EKDLIINAYK PKKQMEENFL
RRAGAGLERD FVADLLEHLP VVPNVGRTEK MLYSKLLAYY VQRGFEIRLD ARQFYVLLKE
NFKLIDGYWF TDEQVLQYEE WKRKQRLDGI KEVQSGQQIF FVSDERSALA WLYNFLETPR
TYTDIYTAYS RALVQSDDAI PELKELLDNN FILENGRYRR PQTEQEREAI EAQRERELGR
AFERLLAEAR SGVKRLKGVR KEALIFGFTK AYREKRYQDI LAVARKLDQD FLETNGEIND
FVEIARLKTG EE