Gene Moth_1672 details

Gene Information

Locus tag: Moth_1672
Symbol:
ID: 3831943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1708647
End bp: 1710197
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 50%
IMG OID: 637829597
Product: N-6 DNA methylase
Protein accession: YP_430517
Protein GI: 83590508
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
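
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1708647, 1710197)
print(length)                     # 1551, matching the Gene Length field
print(protein_length_aa(length))  # 516, matching the Protein Length field
```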


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.179457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.001187
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGAAA ACACCAATAT GGATCTTAGC ACCCTGGAAA ACTGGCTATG GGAGGCAGCC 
TGTGTAATTC GCGGTGCAGT TGATGCTCCC AAGTATAAGG ATTACATATT GCCCCTGATC
TTCCTAAAAC GCCTGTCAGA CGTATTTGAA GATGAAATAG CCAGGCTGGC CGAAGAGATA
TTTGATAGTA TAGAGGAAGC CCTGAAACAG GTTGAGGAAG ACCATGCCCT GGTGCGTTTT
TATATCCCTC CCCAGGCCCG CTGGGATGCT ATTTCCCGGC AGACTACCAA CATAGGCGAA
TACCTTACCA GTGCCGTGCG GGCTGTAGCC CGGGAGAATC CTAAACTGCA CGGCATCTTT
GAGAACATTG ACTTCAACGC CCAGATGGCC GGCCAGCCGG TTATTGATAA CGACCGTCTC
TATAACCTGA TTCAAGTCCT TTCCCGCCAT CGTTTAGGGT TAAAAGATGT AGAAGTAGAC
ATCCTGGGCC GGGCTTATGA ATACCTGCTG CGCAAATTCG CCGAGGGCCA GGGCCAGAGT
GCCGGCGAAT TCTATACCCC CCGGGAAGTT ACCTGGCTAA TGGCTTATCT ACTGGAGCCC
CGACCAGGAG ATGAGATCTA TGACCCGGCC TGCGGTTCCG GCGGCCTGTT GATCAAAAGC
GTATTAGCTC TTAAGGAAAC TTATGGTGAT GACCCTAGAA TAGCACCGGT TAAGATTTAT
GGTCAGGAAA TCCTTTATAC CACCTTCGCT ATGGCCAAAA TGAACGCCTT TATCCATGAC
CTGGAGGCTG ATATTCGCCT GGGCGATACA ATGGCCCGGC CGGCCTTTAC CAATCCCGAC
GGTTCTTTGC GTACCTTTGA TAAGGTCACT GCTAATCCCA TGTGGAACCA GAAATTCCCC
CTTCCCCTAT ATGAAGAAGA CCCCTTTGAT CGGTTTAAGT TTGGCGGCAT TCCGCCGGCA
TCAAGCGCTG ACTGGGGCTG GATCCAGCAT ATGTTTGCCT CCCTGAAAGA AGGGGGCAAA
ATGGCCGTGG TCCTGGATAC CGGTTCAGTC TCCCGGGGCA GCGGTAACCA GGGCTCCAAC
CGGGAGAGGG ACATTCGTAA AGTCTTTGTT GAAAACGACC TGGTAGAGTG CGTTATCTTG
TTGCCGGAAA ATATGTTCTA TAATACTACC GCCCCGGGTA TAATCATGGT CATAAATAAG
GCCAAAAAGC ATCCAGCGGA GATTTTATTA ATCAACGCCT CTAAACTGTT CACCAAAGGG
CGCCCCAAGA ATTACATGGA AGATGAGCAT ATAAAGCAGG TCTATAGCAT CTACCGGGAA
TGGCGGGAAG AAGAGGGATT AAGCAAAATA ATTCCAGTAG AAGAAGCAGC CCGCAATGAC
TACAATCTTA GCCCTTCTCG CTATGTATCT ATTAATGGCA AAGAAGAATA CCGGCCCATA
GAGGAAATAT TGGTCGAGCT GGCCGAGGTC GAGGAAGAAC GGCAAGCCGT AGATAAGGAA
CTGAATGATA TCCTGGGTAA GTTGGGTTTC GGAGGCTGGC TGAATGGGTA A
 
Protein sequence
MTENTNMDLS TLENWLWEAA CVIRGAVDAP KYKDYILPLI FLKRLSDVFE DEIARLAEEI 
FDSIEEALKQ VEEDHALVRF YIPPQARWDA ISRQTTNIGE YLTSAVRAVA RENPKLHGIF
ENIDFNAQMA GQPVIDNDRL YNLIQVLSRH RLGLKDVEVD ILGRAYEYLL RKFAEGQGQS
AGEFYTPREV TWLMAYLLEP RPGDEIYDPA CGSGGLLIKS VLALKETYGD DPRIAPVKIY
GQEILYTTFA MAKMNAFIHD LEADIRLGDT MARPAFTNPD GSLRTFDKVT ANPMWNQKFP
LPLYEEDPFD RFKFGGIPPA SSADWGWIQH MFASLKEGGK MAVVLDTGSV SRGSGNQGSN
RERDIRKVFV ENDLVECVIL LPENMFYNTT APGIIMVINK AKKHPAEILL INASKLFTKG
RPKNYMEDEH IKQVYSIYRE WREEEGLSKI IPVEEAARND YNLSPSRYVS INGKEEYRPI
EEILVELAEV EEERQAVDKE LNDILGKLGF GGWLNG
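
The two sequences above can be cross-checked by translating the gene sequence and comparing the result with the protein sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted as text in the 10-character blocks shown above, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Amino acids for NCBI translation table 11, with codons enumerated in
# T, C, A, G order at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGACGGAAA ACACCAATAT GGATCTTAGC ACCCTGGAAA ACTGGCTATG GGAGGCAGCC"
print(translate(first_line))  # MTENTNMDLSTLENWLWEAA
```

The first 60 bases of the gene sequence translate to MTENTNMDLS TLENWLWEAA, which matches the start of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence extends the check to the whole record.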