Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1672 |
Symbol | |
ID | 3831943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1708647 |
End bp | 1710197 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637829597 |
Product | N-6 DNA methylase |
Protein accession | YP_430517 |
Protein GI | 83590508 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.179457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.001187 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGAAA ACACCAATAT GGATCTTAGC ACCCTGGAAA ACTGGCTATG GGAGGCAGCC TGTGTAATTC GCGGTGCAGT TGATGCTCCC AAGTATAAGG ATTACATATT GCCCCTGATC TTCCTAAAAC GCCTGTCAGA CGTATTTGAA GATGAAATAG CCAGGCTGGC CGAAGAGATA TTTGATAGTA TAGAGGAAGC CCTGAAACAG GTTGAGGAAG ACCATGCCCT GGTGCGTTTT TATATCCCTC CCCAGGCCCG CTGGGATGCT ATTTCCCGGC AGACTACCAA CATAGGCGAA TACCTTACCA GTGCCGTGCG GGCTGTAGCC CGGGAGAATC CTAAACTGCA CGGCATCTTT GAGAACATTG ACTTCAACGC CCAGATGGCC GGCCAGCCGG TTATTGATAA CGACCGTCTC TATAACCTGA TTCAAGTCCT TTCCCGCCAT CGTTTAGGGT TAAAAGATGT AGAAGTAGAC ATCCTGGGCC GGGCTTATGA ATACCTGCTG CGCAAATTCG CCGAGGGCCA GGGCCAGAGT GCCGGCGAAT TCTATACCCC CCGGGAAGTT ACCTGGCTAA TGGCTTATCT ACTGGAGCCC CGACCAGGAG ATGAGATCTA TGACCCGGCC TGCGGTTCCG GCGGCCTGTT GATCAAAAGC GTATTAGCTC TTAAGGAAAC TTATGGTGAT GACCCTAGAA TAGCACCGGT TAAGATTTAT GGTCAGGAAA TCCTTTATAC CACCTTCGCT ATGGCCAAAA TGAACGCCTT TATCCATGAC CTGGAGGCTG ATATTCGCCT GGGCGATACA ATGGCCCGGC CGGCCTTTAC CAATCCCGAC GGTTCTTTGC GTACCTTTGA TAAGGTCACT GCTAATCCCA TGTGGAACCA GAAATTCCCC CTTCCCCTAT ATGAAGAAGA CCCCTTTGAT CGGTTTAAGT TTGGCGGCAT TCCGCCGGCA TCAAGCGCTG ACTGGGGCTG GATCCAGCAT ATGTTTGCCT CCCTGAAAGA AGGGGGCAAA ATGGCCGTGG TCCTGGATAC CGGTTCAGTC TCCCGGGGCA GCGGTAACCA GGGCTCCAAC CGGGAGAGGG ACATTCGTAA AGTCTTTGTT GAAAACGACC TGGTAGAGTG CGTTATCTTG TTGCCGGAAA ATATGTTCTA TAATACTACC GCCCCGGGTA TAATCATGGT CATAAATAAG GCCAAAAAGC ATCCAGCGGA GATTTTATTA ATCAACGCCT CTAAACTGTT CACCAAAGGG CGCCCCAAGA ATTACATGGA AGATGAGCAT ATAAAGCAGG TCTATAGCAT CTACCGGGAA TGGCGGGAAG AAGAGGGATT AAGCAAAATA ATTCCAGTAG AAGAAGCAGC CCGCAATGAC TACAATCTTA GCCCTTCTCG CTATGTATCT ATTAATGGCA AAGAAGAATA CCGGCCCATA GAGGAAATAT TGGTCGAGCT GGCCGAGGTC GAGGAAGAAC GGCAAGCCGT AGATAAGGAA CTGAATGATA TCCTGGGTAA GTTGGGTTTC GGAGGCTGGC TGAATGGGTA A
|
Protein sequence | MTENTNMDLS TLENWLWEAA CVIRGAVDAP KYKDYILPLI FLKRLSDVFE DEIARLAEEI FDSIEEALKQ VEEDHALVRF YIPPQARWDA ISRQTTNIGE YLTSAVRAVA RENPKLHGIF ENIDFNAQMA GQPVIDNDRL YNLIQVLSRH RLGLKDVEVD ILGRAYEYLL RKFAEGQGQS AGEFYTPREV TWLMAYLLEP RPGDEIYDPA CGSGGLLIKS VLALKETYGD DPRIAPVKIY GQEILYTTFA MAKMNAFIHD LEADIRLGDT MARPAFTNPD GSLRTFDKVT ANPMWNQKFP LPLYEEDPFD RFKFGGIPPA SSADWGWIQH MFASLKEGGK MAVVLDTGSV SRGSGNQGSN RERDIRKVFV ENDLVECVIL LPENMFYNTT APGIIMVINK AKKHPAEILL INASKLFTKG RPKNYMEDEH IKQVYSIYRE WREEEGLSKI IPVEEAARND YNLSPSRYVS INGKEEYRPI EEILVELAEV EEERQAVDKE LNDILGKLGF GGWLNG
|
| |