Gene Information |
Locus tag | Tmz1t_3154 |
Symbol | |
ID | 7874296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3414268 |
End bp | 3415794 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700084 |
Product | Site-specific DNA-methyltransferase (adenine-specific) |
Protein accession | YP_002890128 |
Protein GI | 237653814 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
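
The coordinate and length fields above agree with each other, which simple arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3414268, 3415794)
print(length, protein_length_aa(length))  # prints: 1527 508
```

Under these assumptions the results match the listed Gene Length (1527 bp) and Protein Length (508 aa).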
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCAGGC GCATCACCCA GCAAGAACTC GAAAGCTACC TCTGGGGCGC GGCCGTGTTG CTGCGCGGGC TGATCGACGC CGGCGACTAC AAGCAGTTCA TCTTCCCGCT GCTGTTCTTC AAGCGGGTCT CCGACGTCTG GGACGAGGAG TACGAGGTCG CGCTGGCCGA GTCCGATGGC GACCTGTCCT ATGCCAAATT CGCCGAGAAC CATCGCTTCC AGATCCCCGC GGGCGCGCAC TGGAACGACG TGCGCCAGAC GCCGCGCAAC GTCGGCGCCG CCATCCAGCA GGCGATGCGG GCGATCGAGT CTGCCAACCC GGACCTGCTC GACGGCATTT TCGGCGACGC GCCGTGGACC AATCGCGAGC GCCTGCCCGA CGAAACGCTC AAGAACCTTA TCGAGCACTT CTCCACGCAG ACGCTCTCGG TCGCCAACGT GCCCGAGGAC GAGCTCGGCA ACGCCTACGA ATACCTCATC AAGAAGTTCG CCGACGACTC CGGCCACACC GCGGCCGAGT TCTACACCAA CCGCACCGTC GTCCACCTGA TGACGCAGCT TCTCGCCCCG CAGGCGGACG AGTCCATCTA CGACCCCACC TGCGGCACCG GCGGCATGCT GATCTCCGCG CTGGACGAGG TGAAGCGCTC GGGCGGCGAA TACCGCACGC TCAAGCTCTA CGGCCAGGAG CGCAACCTGA TCACCTCGTC GATCGCGCGC ATGAACCTCT TCCTGCACGG CGTCGAGGAC TTTCAGATCA TCCGCGGCGA CACCCTCGCC GAGCCGCGCC ACATCGAAGG TGACCGGCTG CGCCGCTTCG ACGTCATCCT CGCCAACCCG CCGTACTCCA TCAAACAGTG GGACCGCGAG GCGTGGACGC AGGACAAGTG GGGCCGCAAC TTCCTCGGCA CCCCGCCGCA GGGGCGGGCG GACTACGCCT TCCAGCAGCA CATCCTCGGC AGCCTGTCCG ACCGCGGTCG CTGCGCCATC CTGTGGCCGC ACGGCGTGCT GTTCCGCAAC GAGGAACAGG CCATGCGCAG CAAGATGATC GAGCAGGACT GGGTGGAGGC GGTCGTCGGC CTCGGTCCCA ACCTGTTCTA CAACTCCCCC ATGGAGTCCT GCATCCTGAT CTGCAACCGG CGCAAGCCGG CCGAACGCCA GGGCAGGGTG CTGTTCATCG ACGCGGTGGG CGAGGTCACG CGCGAGCGCG CGCAGAGCTT CCTCAAGCCC GAGCACCAGC AGCGCATCCT CGGTGCCTTC AAGGCCTTCG CCGACGCGCC CGGCTTTGCC CGGGTGGCGA CGCTCGCCGA ACTGCACAAG AACGCCGGCA ATCTGTCGAT TCCGCTGTAC GTGAAGCGCC CGTCGGCCAG TGCCGCCGCC GCGGCGGCCG GCGGCGACCA GCCGGCCAGC CTCGCCGAGG CCTGGGACGC CTGGCAGGAG AGCGGGCGGG CGTTCTGGCA GCAGATGGAT GCGCTGGTCG AGACGCTGGA CGGGCTCGGC GTGGCGAAGG AGGCAAGCGA TGCGTAG
Protein sequence | MSRRITQQEL ESYLWGAAVL LRGLIDAGDY KQFIFPLLFF KRVSDVWDEE YEVALAESDG DLSYAKFAEN HRFQIPAGAH WNDVRQTPRN VGAAIQQAMR AIESANPDLL DGIFGDAPWT NRERLPDETL KNLIEHFSTQ TLSVANVPED ELGNAYEYLI KKFADDSGHT AAEFYTNRTV VHLMTQLLAP QADESIYDPT CGTGGMLISA LDEVKRSGGE YRTLKLYGQE RNLITSSIAR MNLFLHGVED FQIIRGDTLA EPRHIEGDRL RRFDVILANP PYSIKQWDRE AWTQDKWGRN FLGTPPQGRA DYAFQQHILG SLSDRGRCAI LWPHGVLFRN EEQAMRSKMI EQDWVEAVVG LGPNLFYNSP MESCILICNR RKPAERQGRV LFIDAVGEVT RERAQSFLKP EHQQRILGAF KAFADAPGFA RVATLAELHK NAGNLSIPLY VKRPSASAAA AAAGGDQPAS LAEAWDAWQE SGRAFWQQMD ALVETLDGLG VAKEASDA
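
The sequences in this record are printed in space-separated blocks of ten characters. The following is a minimal sketch of how one might strip that grouping and compute the GC percentage of a nucleotide string. The helper names are hypothetical, and the record does not say how its 67% figure was computed or rounded, so an exact match is not guaranteed.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group residues and normalize case."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence in this record (20 bases, 12 G/C).
print(gc_percent("ATGAGCAGGC GCATCACCCA"))  # prints: 60.0
```

Passing the full Gene sequence field to `gc_percent` gives the value to compare against the GC content field.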