Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0004 |
Symbol | |
ID | 7085102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 5709 |
End bp | 7262 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643697054 |
Product | Site-specific DNA-methyltransferase (adenine-specific) |
Protein accession | YP_002353703 |
Protein GI | 217968469 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTCG ACCTCAACCA GCTTGAAACC CGCCTGTGGG CGGCAGCAGA CCAACTCTGG GCCAACACCG GCCTGAAGCC CTCGGAGTTC TCCAACCCGG TCCTCGGCCT GATCTTCCTG CGCTACGCCG AGAAGCGGTT CCATGAGGCC GAGGCCAAGC TGATCGAGAG CGGGCTGGGT GTGTCCGAGA TCGAGAAGTT CGACTATCAG GCCGAAGGGG CGCTGTACCT GCCCGACAAC GCCCACTTCT CCTACCTACT CGACCTCGCC GAAGGTCAGG ATCTGGGCAA GGCGGTCAAC GAGGCGATGG CGGCGGTCGA GGCCGAGAAC GAGGAGCTGA AGGGCGTTCT GCCGCGCTCC TACGGAAGGC TCCCCAATAC CGTCCTGGTC GAGCTGCTGC GGGTGCTGAA CGGGCTGGGT GAAGTCGAGG GCGATGCCTT CGGCAAGATC TACGAATACT TCCTCGGCAA GTTCGCCCTG GCGGAAGGCC AGAAGGGCGG CGTGTTCTAC ACCCCGACCA GTATCGTCAA ACTGATCGTC GAGATCATCG AGCCCTTCCA CGGCAAGATC TTCGACCCCG CGTGTGGCTC GGGCGGCATG TTCGTGCAGA GCGCCCAGTT CGTCAGCCGC CACCAGAAGC GGGCTGCCGA GGAGCTGACC GTGTACGGCA CCGAGAAGGC GAACGACACC GTCAAGCTCG CCAAGATGAA CCTCGCGGTG CATGGCCTCT CGGGCGACAT CCGCGAATCG AACACCTACT ACGAAGACCC GCACAAGGCC GTCGTCGGCA ACACCGGCAA GTTCGACTTC GTGATGGCGA ACCCGCCGTT CAACGTCTCG GGCGTGGACA AGGAACGGGT CAAGGACGAC CCCCGCTTCC CCTTCGGGAT CCCGACCACC GACAACGCCA ACTACCTCTG GATCCAGCAC TTCTACACCG CGCTGAACGA GCGCGGCCGT GCCGGTTTCG TCATGGCCAA CTCGGCGGGT GATGCGCGGG GCACCGAGCT GGAGATCCGC AAGAAGCTGA TTCAGACCGG CGGCGTGGAT GTGATCGTCT CGGTTGGCTC CAACTTCTTC TACACCGTCA CCCTGCCGTG CACGCTGTGG TTCTTCGACA GGGCAAAGGC CAAGGGCGAG CGCAAGGATG AGGTGCTGTT CATCGATGCG CGCGGCACCT ACCGGCAGGT CAGCCGGGCG ATCCGCGACT TCCTGCCCGA GCAGATCGAG TTTCTGGCCA ACATCGTGCG GCTATGGCGC GGCGAGGCGG TGGAGATCGA GGCGGGAAGC CAGGAGATGC TCCGGCAGCA GTTTCCGGAG GGAGGCTATC GGGACATCGC CGGGGTGTGC AAGGTGGCGA CGCTGGCGGA GATCGAGGCG CAGGGGTGGA GCCTGAATCC GGGGCGGTAT GTGGGGGTGG CTGATCGAGG CGCTGATGAC TTCGACTTTG CGGAAAAATT CGAGGCTCTT GCTGAGGAGC TTGAGCGGCT GAATGGCGAG GCAGATCAGC TTCAAGAGCA AATCAGTAGC CAAGCAGTGG CTTTACTCTC TTGA
|
Protein sequence | MSLDLNQLET RLWAAADQLW ANTGLKPSEF SNPVLGLIFL RYAEKRFHEA EAKLIESGLG VSEIEKFDYQ AEGALYLPDN AHFSYLLDLA EGQDLGKAVN EAMAAVEAEN EELKGVLPRS YGRLPNTVLV ELLRVLNGLG EVEGDAFGKI YEYFLGKFAL AEGQKGGVFY TPTSIVKLIV EIIEPFHGKI FDPACGSGGM FVQSAQFVSR HQKRAAEELT VYGTEKANDT VKLAKMNLAV HGLSGDIRES NTYYEDPHKA VVGNTGKFDF VMANPPFNVS GVDKERVKDD PRFPFGIPTT DNANYLWIQH FYTALNERGR AGFVMANSAG DARGTELEIR KKLIQTGGVD VIVSVGSNFF YTVTLPCTLW FFDRAKAKGE RKDEVLFIDA RGTYRQVSRA IRDFLPEQIE FLANIVRLWR GEAVEIEAGS QEMLRQQFPE GGYRDIAGVC KVATLAEIEA QGWSLNPGRY VGVADRGADD FDFAEKFEAL AEELERLNGE ADQLQEQISS QAVALLS
|
| |