Gene Tmz1t_3155 details

Gene Information

Locus tag: Tmz1t_3155
Symbol:
ID: 7874297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3415791
End bp: 3417287
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 65%
IMG OID: 643700085
Product: N-6 DNA methylase
Protein accession: YP_002890129
Protein GI: 237653815
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
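
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 3415791
end_bp = 3417287

# Inclusive span: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1497
print(protein_length)  # 498
```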


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCATCAAC CCAGCATCAC GCTCGCCCAG CTCGAATCGC ACCTCTGGGA GTCGGCCAAC 
ATCCTCCGCG GGCCGGTCGA CGCGGCCGAT TTCAAGACCT ACATCTTTCC GCTGCTCTTC
TTCAAGCGCA TCTGCGACGT CTGGGATGAG GAGTATCAGG AGATCGTCGA CGAGACGGGC
GACGAGCAGC TCGCGTGGTT CCCCGAGTCG CACCGCTTCC AGATCCCCGA GGATTGCCAC
TGGAACGATG TCCGCGCCAA GGCTGCCAAC GTCGGCGCGG CGCTGCAGCA CGCGATGCGC
GAGATCGAGA AGGCCAACCC CGACACGCTC TATGGCGTGT TCGGCGATGC CCAGTGGTCG
AACAAGGAGC GCTTGTCCGA TGCGCTGCTC AAGGATCTCA TCGAGCACTT CTCGAAGCTG
CCGCTCGGCA ACGGCAACGT CACGTCCGAC CTGCTCGGCG ACGCATACGA ATACCTGATC
AAGAAGTTCG CAGACGCCAC GAACAAGAAG GCCGGCGAGT TCTATACGCC TCGCAGCGTG
GTGCGGCTGA TGATCGACAT GCTCGACCCG CGCGAAGGCG AGACCATCTA CGACCCTGCC
TGCGGCACCG GCGGCATGCT GCTGGCCGCG GTGCAGCACG TGCAGGAGAT GCACGGCGAC
GTGAAGCGCC TGTGGGGCAA GCTCTACGGG CAGGAGAAAA ACCTCACCAC CTCGTCCATC
GCGCGGATGA ACCTCTTCCT GCACGGCATC GAGGACTTCA AGATCGTGCG CGGCGACACG
CTGCGCAACC CCGCCTTCTT CGATGGCGAC CGCCTTTCCG CCTTCGACTG CGTCATCGCC
AACCCGCCGT TCTCGTTGGA GAAGTGGGGC GAGGATCTTT GGCTCAACGA CCCCTTCGGC
CGCAACTTCG CCGGCCTGCC GCCCTCTTCG AGCGGTGATT TCGCCTGGGT GCAGCACATG
GTCAAGTCCA TGGCCGACGG CACCGGCCGC ATGGCGGTCG TGCTGCCGCA AGGCGCGCTG
TTCCGCAAAA GCGCCGAAGG CGGCATCCGC CAGAAGCTGC TCAAGCTCGA CCTCATCGAA
GCGGTGATCG GTCTGGCGCC CAACCTGTTC TACGGCACCG GTCTGGCCGC GTGCATCCTG
GTGCTGCGCA AGAAGAAGCC CGCCGCGCGC CGGCGCAAGG TGTTGGTCGC CGACGCCTCT
CGCCTCTTCC GGCGGGGCAG GGCGCAGAAC TATCTTGAAG CCGAGCACGC CGCGCAGATC
CTCGGCTGGT ATCGCGACTT CCAGGACGTG CAGGACGCGG TGCGGGTCGT CGCACTCGAC
GAGATCGAGG CCGAGGACTG GACACTCAAC ATCTCGCGCT ACGTGCTGCC GCCGCTGCAG
GAAGACATCC CGCCGCTGCC CGAGGCGATC GCCGCCTTCA AGGACGCGCT CCAGCGCTGC
CGCGAGGCCG AAGAGCGCCT CGCCCAGGTC ATGGCGGAAG GGGGCTGGCT GAAATGA
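
The record reports a GC content of 65% for this gene. A minimal sketch of how such a figure could be recomputed from the sequence block is given here; `gc_percent` is a hypothetical helper, not a function of the source database. It ignores the spaces and line breaks used to group the bases in tens.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters so grouping spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First 30 bases of the gene sequence, copied with their grouping spaces.
first_30 = "TTGCATCAAC CCAGCATCAC GCTCGCCCAG"
print(gc_percent(first_30))
```

Passing the full gene sequence instead of the 30-base prefix gives the figure to compare against the reported value.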
 
Protein sequence
MHQPSITLAQ LESHLWESAN ILRGPVDAAD FKTYIFPLLF FKRICDVWDE EYQEIVDETG 
DEQLAWFPES HRFQIPEDCH WNDVRAKAAN VGAALQHAMR EIEKANPDTL YGVFGDAQWS
NKERLSDALL KDLIEHFSKL PLGNGNVTSD LLGDAYEYLI KKFADATNKK AGEFYTPRSV
VRLMIDMLDP REGETIYDPA CGTGGMLLAA VQHVQEMHGD VKRLWGKLYG QEKNLTTSSI
ARMNLFLHGI EDFKIVRGDT LRNPAFFDGD RLSAFDCVIA NPPFSLEKWG EDLWLNDPFG
RNFAGLPPSS SGDFAWVQHM VKSMADGTGR MAVVLPQGAL FRKSAEGGIR QKLLKLDLIE
AVIGLAPNLF YGTGLAACIL VLRKKKPAAR RRKVLVADAS RLFRRGRAQN YLEAEHAAQI
LGWYRDFQDV QDAVRVVALD EIEAEDWTLN ISRYVLPPLQ EDIPPLPEAI AAFKDALQRC
REAEERLAQV MAEGGWLK
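
The protein sequence corresponds to the gene sequence read under translation table 11, where the initial TTG codon acts as a start codon and is translated as M. The sketch below illustrates this on the first and last codons of the gene. `translate_cds` is a hypothetical helper; the amino-acid string and start-codon set are written out here on the assumption that they match NCBI genetic code 11.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for table 11; any of them is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop grouping spaces and newlines
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate_cds("TTGCATCAAC CCAGCATCAC GCTCGCCCAG"))  # MHQPSITLAQ
print(translate_cds("ATGGCGGAAG GGGGCTGGCT GAAATGA"))     # MAEGGWLK
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.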