Gene Tmz1t_3154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3154 
Symbol 
ID7874296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3414268 
End bp3415794 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content67% 
IMG OID643700084 
ProductSite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_002890128 
Protein GI237653814 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGGC GCATCACCCA GCAAGAACTC GAAAGCTACC TCTGGGGCGC GGCCGTGTTG 
CTGCGCGGGC TGATCGACGC CGGCGACTAC AAGCAGTTCA TCTTCCCGCT GCTGTTCTTC
AAGCGGGTCT CCGACGTCTG GGACGAGGAG TACGAGGTCG CGCTGGCCGA GTCCGATGGC
GACCTGTCCT ATGCCAAATT CGCCGAGAAC CATCGCTTCC AGATCCCCGC GGGCGCGCAC
TGGAACGACG TGCGCCAGAC GCCGCGCAAC GTCGGCGCCG CCATCCAGCA GGCGATGCGG
GCGATCGAGT CTGCCAACCC GGACCTGCTC GACGGCATTT TCGGCGACGC GCCGTGGACC
AATCGCGAGC GCCTGCCCGA CGAAACGCTC AAGAACCTTA TCGAGCACTT CTCCACGCAG
ACGCTCTCGG TCGCCAACGT GCCCGAGGAC GAGCTCGGCA ACGCCTACGA ATACCTCATC
AAGAAGTTCG CCGACGACTC CGGCCACACC GCGGCCGAGT TCTACACCAA CCGCACCGTC
GTCCACCTGA TGACGCAGCT TCTCGCCCCG CAGGCGGACG AGTCCATCTA CGACCCCACC
TGCGGCACCG GCGGCATGCT GATCTCCGCG CTGGACGAGG TGAAGCGCTC GGGCGGCGAA
TACCGCACGC TCAAGCTCTA CGGCCAGGAG CGCAACCTGA TCACCTCGTC GATCGCGCGC
ATGAACCTCT TCCTGCACGG CGTCGAGGAC TTTCAGATCA TCCGCGGCGA CACCCTCGCC
GAGCCGCGCC ACATCGAAGG TGACCGGCTG CGCCGCTTCG ACGTCATCCT CGCCAACCCG
CCGTACTCCA TCAAACAGTG GGACCGCGAG GCGTGGACGC AGGACAAGTG GGGCCGCAAC
TTCCTCGGCA CCCCGCCGCA GGGGCGGGCG GACTACGCCT TCCAGCAGCA CATCCTCGGC
AGCCTGTCCG ACCGCGGTCG CTGCGCCATC CTGTGGCCGC ACGGCGTGCT GTTCCGCAAC
GAGGAACAGG CCATGCGCAG CAAGATGATC GAGCAGGACT GGGTGGAGGC GGTCGTCGGC
CTCGGTCCCA ACCTGTTCTA CAACTCCCCC ATGGAGTCCT GCATCCTGAT CTGCAACCGG
CGCAAGCCGG CCGAACGCCA GGGCAGGGTG CTGTTCATCG ACGCGGTGGG CGAGGTCACG
CGCGAGCGCG CGCAGAGCTT CCTCAAGCCC GAGCACCAGC AGCGCATCCT CGGTGCCTTC
AAGGCCTTCG CCGACGCGCC CGGCTTTGCC CGGGTGGCGA CGCTCGCCGA ACTGCACAAG
AACGCCGGCA ATCTGTCGAT TCCGCTGTAC GTGAAGCGCC CGTCGGCCAG TGCCGCCGCC
GCGGCGGCCG GCGGCGACCA GCCGGCCAGC CTCGCCGAGG CCTGGGACGC CTGGCAGGAG
AGCGGGCGGG CGTTCTGGCA GCAGATGGAT GCGCTGGTCG AGACGCTGGA CGGGCTCGGC
GTGGCGAAGG AGGCAAGCGA TGCGTAG
 
Protein sequence
MSRRITQQEL ESYLWGAAVL LRGLIDAGDY KQFIFPLLFF KRVSDVWDEE YEVALAESDG 
DLSYAKFAEN HRFQIPAGAH WNDVRQTPRN VGAAIQQAMR AIESANPDLL DGIFGDAPWT
NRERLPDETL KNLIEHFSTQ TLSVANVPED ELGNAYEYLI KKFADDSGHT AAEFYTNRTV
VHLMTQLLAP QADESIYDPT CGTGGMLISA LDEVKRSGGE YRTLKLYGQE RNLITSSIAR
MNLFLHGVED FQIIRGDTLA EPRHIEGDRL RRFDVILANP PYSIKQWDRE AWTQDKWGRN
FLGTPPQGRA DYAFQQHILG SLSDRGRCAI LWPHGVLFRN EEQAMRSKMI EQDWVEAVVG
LGPNLFYNSP MESCILICNR RKPAERQGRV LFIDAVGEVT RERAQSFLKP EHQQRILGAF
KAFADAPGFA RVATLAELHK NAGNLSIPLY VKRPSASAAA AAAGGDQPAS LAEAWDAWQE
SGRAFWQQMD ALVETLDGLG VAKEASDA