Gene Tmz1t_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0004 
Symbol 
ID7085102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp5709 
End bp7262 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content63% 
IMG OID643697054 
ProductSite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_002353703 
Protein GI217968469 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTCG ACCTCAACCA GCTTGAAACC CGCCTGTGGG CGGCAGCAGA CCAACTCTGG 
GCCAACACCG GCCTGAAGCC CTCGGAGTTC TCCAACCCGG TCCTCGGCCT GATCTTCCTG
CGCTACGCCG AGAAGCGGTT CCATGAGGCC GAGGCCAAGC TGATCGAGAG CGGGCTGGGT
GTGTCCGAGA TCGAGAAGTT CGACTATCAG GCCGAAGGGG CGCTGTACCT GCCCGACAAC
GCCCACTTCT CCTACCTACT CGACCTCGCC GAAGGTCAGG ATCTGGGCAA GGCGGTCAAC
GAGGCGATGG CGGCGGTCGA GGCCGAGAAC GAGGAGCTGA AGGGCGTTCT GCCGCGCTCC
TACGGAAGGC TCCCCAATAC CGTCCTGGTC GAGCTGCTGC GGGTGCTGAA CGGGCTGGGT
GAAGTCGAGG GCGATGCCTT CGGCAAGATC TACGAATACT TCCTCGGCAA GTTCGCCCTG
GCGGAAGGCC AGAAGGGCGG CGTGTTCTAC ACCCCGACCA GTATCGTCAA ACTGATCGTC
GAGATCATCG AGCCCTTCCA CGGCAAGATC TTCGACCCCG CGTGTGGCTC GGGCGGCATG
TTCGTGCAGA GCGCCCAGTT CGTCAGCCGC CACCAGAAGC GGGCTGCCGA GGAGCTGACC
GTGTACGGCA CCGAGAAGGC GAACGACACC GTCAAGCTCG CCAAGATGAA CCTCGCGGTG
CATGGCCTCT CGGGCGACAT CCGCGAATCG AACACCTACT ACGAAGACCC GCACAAGGCC
GTCGTCGGCA ACACCGGCAA GTTCGACTTC GTGATGGCGA ACCCGCCGTT CAACGTCTCG
GGCGTGGACA AGGAACGGGT CAAGGACGAC CCCCGCTTCC CCTTCGGGAT CCCGACCACC
GACAACGCCA ACTACCTCTG GATCCAGCAC TTCTACACCG CGCTGAACGA GCGCGGCCGT
GCCGGTTTCG TCATGGCCAA CTCGGCGGGT GATGCGCGGG GCACCGAGCT GGAGATCCGC
AAGAAGCTGA TTCAGACCGG CGGCGTGGAT GTGATCGTCT CGGTTGGCTC CAACTTCTTC
TACACCGTCA CCCTGCCGTG CACGCTGTGG TTCTTCGACA GGGCAAAGGC CAAGGGCGAG
CGCAAGGATG AGGTGCTGTT CATCGATGCG CGCGGCACCT ACCGGCAGGT CAGCCGGGCG
ATCCGCGACT TCCTGCCCGA GCAGATCGAG TTTCTGGCCA ACATCGTGCG GCTATGGCGC
GGCGAGGCGG TGGAGATCGA GGCGGGAAGC CAGGAGATGC TCCGGCAGCA GTTTCCGGAG
GGAGGCTATC GGGACATCGC CGGGGTGTGC AAGGTGGCGA CGCTGGCGGA GATCGAGGCG
CAGGGGTGGA GCCTGAATCC GGGGCGGTAT GTGGGGGTGG CTGATCGAGG CGCTGATGAC
TTCGACTTTG CGGAAAAATT CGAGGCTCTT GCTGAGGAGC TTGAGCGGCT GAATGGCGAG
GCAGATCAGC TTCAAGAGCA AATCAGTAGC CAAGCAGTGG CTTTACTCTC TTGA
 
Protein sequence
MSLDLNQLET RLWAAADQLW ANTGLKPSEF SNPVLGLIFL RYAEKRFHEA EAKLIESGLG 
VSEIEKFDYQ AEGALYLPDN AHFSYLLDLA EGQDLGKAVN EAMAAVEAEN EELKGVLPRS
YGRLPNTVLV ELLRVLNGLG EVEGDAFGKI YEYFLGKFAL AEGQKGGVFY TPTSIVKLIV
EIIEPFHGKI FDPACGSGGM FVQSAQFVSR HQKRAAEELT VYGTEKANDT VKLAKMNLAV
HGLSGDIRES NTYYEDPHKA VVGNTGKFDF VMANPPFNVS GVDKERVKDD PRFPFGIPTT
DNANYLWIQH FYTALNERGR AGFVMANSAG DARGTELEIR KKLIQTGGVD VIVSVGSNFF
YTVTLPCTLW FFDRAKAKGE RKDEVLFIDA RGTYRQVSRA IRDFLPEQIE FLANIVRLWR
GEAVEIEAGS QEMLRQQFPE GGYRDIAGVC KVATLAEIEA QGWSLNPGRY VGVADRGADD
FDFAEKFEAL AEELERLNGE ADQLQEQISS QAVALLS