Gene Tmz1t_3989 details

Gene Information

Locus tag: Tmz1t_3989
Symbol:
ID: 7873635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4386283
End bp: 4387353
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 68%
IMG OID: 643700926
Product: N-6 DNA methylase
Protein accession: YP_002890949
Protein GI: 237654635
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
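
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4386283, 4387353)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields of this record.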


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAGC AAGCCCTCTC CGCCCTCATC TGGTCGGTCG CCGACCTCTT GCGCGGCGAC 
TTCAAGCAGT CCGAATACGG CCGCGTGATC CTGCCCTTCA CCGTGCTGCG CCGGCTCGAC
TGCGTGCTGG CGCCGACCAA GGCCGCGGTG CTCGTCGAAC ACCGCGACAA GGAGCAGGCC
GGGCTGCTCT ACCTGGTGGT GGAAAAGTTC GCCCACATCG AGCCCCACCC CAGGCGCGTC
GACAACGTGC ACATGGGCCT GGTCTTCGAG GAGCTGATCC GCAAGTTCGC CGAGATCTCC
AACGAGACCG CCGGCGAGCA CTTCACCCCG CGCGAGCTCA TCCGCCTGAT GGTGAGCCCG
CTCTTCATCG AGGACGACGA GGCGCTGTCC AAGCCCGGCA TCGTGCGCAC CATCTACGAC
CCCACCGCCG GCACCGGCAC CGGCCGCATG CTGTCGGTGG CGGGCGAGCA CCTGCACGAG
ATCAAGCCCG GCGCGCGCCT CACCATGTTC GGCCAGGAGC TCAACCCCGA GTCCTACGCC
ATCTGCAAGG CCGACATGCT GATCAAGGGC CAGGACGTGC GCAGCATCGT GCTCGGCAAC
ACGCTGTCCG AGACCCACAT CGGCGAGATC ACCCGCCTGC TCGGCGAATT CCTCGAAGCC
GAGCAGGCGG TGGTGAGCGA CGCCCAGGGC AAGGAGCTCG CGCGCGTGAC CCTCTTCCCC
GAGGTGCGCT GCCCGGCCGC GCCCGCGGGC GGCAAGGTCA AGCGTGTGCC CATCGCCCGC
GTCTTCCGCA ACCAGGACTT CGGCTACCGC ACGATCACCA TCGAGCGCCC GCTGCGCGAC
GCCGAGAACG TGCCGCTGTT CGAGGACGTG CAGGCCTGGT TCGAGCGCGA GGTGCTGTCC
CACGCCCCCG ACGCCTGGAT CGACCACGAC AAGACCCGGA TCGGCTATGA GATCCCCTTG
AACCGCCACT TCTACGTTTT CGAGCCGCCG CGGCCGCTGG CGGAGATCGA CGCCGACCTG
AAGCGCTCGA TGGACCGGAT CAAGCAGATG ATCGAGGGGC TGGCGGGATG A
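
A GC content figure like the one reported for this gene can be recomputed from the gene sequence. The sketch below is one plausible way to do so, assuming GC content is the percentage of G and C among the A, C, G and T bases and that the spaces and line breaks in the listing are only formatting. The helper name `gc_percent` is hypothetical, and the database may round or compute its value differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Keep only nucleotide letters so spaces and newlines are ignored.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, copied with the space kept.
print(gc_percent("ATGAACCAGC AAGCCCTCTC"))  # 55.0
```

Passing the full 1071-base sequence in place of the short excerpt gives the value for the whole gene.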
 
Protein sequence
MNQQALSALI WSVADLLRGD FKQSEYGRVI LPFTVLRRLD CVLAPTKAAV LVEHRDKEQA 
GLLYLVVEKF AHIEPHPRRV DNVHMGLVFE ELIRKFAEIS NETAGEHFTP RELIRLMVSP
LFIEDDEALS KPGIVRTIYD PTAGTGTGRM LSVAGEHLHE IKPGARLTMF GQELNPESYA
ICKADMLIKG QDVRSIVLGN TLSETHIGEI TRLLGEFLEA EQAVVSDAQG KELARVTLFP
EVRCPAAPAG GKVKRVPIAR VFRNQDFGYR TITIERPLRD AENVPLFEDV QAWFEREVLS
HAPDAWIDHD KTRIGYEIPL NRHFYVFEPP RPLAEIDADL KRSMDRIKQM IEGLAG
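
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that mapping with a hand-built codon table. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted, so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical, and this is not the database's own pipeline.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# paired with the standard amino acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAACCAGC AAGCCCTCTC CGCCCTCATC"))  # MNQQALSALI
```

The output matches the first ten residues of the protein sequence. The sketch does not handle alternative start codons, which table 11 translates as methionine at the first position; this gene begins with ATG, so that case does not arise here.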