Gene Tmz1t_0006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0006 
Symbol 
ID7085104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp8404 
End bp11610 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content63% 
IMG OID643697056 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002353705 
Protein GI237653093 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCG TTCTCACCGC CAAGGAAGTG ACTGCCGGTT ACGGCGAACT CGTTCTCGTC 
GAGTTGCCCG CGATGGACTG TTTTGCTTCC CTGGGGTGGG AGGTCGCCAA CCTCTACGAC
GAAACCTTCG GCGTCGACGG CACGGAAGGG CGCAAGTCCT CCGCCGAGGT CATCCTGGTG
CCGCGCTTGC GGCGGGCGCT GGAGCGCATC AATCCCGGCT ACCCAGCCAC CGCCTACGAG
CAGGCCATCG AGCAGCTCAC CGAGGACCGC TCCAAGCAGA TCCCGGTGAA CGCCAACCAG
GCTTTCTACA CGCTGCTGCG CGACCGGGTG AAGGTGGAAA TCACCGACGA CGAGGGCAAC
CCGCAGACGG TCGAGCTGTT GGTCATTGAC TGGAACGACC CGGACAACAA CGACTTCTTC
CTCGCCCAGC AGATGTGGGT GTCGGGCGAG ATGTACAAGC GCCGCTGCGA CCTGCTCGGC
TTCGTCAATG GCATCCCGCT GGTCTTCGTC GAGCTGAAGG GGCCGCATGT GCCGCTGAAG
TCGGCCTATG ACGACAACCT GAAGGACTAC AAGGGCCAGA GCGTCCCGCA ACTGTTCCAC
CCCAACGCCT TCATCATCCT GTCGAACGGC TCGCAGACCC GCGTCGGCAC GCTGACCAGT
CCGTGGGAAC ACTTCTTCGA GTGGCGGCGG ATCGATGACG AGACCGAGGC GGGCTCGACC
TCGCTGGAGA CCGCGATCCG GGGGCTGTGC GACAAGCGGC GGCTGCTCGA CCTCGTCGAG
AACTTCACCG TGTTCGAGAC GGCGCGAGGC GGGCTGATCA AGAAGGTGGC CAAGAACCAC
CAGTACCTCG GGGTGAACAA GGCCATCGCG CAGATGGTGA AGCTGCGGGA GTCCGGCGAC
CGCGAGGCGG CGAAGAAGCT CGGCGTCTTC TGGCACACGC AGGGCAGCGG CAAGAGCCTG
TCGATGGTGT TCTTCACTCA GAAGGTGCTG AGGAAGCTCC CCGGCAAGTG GACCTTCGTG
ATGGTCACCG ACCGGGCTGA ACTCGACGAC CAGATCTACA AGACCTTCAC CGCCACCGGG
GCGATCACCG GGGCCGAGGT GCAGGCCAGC AGCGCCGAGA ACCTCAAGCA ACTGCTGCGC
GAGGACCACC GCTACGTCTT CAGCCTGATC CAGAAATTCC GCACGGAGAA GGGCGAGGCC
TACCCGAGGA TTTCTGAGCG GGACGACATC ATCGTCATCA CCGACGAGGC GCACCGCAGC
CAGTACGACG TGTTCGCGCT GAACATGCGT AACGCCCTGC CCAATGCCGC CTTCCTCGGC
TTCACCGGCA CCCCGCTGAT CGCAGGCGAA GAAGAGCGCA CCCGCGAGGT GTTCGGCGAT
TACGTGTCGG TGTACGACTT CGCCCGCTCC ATCGAGGACG GGGCGACGGT GCCCCTGTAC
TACGAGAATC GCACCCCCGA GCTGCAGATC ATCAACGACA ACCTGAACCG CGACATCGAG
CGCCTGCTGG AAGAGGCGGA GCTCTACGAG GAGCAGGAGA AGAAGCTGGA GCGGGAGTTC
GCCCGCGAAT ACCACCTGAT CACCCGCGAC GACCGCCTGG AGACCATCGC CGCCGATCTG
GTGAAGCACT TCCTCGGTCG CGGCTACCGG GGCAAGGCCA TGATGGTCTG CATCGACAAG
GCCACGGCGG TGCGGATGTA CGACAAGGTC CAGACGCACT GGAAGAAGCA TCTGGCCGAC
CTGGAAGCGC AACTGAAGAC CGCCAGCGGG GATGCGAAGG AGACGCTCGC CGACAAGATC
ATGGTCATGC GGACCACCGA CATGGCCGTG GTGGTGTCGC AGTCGCAGAA CGAGATCGAG
GAGCTGAAGG AAAAGGGCCT CGACATCGTC CCGCACCGCC AGCGCATGGT CGGCGAGAAC
CTCGACGAGC AGTTCAAGGA TCCCGATGGC CAACTGCGGC TGGTCTTCGT CTGCGCGATG
TGGATCACCG GCTTCGACGT GCCGACCTGC TCCACGCTCT ACCTCGACAA ACCGATGCGC
AACCACACCC TGATGCAGAC CATCGCCAGG GCGAACCGCG TCGCACCGGG CAAGACCGCC
GGCCTCATCG TCGATTACGT GGGCATCTTC CGTAACCTCC AGGACGCCCT GCGCATCTAC
GCCAAGCCCA ACCAGCCGGG GCAACTGCCG ATCAAGGACA AGGCGGCGCT GGTCGAGCAA
CTGGAAGGGC TGCTGCGGGA CGCGCAATCG TTCTGCACGG GCCTCGGGAT CGACCTGAAG
GGGATCGTGA ACACCCCGCC CGCGCAGCGC CTGGAAGCCC TCCAGAAGGC CATGGACGTG
ATCCTCGAAG CCGGCGAGGA CAAGACCAAG GCCTACCTGC TGCTCGCCGG TCAGGTCGCC
CGGACCTTCA AGGCCATCCT CCCCGACGCG GAAGCCAACG CCCATGCGCC GACCTCGGTG
CTGGTCGCCT ATCTCGGGGC GATGATCAAG GCGCTGCGTC CGCCACCCGA CATCTCCGGG
GTGATGAACG ACCTGGATGC GCTGCTCGAC GACTCCATCG CCACCGAGGG CTATCGCATC
GGGGCGCGAC CGGAAGCCGA GGCCCTGATC GACCTGTCGC AGATCGACTT CGCCGCCCTC
CAGAAGAAGT TCGAGGAAGG CAAGAAGGCC ACCGAGACCG AGAAGCTGAA GGGCCAGATC
GAGAAGAAGC TCGACACGAT GGTCCGCGAG AACAAGGGCC GGATCGACTT CCTGGAGAAG
TTCCAGAGGC TGATCGAGTC CTACAACTCA TCCAGTCACA ACCTGGAGGC CTTCTTCAAG
GAGCTGATGC ACTTCGCCCA GAACCTCACC CAGGAGGAAC AGCGGGCCAC CCGCGAGAAC
CTGAGCGAGG AAGAGCTGGC CATCTTCGAT CTCCTGACAC AACCCGAGCC CGAGCTGACC
GAGAAGGAAA AGAACGAGGT GAAGAAGGTG GCGAAGGACA TGCTCGCCAA ACTGAAGGCC
GAGAAGCTGG TGCTGGACTG GAAGCTGAAG ACCCAGACCA AGGCGGATGT CGAGCGGACG
ATCCGGGACT TTTACATCAG GCTGCCCGGG GCTTACACGC CGGAGCTGAA AAAAGATAAA
CGAGCCAAGA CGTACGCCCA CATCTTCGAG AACTATTTCG GGGCAGGGCA GAGCGTGTAT
CAGGATATTG GTGTTGCGGC GCATTAG
 
Protein sequence
MTTVLTAKEV TAGYGELVLV ELPAMDCFAS LGWEVANLYD ETFGVDGTEG RKSSAEVILV 
PRLRRALERI NPGYPATAYE QAIEQLTEDR SKQIPVNANQ AFYTLLRDRV KVEITDDEGN
PQTVELLVID WNDPDNNDFF LAQQMWVSGE MYKRRCDLLG FVNGIPLVFV ELKGPHVPLK
SAYDDNLKDY KGQSVPQLFH PNAFIILSNG SQTRVGTLTS PWEHFFEWRR IDDETEAGST
SLETAIRGLC DKRRLLDLVE NFTVFETARG GLIKKVAKNH QYLGVNKAIA QMVKLRESGD
REAAKKLGVF WHTQGSGKSL SMVFFTQKVL RKLPGKWTFV MVTDRAELDD QIYKTFTATG
AITGAEVQAS SAENLKQLLR EDHRYVFSLI QKFRTEKGEA YPRISERDDI IVITDEAHRS
QYDVFALNMR NALPNAAFLG FTGTPLIAGE EERTREVFGD YVSVYDFARS IEDGATVPLY
YENRTPELQI INDNLNRDIE RLLEEAELYE EQEKKLEREF AREYHLITRD DRLETIAADL
VKHFLGRGYR GKAMMVCIDK ATAVRMYDKV QTHWKKHLAD LEAQLKTASG DAKETLADKI
MVMRTTDMAV VVSQSQNEIE ELKEKGLDIV PHRQRMVGEN LDEQFKDPDG QLRLVFVCAM
WITGFDVPTC STLYLDKPMR NHTLMQTIAR ANRVAPGKTA GLIVDYVGIF RNLQDALRIY
AKPNQPGQLP IKDKAALVEQ LEGLLRDAQS FCTGLGIDLK GIVNTPPAQR LEALQKAMDV
ILEAGEDKTK AYLLLAGQVA RTFKAILPDA EANAHAPTSV LVAYLGAMIK ALRPPPDISG
VMNDLDALLD DSIATEGYRI GARPEAEALI DLSQIDFAAL QKKFEEGKKA TETEKLKGQI
EKKLDTMVRE NKGRIDFLEK FQRLIESYNS SSHNLEAFFK ELMHFAQNLT QEEQRATREN
LSEEELAIFD LLTQPEPELT EKEKNEVKKV AKDMLAKLKA EKLVLDWKLK TQTKADVERT
IRDFYIRLPG AYTPELKKDK RAKTYAHIFE NYFGAGQSVY QDIGVAAH