Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0006 |
Symbol | |
ID | 7085104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 8404 |
End bp | 11610 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643697056 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002353705 |
Protein GI | 237653093 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCG TTCTCACCGC CAAGGAAGTG ACTGCCGGTT ACGGCGAACT CGTTCTCGTC GAGTTGCCCG CGATGGACTG TTTTGCTTCC CTGGGGTGGG AGGTCGCCAA CCTCTACGAC GAAACCTTCG GCGTCGACGG CACGGAAGGG CGCAAGTCCT CCGCCGAGGT CATCCTGGTG CCGCGCTTGC GGCGGGCGCT GGAGCGCATC AATCCCGGCT ACCCAGCCAC CGCCTACGAG CAGGCCATCG AGCAGCTCAC CGAGGACCGC TCCAAGCAGA TCCCGGTGAA CGCCAACCAG GCTTTCTACA CGCTGCTGCG CGACCGGGTG AAGGTGGAAA TCACCGACGA CGAGGGCAAC CCGCAGACGG TCGAGCTGTT GGTCATTGAC TGGAACGACC CGGACAACAA CGACTTCTTC CTCGCCCAGC AGATGTGGGT GTCGGGCGAG ATGTACAAGC GCCGCTGCGA CCTGCTCGGC TTCGTCAATG GCATCCCGCT GGTCTTCGTC GAGCTGAAGG GGCCGCATGT GCCGCTGAAG TCGGCCTATG ACGACAACCT GAAGGACTAC AAGGGCCAGA GCGTCCCGCA ACTGTTCCAC CCCAACGCCT TCATCATCCT GTCGAACGGC TCGCAGACCC GCGTCGGCAC GCTGACCAGT CCGTGGGAAC ACTTCTTCGA GTGGCGGCGG ATCGATGACG AGACCGAGGC GGGCTCGACC TCGCTGGAGA CCGCGATCCG GGGGCTGTGC GACAAGCGGC GGCTGCTCGA CCTCGTCGAG AACTTCACCG TGTTCGAGAC GGCGCGAGGC GGGCTGATCA AGAAGGTGGC CAAGAACCAC CAGTACCTCG GGGTGAACAA GGCCATCGCG CAGATGGTGA AGCTGCGGGA GTCCGGCGAC CGCGAGGCGG CGAAGAAGCT CGGCGTCTTC TGGCACACGC AGGGCAGCGG CAAGAGCCTG TCGATGGTGT TCTTCACTCA GAAGGTGCTG AGGAAGCTCC CCGGCAAGTG GACCTTCGTG ATGGTCACCG ACCGGGCTGA ACTCGACGAC CAGATCTACA AGACCTTCAC CGCCACCGGG GCGATCACCG GGGCCGAGGT GCAGGCCAGC AGCGCCGAGA ACCTCAAGCA ACTGCTGCGC GAGGACCACC GCTACGTCTT CAGCCTGATC CAGAAATTCC GCACGGAGAA GGGCGAGGCC TACCCGAGGA TTTCTGAGCG GGACGACATC ATCGTCATCA CCGACGAGGC GCACCGCAGC CAGTACGACG TGTTCGCGCT GAACATGCGT AACGCCCTGC CCAATGCCGC CTTCCTCGGC TTCACCGGCA CCCCGCTGAT CGCAGGCGAA GAAGAGCGCA CCCGCGAGGT GTTCGGCGAT TACGTGTCGG TGTACGACTT CGCCCGCTCC ATCGAGGACG GGGCGACGGT GCCCCTGTAC TACGAGAATC GCACCCCCGA GCTGCAGATC ATCAACGACA ACCTGAACCG CGACATCGAG CGCCTGCTGG AAGAGGCGGA GCTCTACGAG GAGCAGGAGA AGAAGCTGGA GCGGGAGTTC GCCCGCGAAT ACCACCTGAT CACCCGCGAC GACCGCCTGG AGACCATCGC CGCCGATCTG GTGAAGCACT TCCTCGGTCG CGGCTACCGG GGCAAGGCCA TGATGGTCTG CATCGACAAG GCCACGGCGG TGCGGATGTA CGACAAGGTC CAGACGCACT GGAAGAAGCA TCTGGCCGAC CTGGAAGCGC AACTGAAGAC CGCCAGCGGG GATGCGAAGG AGACGCTCGC CGACAAGATC ATGGTCATGC GGACCACCGA CATGGCCGTG GTGGTGTCGC AGTCGCAGAA CGAGATCGAG GAGCTGAAGG AAAAGGGCCT CGACATCGTC CCGCACCGCC AGCGCATGGT CGGCGAGAAC CTCGACGAGC AGTTCAAGGA TCCCGATGGC CAACTGCGGC TGGTCTTCGT CTGCGCGATG TGGATCACCG GCTTCGACGT GCCGACCTGC TCCACGCTCT ACCTCGACAA ACCGATGCGC AACCACACCC TGATGCAGAC CATCGCCAGG GCGAACCGCG TCGCACCGGG CAAGACCGCC GGCCTCATCG TCGATTACGT GGGCATCTTC CGTAACCTCC AGGACGCCCT GCGCATCTAC GCCAAGCCCA ACCAGCCGGG GCAACTGCCG ATCAAGGACA AGGCGGCGCT GGTCGAGCAA CTGGAAGGGC TGCTGCGGGA CGCGCAATCG TTCTGCACGG GCCTCGGGAT CGACCTGAAG GGGATCGTGA ACACCCCGCC CGCGCAGCGC CTGGAAGCCC TCCAGAAGGC CATGGACGTG ATCCTCGAAG CCGGCGAGGA CAAGACCAAG GCCTACCTGC TGCTCGCCGG TCAGGTCGCC CGGACCTTCA AGGCCATCCT CCCCGACGCG GAAGCCAACG CCCATGCGCC GACCTCGGTG CTGGTCGCCT ATCTCGGGGC GATGATCAAG GCGCTGCGTC CGCCACCCGA CATCTCCGGG GTGATGAACG ACCTGGATGC GCTGCTCGAC GACTCCATCG CCACCGAGGG CTATCGCATC GGGGCGCGAC CGGAAGCCGA GGCCCTGATC GACCTGTCGC AGATCGACTT CGCCGCCCTC CAGAAGAAGT TCGAGGAAGG CAAGAAGGCC ACCGAGACCG AGAAGCTGAA GGGCCAGATC GAGAAGAAGC TCGACACGAT GGTCCGCGAG AACAAGGGCC GGATCGACTT CCTGGAGAAG TTCCAGAGGC TGATCGAGTC CTACAACTCA TCCAGTCACA ACCTGGAGGC CTTCTTCAAG GAGCTGATGC ACTTCGCCCA GAACCTCACC CAGGAGGAAC AGCGGGCCAC CCGCGAGAAC CTGAGCGAGG AAGAGCTGGC CATCTTCGAT CTCCTGACAC AACCCGAGCC CGAGCTGACC GAGAAGGAAA AGAACGAGGT GAAGAAGGTG GCGAAGGACA TGCTCGCCAA ACTGAAGGCC GAGAAGCTGG TGCTGGACTG GAAGCTGAAG ACCCAGACCA AGGCGGATGT CGAGCGGACG ATCCGGGACT TTTACATCAG GCTGCCCGGG GCTTACACGC CGGAGCTGAA AAAAGATAAA CGAGCCAAGA CGTACGCCCA CATCTTCGAG AACTATTTCG GGGCAGGGCA GAGCGTGTAT CAGGATATTG GTGTTGCGGC GCATTAG
|
Protein sequence | MTTVLTAKEV TAGYGELVLV ELPAMDCFAS LGWEVANLYD ETFGVDGTEG RKSSAEVILV PRLRRALERI NPGYPATAYE QAIEQLTEDR SKQIPVNANQ AFYTLLRDRV KVEITDDEGN PQTVELLVID WNDPDNNDFF LAQQMWVSGE MYKRRCDLLG FVNGIPLVFV ELKGPHVPLK SAYDDNLKDY KGQSVPQLFH PNAFIILSNG SQTRVGTLTS PWEHFFEWRR IDDETEAGST SLETAIRGLC DKRRLLDLVE NFTVFETARG GLIKKVAKNH QYLGVNKAIA QMVKLRESGD REAAKKLGVF WHTQGSGKSL SMVFFTQKVL RKLPGKWTFV MVTDRAELDD QIYKTFTATG AITGAEVQAS SAENLKQLLR EDHRYVFSLI QKFRTEKGEA YPRISERDDI IVITDEAHRS QYDVFALNMR NALPNAAFLG FTGTPLIAGE EERTREVFGD YVSVYDFARS IEDGATVPLY YENRTPELQI INDNLNRDIE RLLEEAELYE EQEKKLEREF AREYHLITRD DRLETIAADL VKHFLGRGYR GKAMMVCIDK ATAVRMYDKV QTHWKKHLAD LEAQLKTASG DAKETLADKI MVMRTTDMAV VVSQSQNEIE ELKEKGLDIV PHRQRMVGEN LDEQFKDPDG QLRLVFVCAM WITGFDVPTC STLYLDKPMR NHTLMQTIAR ANRVAPGKTA GLIVDYVGIF RNLQDALRIY AKPNQPGQLP IKDKAALVEQ LEGLLRDAQS FCTGLGIDLK GIVNTPPAQR LEALQKAMDV ILEAGEDKTK AYLLLAGQVA RTFKAILPDA EANAHAPTSV LVAYLGAMIK ALRPPPDISG VMNDLDALLD DSIATEGYRI GARPEAEALI DLSQIDFAAL QKKFEEGKKA TETEKLKGQI EKKLDTMVRE NKGRIDFLEK FQRLIESYNS SSHNLEAFFK ELMHFAQNLT QEEQRATREN LSEEELAIFD LLTQPEPELT EKEKNEVKKV AKDMLAKLKA EKLVLDWKLK TQTKADVERT IRDFYIRLPG AYTPELKKDK RAKTYAHIFE NYFGAGQSVY QDIGVAAH
|
| |