Gene Information |
Locus tag | Tmz1t_3601 |
Symbol | hsdR |
ID | 7873106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3950630 |
End bp | 3954043 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643700541 |
Product | type I restriction enzyme EcoKI subunit R |
Protein accession | YP_002890571 |
Protein GI | 237654257 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGGCG ACATGCCATC GAATTTCGGA CATCTCAAGG TGCACGACCA GCAATTGGTG CGCCTAGGGA TGCTCGCCGA GCGCTATTTT TCTGATGACC CGAACACTTG CCTCCTGAAG CTTCGCCAAC TGACGGAGCT GCTCGCGCAG CTCGCAGCGT CGAAGGTGGG AATCTATACG TCGCCTGACG AGAAGCAGGT CGACCTGCTC CGGAGATTGC AGGACAAGGG AATCGTTCCA CGCGAAGTTG GGGCCTTGTT CGCCGAGGTG CGCAAGGCAG GTAACGACGC GAACCATTGC TTGAGTGGCG ACCATCGCAC GGCGTTGCTG GGCCTTCGGC TTAGCTGGCA ACTCGGCGTG TGGTTCCATC GGACCTTCAA AGACCCGGCC TTCAAGTCAG GCCCGTTCCA GCCGCCAGCG CCCCCGGCAA ACGAGAGTGC AGAACTGAAA ACCGAGCTCG ATGCACTTCG CGCCGAGCTG AGCCAGTATC GCGCCGCACA GAAAGATGCG GCTCAGGCGC TTGATCAGAT TCAAGTCCAG ATCCGACAAG CCGAGGACGA CAGCGCTGTC TGGGAAACCA TGGCTGCCGA GGCCGAAGCG GCAAAGGCCG AACTGCTCCA GCGGCTGGAG GTACTCCAAG CCCAAGCTGC AGCCGCGCCG CCCCAAGTCA TTGCCAAGTT CGTGGCAGCA GCAGATTCGG CGGCCGCTGT CATCCAGATC AGCGAAGGCG ACACTCGCAG GCTCATCGAC CAACAGCTTG CCCTCGCAGG CTGGACGGTC GATTCGGCTC GCATCACATT TGCCAGGGGG ATACGTCCCC AACGCGGTCA AAACCTCGCC ATCGCCGAAT GGCCCACCGA GACCGGGCCA GCCGACTATG CGTTGTTCAT CGACCTCATG CCGGTGGCCA TCGTGGAGGC CAAGCGCAAG AACATCGATG TATCAGCCGC ATTGCAGCAG GCCAAGCGCT ACAGCCGCGG CTTCCGGGTA TCGCCCGAGG TGGAACTGCC CCTGAGCAAT TTTGGGGCCA ACGCGGAATT TCGAGTTCCC TTTGTGTTTT CGTCCAACGG CCGCCCGTAC TTGCGGCAGC TCGCGGAGCG AAGCGGCGTC TGGTTTTGCG ACCTGCGTCG GCCCGCGAAC CTGGGTCACG CCCTCGACGG CTGGTACACG CCGGAAGGCC TCAACGCCCT GCTACAGCGT GACGACGACC GCGCGCACAC CGAACTCGCC AATGCGCCGT TTGATTTCGG CTTCCCGCTT CGGCCCTACC AGCAGCGCGC CATCCTGGCC ACCGAAGCCA GCATCCGCGA TGGGCAGCGC GCCATCCTGC TGGCCATGGC CACGGGCACC GGCAAGACCA AGACCTGCAT CGCACTGATC TACCGCCTGC TCAAGGCCAA ACGCTTCAGG CGCATCCTGT TCCTCGTCGA CCGCTCGGCC CTGGGCGAGC AGGCCGCCAA CGCGTTCAAG GACACGCGCA TGGAACGCCT GCAGACCTTT GCCGACATCT TCGGCATCAA GGAGCTGGAG GTCCAGGCGC CCGACGACGA CACCGCCGTG CACCTGGCCA CCGTGCAAGG CATGGTGCAG CGTGTGCTGT ACCCCAGCGA CGGCACGCCG CCGCCGCCCA TCGACCAGTA CGACTGCATC ATCGTCGACG AGTGCCACCG CGGCTACCTG CTGGACCGTG AGCTGTCAGA CACCGAGCTG AGCTTCCGTG GCTACGACGA CTACGTGTCC AAGTACCGAC GTGTGCTGGA CTACTTCGAC GCGGTGAAGG TCGGCCTCAC CGCCACACCC 
GCCCTTCACA CCACGCAAAT CTTCGGTACG CCCGTCTTCG CCTACGGCTA CCGCGAAGCC GTGGTCGACG GCTATCTGGT GGACTACGAG CCACCAATCC AGGTGCATAC CCTGCTCTCC GGGCAGGGCA TTGCGTGGAA GGCCGGCGAA GAGGTCAAGG TCTACAACAC CGCCCGCCAG CAGATCGAGC TGTTCAAGAC GCCCGACGAG ATCAAGCTCA AGGTCGATGA CTTCAACCGC AAGGTCATCA CGCGTCCCTT CAACGAGGTG GTCTGCACCT ACCTGGCACA AGAACTGGAC CCGGCCTCCC GCCGCAAGAC CTTGATCTTC TGCGTCAGCG ACAGCCACGC CGACATGGTG GTGGACTTGT TGAAGAAGGC CTTCGCCGCG CAGTACGGCG CGGTCGAGGA CGACGCCGTC ATCAAGATCA CCGGCGCGGC CGACAAGCCC TTGCAGCTGA TCCGCCGCTA CAAGAACGAG CGCCTGCCGA ATGTCGCTGT CACCGTCGAC CTGCTGACCA CGGGCGTCGA TGTGCCCGAG ATCTGCAACC TGGTGTTCCT GCGCCAAGTG AACAGCCGCA TCCTGTTCGA CCAGATGCTG GGCCGCGCGA CGCGGCTCTG TAACTTTGGC GGCACCGACG TGAAGGACGC TTTCCGCGTG TTCGATGCGG TCCGCATCTT CGAGGCCATT GGCGACATGA CGGCCATGAA ACCCGTCGTC GTAAACCCGA AGATCACCTT CACTCAGCTT TCGCAGGAGC TGGCCACACT GAAGGATGAA TCGGCCACCG AACTGGTGCG CGACCAGTTC CTGGCCAAGC TGCAGGCCAA GAAGCGCCAC CTCACCGACA AGAACCGGCA GGACTTCGAG GCCAAGGCGG GTATGTCGGT GCAGGCCTTC GTCCAGAAGC TGAAGGCGAT GCCACTGGCC GATGTGGCGG CGTGGTTCGT GCAGAACCCG GAACTGGGCG AGCTGCTCGA CCGCCGAAGC GATGGCCCCG AGCGCGAGAT GTTCATTTCA GAGCACACCG ATGCCTTCGA CCGTGCCGAG CGCGGCTACG GCAAGGGCAA GAAGCCCGAC GACTACATCC GCGCCTTCAG CGAGTTCATC AAGACGCAGG GCAACCAGAT TCCGGCGCTG GTGACCGTGC TGACACGGCC ACGCGAGTTG ACCCGCGCGC AGCTGCGCGA ACTGGTCTTG GCGCTGGACC AGGCCGGCTT CACCGAAACC AGCCTCGCCT CGGCCTGGCG CGAGCTGACC AACCAGGACA TCGCGGCGCG CATCGTGGGC TACATCCGCC AGGCAGCCAT CGGCGATGCC CTGGTCCCCT ATGTGGAGCG CGTCGACCGG GCCTTGCAGC ATCTGCTGGC GCATCCGCCT GCAGGCAAGC CCTGGAGCAC ACCGCAGCGC GACTGGCTCA AGCGCATCGC TGCGCAGACC AAGGCCAATG TGCTGGTGGA CCGCTCCGCC ATCGACGACC CCGACTTGAT CTTCAAGCGC GAAGGTGGAG GCTTTAACCG GCTGGACAAG GTCTTCAACG GCCAACTTCA GCCCGTGCTC GACGCCTTCA ACGACGCGCT CTGGGCCTTG CCGCCCGAAG CTGCCAACCG CTGA
|
Protein sequence | MFGDMPSNFG HLKVHDQQLV RLGMLAERYF SDDPNTCLLK LRQLTELLAQ LAASKVGIYT SPDEKQVDLL RRLQDKGIVP REVGALFAEV RKAGNDANHC LSGDHRTALL GLRLSWQLGV WFHRTFKDPA FKSGPFQPPA PPANESAELK TELDALRAEL SQYRAAQKDA AQALDQIQVQ IRQAEDDSAV WETMAAEAEA AKAELLQRLE VLQAQAAAAP PQVIAKFVAA ADSAAAVIQI SEGDTRRLID QQLALAGWTV DSARITFARG IRPQRGQNLA IAEWPTETGP ADYALFIDLM PVAIVEAKRK NIDVSAALQQ AKRYSRGFRV SPEVELPLSN FGANAEFRVP FVFSSNGRPY LRQLAERSGV WFCDLRRPAN LGHALDGWYT PEGLNALLQR DDDRAHTELA NAPFDFGFPL RPYQQRAILA TEASIRDGQR AILLAMATGT GKTKTCIALI YRLLKAKRFR RILFLVDRSA LGEQAANAFK DTRMERLQTF ADIFGIKELE VQAPDDDTAV HLATVQGMVQ RVLYPSDGTP PPPIDQYDCI IVDECHRGYL LDRELSDTEL SFRGYDDYVS KYRRVLDYFD AVKVGLTATP ALHTTQIFGT PVFAYGYREA VVDGYLVDYE PPIQVHTLLS GQGIAWKAGE EVKVYNTARQ QIELFKTPDE IKLKVDDFNR KVITRPFNEV VCTYLAQELD PASRRKTLIF CVSDSHADMV VDLLKKAFAA QYGAVEDDAV IKITGAADKP LQLIRRYKNE RLPNVAVTVD LLTTGVDVPE ICNLVFLRQV NSRILFDQML GRATRLCNFG GTDVKDAFRV FDAVRIFEAI GDMTAMKPVV VNPKITFTQL SQELATLKDE SATELVRDQF LAKLQAKKRH LTDKNRQDFE AKAGMSVQAF VQKLKAMPLA DVAAWFVQNP ELGELLDRRS DGPEREMFIS EHTDAFDRAE RGYGKGKKPD DYIRAFSEFI KTQGNQIPAL VTVLTRPREL TRAQLRELVL ALDQAGFTET SLASAWRELT NQDIAARIVG YIRQAAIGDA LVPYVERVDR ALQHLLAHPP AGKPWSTPQR DWLKRIAAQT KANVLVDRSA IDDPDLIFKR EGGGFNRLDK VFNGQLQPVL DAFNDALWAL PPEAANR
|
| |
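The Start bp, End bp, Gene Length and Protein Length fields under Gene Information are related by simple arithmetic, and the short sketch below checks that they are mutually consistent. It assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the gene sequence above ends in TGA), neither of which the record states explicitly. The function names `gene_length`, `protein_length` and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the Gene Information table of this record.
length = gene_length(3950630, 3954043)
print(length)                  # 3414
print(protein_length(length))  # 1137
```

Passing the full gene sequence string from the Sequence section to `gc_percent` would give a value to compare against the GC content field; the spaces between the 10-base blocks are stripped before counting.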