Gene Tmz1t_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3601 
SymbolhsdR 
ID7873106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3950630 
End bp3954043 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content64% 
IMG OID643700541 
Producttype I restriction enzyme EcoKI subunit R 
Protein accessionYP_002890571 
Protein GI237654257 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGGCG ACATGCCATC GAATTTCGGA CATCTCAAGG TGCACGACCA GCAATTGGTG 
CGCCTAGGGA TGCTCGCCGA GCGCTATTTT TCTGATGACC CGAACACTTG CCTCCTGAAG
CTTCGCCAAC TGACGGAGCT GCTCGCGCAG CTCGCAGCGT CGAAGGTGGG AATCTATACG
TCGCCTGACG AGAAGCAGGT CGACCTGCTC CGGAGATTGC AGGACAAGGG AATCGTTCCA
CGCGAAGTTG GGGCCTTGTT CGCCGAGGTG CGCAAGGCAG GTAACGACGC GAACCATTGC
TTGAGTGGCG ACCATCGCAC GGCGTTGCTG GGCCTTCGGC TTAGCTGGCA ACTCGGCGTG
TGGTTCCATC GGACCTTCAA AGACCCGGCC TTCAAGTCAG GCCCGTTCCA GCCGCCAGCG
CCCCCGGCAA ACGAGAGTGC AGAACTGAAA ACCGAGCTCG ATGCACTTCG CGCCGAGCTG
AGCCAGTATC GCGCCGCACA GAAAGATGCG GCTCAGGCGC TTGATCAGAT TCAAGTCCAG
ATCCGACAAG CCGAGGACGA CAGCGCTGTC TGGGAAACCA TGGCTGCCGA GGCCGAAGCG
GCAAAGGCCG AACTGCTCCA GCGGCTGGAG GTACTCCAAG CCCAAGCTGC AGCCGCGCCG
CCCCAAGTCA TTGCCAAGTT CGTGGCAGCA GCAGATTCGG CGGCCGCTGT CATCCAGATC
AGCGAAGGCG ACACTCGCAG GCTCATCGAC CAACAGCTTG CCCTCGCAGG CTGGACGGTC
GATTCGGCTC GCATCACATT TGCCAGGGGG ATACGTCCCC AACGCGGTCA AAACCTCGCC
ATCGCCGAAT GGCCCACCGA GACCGGGCCA GCCGACTATG CGTTGTTCAT CGACCTCATG
CCGGTGGCCA TCGTGGAGGC CAAGCGCAAG AACATCGATG TATCAGCCGC ATTGCAGCAG
GCCAAGCGCT ACAGCCGCGG CTTCCGGGTA TCGCCCGAGG TGGAACTGCC CCTGAGCAAT
TTTGGGGCCA ACGCGGAATT TCGAGTTCCC TTTGTGTTTT CGTCCAACGG CCGCCCGTAC
TTGCGGCAGC TCGCGGAGCG AAGCGGCGTC TGGTTTTGCG ACCTGCGTCG GCCCGCGAAC
CTGGGTCACG CCCTCGACGG CTGGTACACG CCGGAAGGCC TCAACGCCCT GCTACAGCGT
GACGACGACC GCGCGCACAC CGAACTCGCC AATGCGCCGT TTGATTTCGG CTTCCCGCTT
CGGCCCTACC AGCAGCGCGC CATCCTGGCC ACCGAAGCCA GCATCCGCGA TGGGCAGCGC
GCCATCCTGC TGGCCATGGC CACGGGCACC GGCAAGACCA AGACCTGCAT CGCACTGATC
TACCGCCTGC TCAAGGCCAA ACGCTTCAGG CGCATCCTGT TCCTCGTCGA CCGCTCGGCC
CTGGGCGAGC AGGCCGCCAA CGCGTTCAAG GACACGCGCA TGGAACGCCT GCAGACCTTT
GCCGACATCT TCGGCATCAA GGAGCTGGAG GTCCAGGCGC CCGACGACGA CACCGCCGTG
CACCTGGCCA CCGTGCAAGG CATGGTGCAG CGTGTGCTGT ACCCCAGCGA CGGCACGCCG
CCGCCGCCCA TCGACCAGTA CGACTGCATC ATCGTCGACG AGTGCCACCG CGGCTACCTG
CTGGACCGTG AGCTGTCAGA CACCGAGCTG AGCTTCCGTG GCTACGACGA CTACGTGTCC
AAGTACCGAC GTGTGCTGGA CTACTTCGAC GCGGTGAAGG TCGGCCTCAC CGCCACACCC
GCCCTTCACA CCACGCAAAT CTTCGGTACG CCCGTCTTCG CCTACGGCTA CCGCGAAGCC
GTGGTCGACG GCTATCTGGT GGACTACGAG CCACCAATCC AGGTGCATAC CCTGCTCTCC
GGGCAGGGCA TTGCGTGGAA GGCCGGCGAA GAGGTCAAGG TCTACAACAC CGCCCGCCAG
CAGATCGAGC TGTTCAAGAC GCCCGACGAG ATCAAGCTCA AGGTCGATGA CTTCAACCGC
AAGGTCATCA CGCGTCCCTT CAACGAGGTG GTCTGCACCT ACCTGGCACA AGAACTGGAC
CCGGCCTCCC GCCGCAAGAC CTTGATCTTC TGCGTCAGCG ACAGCCACGC CGACATGGTG
GTGGACTTGT TGAAGAAGGC CTTCGCCGCG CAGTACGGCG CGGTCGAGGA CGACGCCGTC
ATCAAGATCA CCGGCGCGGC CGACAAGCCC TTGCAGCTGA TCCGCCGCTA CAAGAACGAG
CGCCTGCCGA ATGTCGCTGT CACCGTCGAC CTGCTGACCA CGGGCGTCGA TGTGCCCGAG
ATCTGCAACC TGGTGTTCCT GCGCCAAGTG AACAGCCGCA TCCTGTTCGA CCAGATGCTG
GGCCGCGCGA CGCGGCTCTG TAACTTTGGC GGCACCGACG TGAAGGACGC TTTCCGCGTG
TTCGATGCGG TCCGCATCTT CGAGGCCATT GGCGACATGA CGGCCATGAA ACCCGTCGTC
GTAAACCCGA AGATCACCTT CACTCAGCTT TCGCAGGAGC TGGCCACACT GAAGGATGAA
TCGGCCACCG AACTGGTGCG CGACCAGTTC CTGGCCAAGC TGCAGGCCAA GAAGCGCCAC
CTCACCGACA AGAACCGGCA GGACTTCGAG GCCAAGGCGG GTATGTCGGT GCAGGCCTTC
GTCCAGAAGC TGAAGGCGAT GCCACTGGCC GATGTGGCGG CGTGGTTCGT GCAGAACCCG
GAACTGGGCG AGCTGCTCGA CCGCCGAAGC GATGGCCCCG AGCGCGAGAT GTTCATTTCA
GAGCACACCG ATGCCTTCGA CCGTGCCGAG CGCGGCTACG GCAAGGGCAA GAAGCCCGAC
GACTACATCC GCGCCTTCAG CGAGTTCATC AAGACGCAGG GCAACCAGAT TCCGGCGCTG
GTGACCGTGC TGACACGGCC ACGCGAGTTG ACCCGCGCGC AGCTGCGCGA ACTGGTCTTG
GCGCTGGACC AGGCCGGCTT CACCGAAACC AGCCTCGCCT CGGCCTGGCG CGAGCTGACC
AACCAGGACA TCGCGGCGCG CATCGTGGGC TACATCCGCC AGGCAGCCAT CGGCGATGCC
CTGGTCCCCT ATGTGGAGCG CGTCGACCGG GCCTTGCAGC ATCTGCTGGC GCATCCGCCT
GCAGGCAAGC CCTGGAGCAC ACCGCAGCGC GACTGGCTCA AGCGCATCGC TGCGCAGACC
AAGGCCAATG TGCTGGTGGA CCGCTCCGCC ATCGACGACC CCGACTTGAT CTTCAAGCGC
GAAGGTGGAG GCTTTAACCG GCTGGACAAG GTCTTCAACG GCCAACTTCA GCCCGTGCTC
GACGCCTTCA ACGACGCGCT CTGGGCCTTG CCGCCCGAAG CTGCCAACCG CTGA
 
Protein sequence
MFGDMPSNFG HLKVHDQQLV RLGMLAERYF SDDPNTCLLK LRQLTELLAQ LAASKVGIYT 
SPDEKQVDLL RRLQDKGIVP REVGALFAEV RKAGNDANHC LSGDHRTALL GLRLSWQLGV
WFHRTFKDPA FKSGPFQPPA PPANESAELK TELDALRAEL SQYRAAQKDA AQALDQIQVQ
IRQAEDDSAV WETMAAEAEA AKAELLQRLE VLQAQAAAAP PQVIAKFVAA ADSAAAVIQI
SEGDTRRLID QQLALAGWTV DSARITFARG IRPQRGQNLA IAEWPTETGP ADYALFIDLM
PVAIVEAKRK NIDVSAALQQ AKRYSRGFRV SPEVELPLSN FGANAEFRVP FVFSSNGRPY
LRQLAERSGV WFCDLRRPAN LGHALDGWYT PEGLNALLQR DDDRAHTELA NAPFDFGFPL
RPYQQRAILA TEASIRDGQR AILLAMATGT GKTKTCIALI YRLLKAKRFR RILFLVDRSA
LGEQAANAFK DTRMERLQTF ADIFGIKELE VQAPDDDTAV HLATVQGMVQ RVLYPSDGTP
PPPIDQYDCI IVDECHRGYL LDRELSDTEL SFRGYDDYVS KYRRVLDYFD AVKVGLTATP
ALHTTQIFGT PVFAYGYREA VVDGYLVDYE PPIQVHTLLS GQGIAWKAGE EVKVYNTARQ
QIELFKTPDE IKLKVDDFNR KVITRPFNEV VCTYLAQELD PASRRKTLIF CVSDSHADMV
VDLLKKAFAA QYGAVEDDAV IKITGAADKP LQLIRRYKNE RLPNVAVTVD LLTTGVDVPE
ICNLVFLRQV NSRILFDQML GRATRLCNFG GTDVKDAFRV FDAVRIFEAI GDMTAMKPVV
VNPKITFTQL SQELATLKDE SATELVRDQF LAKLQAKKRH LTDKNRQDFE AKAGMSVQAF
VQKLKAMPLA DVAAWFVQNP ELGELLDRRS DGPEREMFIS EHTDAFDRAE RGYGKGKKPD
DYIRAFSEFI KTQGNQIPAL VTVLTRPREL TRAQLRELVL ALDQAGFTET SLASAWRELT
NQDIAARIVG YIRQAAIGDA LVPYVERVDR ALQHLLAHPP AGKPWSTPQR DWLKRIAAQT
KANVLVDRSA IDDPDLIFKR EGGGFNRLDK VFNGQLQPVL DAFNDALWAL PPEAANR