Gene Tmz1t_3175 details


Gene Information

Locus tag: Tmz1t_3175
Symbol:
ID: 7874315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3448901
End bp: 3452248
Gene Length: 3348 bp
Protein Length: 1115 aa
Translation table: 11
GC content: 62%
IMG OID: 643700103
Product: type III restriction protein res subunit
Protein accession: YP_002890147
Protein GI: 237653833
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
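
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 3448901
END_BP = 3452248


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.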


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCGAATA CGGATGCCCG AGCATCCTGC TTCTATGCGC GCCGCACGCT GGAACTGGGT 
GTCGCCTGGC TCTACAAGCA CGACAAGTCG CTGAAGCTGC CCTATCAAGA CAATCTCAGC
GCGCTGATAC ACGAGCCGAC CTTCCGCCAG ACGGTCGGCG ATGCGCTCTT CACCAAAGCC
CGGCTCATCA AGGACCTGGG CAACATGGCG GTCCACAGCG CCAAGAAGAT GGCGCCTGCC
GACGCGGTGA ATACCACCCG CGAGCTTTTC CACTTTTGCT ACTGGCTTGC GCGCACCTAC
GGGCGCGTCG CCCGTCCGAA CCCAAGTCAG CGGTTCGACA TCAAGCTGCT GCCAACCGCG
TCCGCACTGC CGGCGCAAAC CGTCGAGCAA CTGCAAAAGC TTGAGGTCGA GCTACGGGCT
AAGGACGAGA AGCTGTTTTC CCTGCTGTCC GAAAGGGCGG CACTGGATGA GGAACTGCGT
CGCCTGCGCG AAGAGTTCGC CGCGATACGG CAGGCCAACA CGGCCCAGCC TGATACGCAC
GACTATTCCG AAGCCGAGAC CCGCAAGCTC TTCATCGACA CGCTACTGAA GGAAGCCGGT
TGGCATCTCG ACCCGGCCAA GAACTTCGAA GTCGAAGTCA CCGGGATGCC CAATGCCGAA
AACAAGGGCT ACGTCGATTA CGTGCTGTGG GGCGACGACG GCAAGCCGCT CGGTTTGATT
GAGGCGAAAC GCACCACCAA GAACCCCACG GTCGGGCAGC AGCAGGCCAA GCTCTACGCG
GACTGCCTGG AGGCGCAATA CGGTCAGCGC CCGGTCATCT TCTACTCGAA CGGCTACGAG
CACTGGATAT GGGACGACAG GTCCTATCCG CCGCGCGCGG TGCAGGGGTT CTACAAGAAG
GCTGAGCTCG AGCTCCTCAT CCAGCGGCGC AACAGCCGCA AGAAGCTGTC AGAAGCCGTC
ATCAACAGCG CCATCATCGA GCGCTACTAC CAGACGCGGG CGGTGCGCCG TGTCGGGGAA
AGCTTCGAGA CCGACAAGCT GAGGAAGTCG CTGCTGGTGA TGGCGACCGG TGCCGGCAAG
ACCCGGACAG TGATTGCACT GGCCGACATC CTGATGCGGT GCAACTGGGC CAAGCGCGTG
CTGTTCCTCG CCGACCGGGT GGCGCTCGTC AATCAGGCGG TGAATGCCTT CAAGGCGCAT
CTGCCGGATT CGGCGCCAGT GAACCTGGTG ACCGACAAGG CCACCGAAGG GCGGGTGTAT
GTGTCGACCT ACCCGACGAT GATGGGCCTG ATTGACGAGG CCTCGAATGG GGAAAACGCA
GGCCAGCGCC GCTTTGGCGT CGGTCACTTC GACCTCATCA TCATCGACGA GGCGCATCGC
TCCGTCTACC AGAAGTACCG CGCCATCTTC GACTACTTCG ACTCGCTGCT GGTGGGGCTT
ACCGCCACGC CCAAGGACGA AATCGACCAC AACACCTACG GTCTGTTCGA CCTCGAAACC
GGCGTGCCGA CCGACGCTTA CGGCCTCGAC GAAGCCGTGG CCGACAAGCA CCTGGTGCCG
CCGGTCCCCA TCTCGGTGCC GCTCAAATTC CAGCGCGAAG GCATCAAGTA CGAGGACCTT
TCCGAGGAGG AGAAGGAAGT CTGGGACGCT CTCGAGTGGA GCCACGACGG GACGGTGCCG
GACGAGGTGA ATGCCGAGGC CGTGAACAAG TGGCTGTTCA ATACGGACAC CGTCGACAAG
GTGCTCGAGA CCCTGATGAC CCAGGGGCAG AAGGTGGCCG GAGGCGACCG GCTGGGCAAG
ACCATCATTT TTGCCAAGAA CAACGACCAC GCTGACTTCA TCGCCCAGCG CTTCAATGCC
AACTACCCGC ACTACAAGGG CCACTTCGCA CGGGTGGTGA CCTACAAGAC CGAATACGCC
CAGAGCCTCA TCGACGACTT TTCGGCCAAG GACAAGATGC CGCATATCGC CATCTCGGTC
GACATGCTCG ACACCGGCAT CGACGTACCC GAGGTGGTTA ATCTGGTCTT CTTCAAGATT
GTTCGCTCGA AGACGAAGTT CTGGCAGATG GTCGGGCGTG GCACCCGCCT GTGCAAGGAC
CTGTTCGGCC CGGGCGAGGA CAAGCAGAGC TTCTACATTT TCGACTTCTG CCAGAACCTG
GAATTCTTCA GCCAGAACCC GAACTTCGTC GAGAGTTCGG CCGCCGAGCC CCTTAGCAAG
CGGCTCTTCG GAGCACGGCT GCAACTCATC TCCAGCCTGG ATGCCAAGTT GACCCGGGGG
CTAACCGCCA GCGACCAGGT CGCCGCGCCG TACGGTGGGC ACCTGACAGA AGCGCAACTG
CGGGCCGAGA CGGCCGCGAT GCTGCATGAG AACGTGGCTG CGATGAACCA GGACAACTTC
GTCGTCCGAC CGCACCGCCA GTACGTCGAG AAATACGCCA AATCCGAAGC GTGGCAGGTG
CTGGGGCCAG ATGACTTTGA TGTGCTGACC AATCGAGTCG CGGGCCTTCC GACCGAGCTC
GTCGATGAAG ACGAGGAGGC GAAGCGCTTC GATATGCTGG TGCTGCGCAC CCAACTCTCG
GTGCTGCAGG CATTGGCCGC CTTCACCGGT CTGAAGGAGA AGATTCAAGC CCTGGCCAGT
GCCCTGGAAG AGCAGTCAGC GATTCCGGCT ATCAACGCGG AGATGGTGCT CATTCAGGCG
GTCGCCAGTG AAGACTGGTG GGAAGGTGTG ACCGTACCGA TGCTCGAAAC GGTTCGCCGC
CGGCTACGCG CCTTGGTCAA GCTCATCCCC AAGGGGGAGA AGAAGGTCGT CTATACGGAT
TTCGAGGACG AGATTGGGGA CCTTTCCACC ATCGACCTCC CGCAAGTGAC GGCTGGCCTG
AACATGGCGA AGTTCAAGGA CAAGGCACGC GCCTTCCTGC GGGCTCACGA GTCACACCTG
GCGCTGCAGC GGCTGCGTCG CAATCAGCCT CTCACCCCGA CTGACCTTGT TGAGCTGGAA
AAGATGCTGC TGGAGGCTGG CGGGTCGCCA GAGCTCATCA GCGAAGCGAG GGAGAAAAGC
CACGGCCTCG GCATCTTCAT CCGCTCACTG GTGGGGCTCG ACCGGGAGAC AGCGATCCAG
GCCTTCAGCG ACTTCATCGG TGGCACTACG GCGACGCCGA ACCAAATTGA GTTCATCAAT
CTCGTGGTCG AGGAGCTGAC GCAGAACGGG GTGATGGAGC CGGGGCGGTT GTTCGAGTCG
CCGTATACAG ACATCAATGC GCAGGGGCCG TTGGGGGTGT TCCCGCCGGC AACGGTCACG
CAGATTGTGC AGGTGCTGGA GGGGATTCGG GAACGAGCCG TGGCCTAA
 
Protein sequence
MANTDARASC FYARRTLELG VAWLYKHDKS LKLPYQDNLS ALIHEPTFRQ TVGDALFTKA 
RLIKDLGNMA VHSAKKMAPA DAVNTTRELF HFCYWLARTY GRVARPNPSQ RFDIKLLPTA
SALPAQTVEQ LQKLEVELRA KDEKLFSLLS ERAALDEELR RLREEFAAIR QANTAQPDTH
DYSEAETRKL FIDTLLKEAG WHLDPAKNFE VEVTGMPNAE NKGYVDYVLW GDDGKPLGLI
EAKRTTKNPT VGQQQAKLYA DCLEAQYGQR PVIFYSNGYE HWIWDDRSYP PRAVQGFYKK
AELELLIQRR NSRKKLSEAV INSAIIERYY QTRAVRRVGE SFETDKLRKS LLVMATGAGK
TRTVIALADI LMRCNWAKRV LFLADRVALV NQAVNAFKAH LPDSAPVNLV TDKATEGRVY
VSTYPTMMGL IDEASNGENA GQRRFGVGHF DLIIIDEAHR SVYQKYRAIF DYFDSLLVGL
TATPKDEIDH NTYGLFDLET GVPTDAYGLD EAVADKHLVP PVPISVPLKF QREGIKYEDL
SEEEKEVWDA LEWSHDGTVP DEVNAEAVNK WLFNTDTVDK VLETLMTQGQ KVAGGDRLGK
TIIFAKNNDH ADFIAQRFNA NYPHYKGHFA RVVTYKTEYA QSLIDDFSAK DKMPHIAISV
DMLDTGIDVP EVVNLVFFKI VRSKTKFWQM VGRGTRLCKD LFGPGEDKQS FYIFDFCQNL
EFFSQNPNFV ESSAAEPLSK RLFGARLQLI SSLDAKLTRG LTASDQVAAP YGGHLTEAQL
RAETAAMLHE NVAAMNQDNF VVRPHRQYVE KYAKSEAWQV LGPDDFDVLT NRVAGLPTEL
VDEDEEAKRF DMLVLRTQLS VLQALAAFTG LKEKIQALAS ALEEQSAIPA INAEMVLIQA
VASEDWWEGV TVPMLETVRR RLRALVKLIP KGEKKVVYTD FEDEIGDLST IDLPQVTAGL
NMAKFKDKAR AFLRAHESHL ALQRLRRNQP LTPTDLVELE KMLLEAGGSP ELISEAREKS
HGLGIFIRSL VGLDRETAIQ AFSDFIGGTT ATPNQIEFIN LVVEELTQNG VMEPGRLFES
PYTDINAQGP LGVFPPATVT QIVQVLEGIR ERAVA
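
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a rough sketch of how that could be done, along with a GC fraction calculation of the kind that underlies the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used as a small illustration.
first_line = "TTGGCGAATA CGGATGCCCG AGCATCCTGC TTCTATGCGC GCCGCACGCT GGAACTGGGT"
dna = join_blocks(first_line)
print(len(dna), "bases")
print(f"GC fraction: {gc_fraction(dna):.2f}")
```

Passing the full gene sequence through `join_blocks` and then `gc_fraction` would give a value to compare against the GC content listed in the record.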