Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3175 |
Symbol | |
ID | 7874315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3448901 |
End bp | 3452248 |
Gene Length | 3348 bp |
Protein Length | 1115 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643700103 |
Product | type III restriction protein res subunit |
Protein accession | YP_002890147 |
Protein GI | 237653833 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCGAATA CGGATGCCCG AGCATCCTGC TTCTATGCGC GCCGCACGCT GGAACTGGGT GTCGCCTGGC TCTACAAGCA CGACAAGTCG CTGAAGCTGC CCTATCAAGA CAATCTCAGC GCGCTGATAC ACGAGCCGAC CTTCCGCCAG ACGGTCGGCG ATGCGCTCTT CACCAAAGCC CGGCTCATCA AGGACCTGGG CAACATGGCG GTCCACAGCG CCAAGAAGAT GGCGCCTGCC GACGCGGTGA ATACCACCCG CGAGCTTTTC CACTTTTGCT ACTGGCTTGC GCGCACCTAC GGGCGCGTCG CCCGTCCGAA CCCAAGTCAG CGGTTCGACA TCAAGCTGCT GCCAACCGCG TCCGCACTGC CGGCGCAAAC CGTCGAGCAA CTGCAAAAGC TTGAGGTCGA GCTACGGGCT AAGGACGAGA AGCTGTTTTC CCTGCTGTCC GAAAGGGCGG CACTGGATGA GGAACTGCGT CGCCTGCGCG AAGAGTTCGC CGCGATACGG CAGGCCAACA CGGCCCAGCC TGATACGCAC GACTATTCCG AAGCCGAGAC CCGCAAGCTC TTCATCGACA CGCTACTGAA GGAAGCCGGT TGGCATCTCG ACCCGGCCAA GAACTTCGAA GTCGAAGTCA CCGGGATGCC CAATGCCGAA AACAAGGGCT ACGTCGATTA CGTGCTGTGG GGCGACGACG GCAAGCCGCT CGGTTTGATT GAGGCGAAAC GCACCACCAA GAACCCCACG GTCGGGCAGC AGCAGGCCAA GCTCTACGCG GACTGCCTGG AGGCGCAATA CGGTCAGCGC CCGGTCATCT TCTACTCGAA CGGCTACGAG CACTGGATAT GGGACGACAG GTCCTATCCG CCGCGCGCGG TGCAGGGGTT CTACAAGAAG GCTGAGCTCG AGCTCCTCAT CCAGCGGCGC AACAGCCGCA AGAAGCTGTC AGAAGCCGTC ATCAACAGCG CCATCATCGA GCGCTACTAC CAGACGCGGG CGGTGCGCCG TGTCGGGGAA AGCTTCGAGA CCGACAAGCT GAGGAAGTCG CTGCTGGTGA TGGCGACCGG TGCCGGCAAG ACCCGGACAG TGATTGCACT GGCCGACATC CTGATGCGGT GCAACTGGGC CAAGCGCGTG CTGTTCCTCG CCGACCGGGT GGCGCTCGTC AATCAGGCGG TGAATGCCTT CAAGGCGCAT CTGCCGGATT CGGCGCCAGT GAACCTGGTG ACCGACAAGG CCACCGAAGG GCGGGTGTAT GTGTCGACCT ACCCGACGAT GATGGGCCTG ATTGACGAGG CCTCGAATGG GGAAAACGCA GGCCAGCGCC GCTTTGGCGT CGGTCACTTC GACCTCATCA TCATCGACGA GGCGCATCGC TCCGTCTACC AGAAGTACCG CGCCATCTTC GACTACTTCG ACTCGCTGCT GGTGGGGCTT ACCGCCACGC CCAAGGACGA AATCGACCAC AACACCTACG GTCTGTTCGA CCTCGAAACC GGCGTGCCGA CCGACGCTTA CGGCCTCGAC GAAGCCGTGG CCGACAAGCA CCTGGTGCCG CCGGTCCCCA TCTCGGTGCC GCTCAAATTC CAGCGCGAAG GCATCAAGTA CGAGGACCTT TCCGAGGAGG AGAAGGAAGT CTGGGACGCT CTCGAGTGGA GCCACGACGG GACGGTGCCG GACGAGGTGA ATGCCGAGGC CGTGAACAAG TGGCTGTTCA ATACGGACAC CGTCGACAAG GTGCTCGAGA CCCTGATGAC CCAGGGGCAG AAGGTGGCCG GAGGCGACCG GCTGGGCAAG ACCATCATTT TTGCCAAGAA CAACGACCAC GCTGACTTCA TCGCCCAGCG CTTCAATGCC AACTACCCGC ACTACAAGGG CCACTTCGCA CGGGTGGTGA CCTACAAGAC CGAATACGCC CAGAGCCTCA TCGACGACTT TTCGGCCAAG GACAAGATGC CGCATATCGC CATCTCGGTC GACATGCTCG ACACCGGCAT CGACGTACCC GAGGTGGTTA ATCTGGTCTT CTTCAAGATT GTTCGCTCGA AGACGAAGTT CTGGCAGATG GTCGGGCGTG GCACCCGCCT GTGCAAGGAC CTGTTCGGCC CGGGCGAGGA CAAGCAGAGC TTCTACATTT TCGACTTCTG CCAGAACCTG GAATTCTTCA GCCAGAACCC GAACTTCGTC GAGAGTTCGG CCGCCGAGCC CCTTAGCAAG CGGCTCTTCG GAGCACGGCT GCAACTCATC TCCAGCCTGG ATGCCAAGTT GACCCGGGGG CTAACCGCCA GCGACCAGGT CGCCGCGCCG TACGGTGGGC ACCTGACAGA AGCGCAACTG CGGGCCGAGA CGGCCGCGAT GCTGCATGAG AACGTGGCTG CGATGAACCA GGACAACTTC GTCGTCCGAC CGCACCGCCA GTACGTCGAG AAATACGCCA AATCCGAAGC GTGGCAGGTG CTGGGGCCAG ATGACTTTGA TGTGCTGACC AATCGAGTCG CGGGCCTTCC GACCGAGCTC GTCGATGAAG ACGAGGAGGC GAAGCGCTTC GATATGCTGG TGCTGCGCAC CCAACTCTCG GTGCTGCAGG CATTGGCCGC CTTCACCGGT CTGAAGGAGA AGATTCAAGC CCTGGCCAGT GCCCTGGAAG AGCAGTCAGC GATTCCGGCT ATCAACGCGG AGATGGTGCT CATTCAGGCG GTCGCCAGTG AAGACTGGTG GGAAGGTGTG ACCGTACCGA TGCTCGAAAC GGTTCGCCGC CGGCTACGCG CCTTGGTCAA GCTCATCCCC AAGGGGGAGA AGAAGGTCGT CTATACGGAT TTCGAGGACG AGATTGGGGA CCTTTCCACC ATCGACCTCC CGCAAGTGAC GGCTGGCCTG AACATGGCGA AGTTCAAGGA CAAGGCACGC GCCTTCCTGC GGGCTCACGA GTCACACCTG GCGCTGCAGC GGCTGCGTCG CAATCAGCCT CTCACCCCGA CTGACCTTGT TGAGCTGGAA AAGATGCTGC TGGAGGCTGG CGGGTCGCCA GAGCTCATCA GCGAAGCGAG GGAGAAAAGC CACGGCCTCG GCATCTTCAT CCGCTCACTG GTGGGGCTCG ACCGGGAGAC AGCGATCCAG GCCTTCAGCG ACTTCATCGG TGGCACTACG GCGACGCCGA ACCAAATTGA GTTCATCAAT CTCGTGGTCG AGGAGCTGAC GCAGAACGGG GTGATGGAGC CGGGGCGGTT GTTCGAGTCG CCGTATACAG ACATCAATGC GCAGGGGCCG TTGGGGGTGT TCCCGCCGGC AACGGTCACG CAGATTGTGC AGGTGCTGGA GGGGATTCGG GAACGAGCCG TGGCCTAA
|
Protein sequence | MANTDARASC FYARRTLELG VAWLYKHDKS LKLPYQDNLS ALIHEPTFRQ TVGDALFTKA RLIKDLGNMA VHSAKKMAPA DAVNTTRELF HFCYWLARTY GRVARPNPSQ RFDIKLLPTA SALPAQTVEQ LQKLEVELRA KDEKLFSLLS ERAALDEELR RLREEFAAIR QANTAQPDTH DYSEAETRKL FIDTLLKEAG WHLDPAKNFE VEVTGMPNAE NKGYVDYVLW GDDGKPLGLI EAKRTTKNPT VGQQQAKLYA DCLEAQYGQR PVIFYSNGYE HWIWDDRSYP PRAVQGFYKK AELELLIQRR NSRKKLSEAV INSAIIERYY QTRAVRRVGE SFETDKLRKS LLVMATGAGK TRTVIALADI LMRCNWAKRV LFLADRVALV NQAVNAFKAH LPDSAPVNLV TDKATEGRVY VSTYPTMMGL IDEASNGENA GQRRFGVGHF DLIIIDEAHR SVYQKYRAIF DYFDSLLVGL TATPKDEIDH NTYGLFDLET GVPTDAYGLD EAVADKHLVP PVPISVPLKF QREGIKYEDL SEEEKEVWDA LEWSHDGTVP DEVNAEAVNK WLFNTDTVDK VLETLMTQGQ KVAGGDRLGK TIIFAKNNDH ADFIAQRFNA NYPHYKGHFA RVVTYKTEYA QSLIDDFSAK DKMPHIAISV DMLDTGIDVP EVVNLVFFKI VRSKTKFWQM VGRGTRLCKD LFGPGEDKQS FYIFDFCQNL EFFSQNPNFV ESSAAEPLSK RLFGARLQLI SSLDAKLTRG LTASDQVAAP YGGHLTEAQL RAETAAMLHE NVAAMNQDNF VVRPHRQYVE KYAKSEAWQV LGPDDFDVLT NRVAGLPTEL VDEDEEAKRF DMLVLRTQLS VLQALAAFTG LKEKIQALAS ALEEQSAIPA INAEMVLIQA VASEDWWEGV TVPMLETVRR RLRALVKLIP KGEKKVVYTD FEDEIGDLST IDLPQVTAGL NMAKFKDKAR AFLRAHESHL ALQRLRRNQP LTPTDLVELE KMLLEAGGSP ELISEAREKS HGLGIFIRSL VGLDRETAIQ AFSDFIGGTT ATPNQIEFIN LVVEELTQNG VMEPGRLFES PYTDINAQGP LGVFPPATVT QIVQVLEGIR ERAVA
|
| |