Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3599 |
Symbol | |
ID | 7873104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3947418 |
End bp | 3949016 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643700539 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_002890569 |
Protein GI | 237654255 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.743632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATT CCGCTCCACA GGCCAGCGAT CTTCCCGCAG GCTGGGACGT CGCGTCATTC GGTGAACTCA ACTCGTTTAG CGGCAGCACG GTAAACCCCG CGACACGGCC AGACGAAGTC TTTGAGCTCT ACAGCGTGCC GAGCTTCCCA ACAAAGCACC CCGAGCAGCT ACCGGGACGT GCGATCGGCT CGACGAAGCA GACCGTCAGG CCTGGTGACG TCCTGGTCTG CAAGATCAAC CCCCGCATTA ATCGTGTTTG GACAGTCGGT ACCCGTCGCG ATCACGAGCA AATTGCCTCG TCAGAGTGGA TCGGGTTCCG ATCTGACGCC ATGGTGCCGC GGTTCGCGAA GCACTACTTC AGCGAACCGT CATTCCGGTC GCTCTTGTGC AGCGAGGTCT CCGGCGTAGG CGGCTCCCTG ACCCGCGCCC AGCCAAGTCG CGTAGCCAAG TATCCTGTCC TCGTTGCGCC GCTGGCAGAA CAGGCCCGCA TCGCCGACCA ACTCGAGGCC CTGCTGGCGC GTATCCAGGC CTGTCAGGAC CGCCTGGAGG CCATTCCGGC GTTGCTCAAG CGGTTTCGAA AGCTGGTTCT CTCGTCTGCG CTTTCTGGCG ACCTGACTGA AGTCTGGCGG GCCGAACAAG GAGTGGGCTT AGATACTTGG TCGGCGAGGA CGATTGCTGA CGTCGCGGAA GTTGGGACTG GATCCACCCC TCTTCGATCA AACAGCAACT TCTACGCAGA GACCGGGACC CCTTGGGTGA CAAGCGCGGC CACGAGTCGC CCTTACATCG ACTCTGCCGA CCAGTACGTG ACTAAAGCGG CAATCGATGC ACACAGGCTC AGGGTCTACC GCCCCGGGAC ACTGATCATT GCTATGTACG GTGAAGGGAA GACTCGTGGG CAAGTCAGCG AGCTTCGAAT TGACGCGACC ATCAATCAGG CCTGCGCTGC GATAACTGTC GATGAGCAGC AAGCCAACGC CGCCTTCGTC AAGCTTGCAC TCTTGTCGCA GTACGAGCAA ACGCGCGCGC TTGCGGAAGG CGGCGCGCAG CCAAATCTGA ACTTGTCCAA GGTGCGCGGA ATTCCACTAC GCCTGCCAGA AGGGCCCGAA CAAGCTCAGA TCGTTCATCG AGTTGGAGAA CTGTTCGCTT TTGCCGACAC CATCGATTCT CGCGTCGCTG CGGCAACAGG CAAGACACGG AAGCTTCCCT CGCTCACTCT CGCCAAAGCC TTCCGCGGCG ACTTGGTTCC GCAAGATCCC ACCGACGAGC CGGCCAGCGT CTTGCTGGCC CGTATTGCCG CCCAACGCGC AGCGCCCCCG CATGCCGCCT CGGCAACCAC ACCGCGCCGC GGCCGCCCAC CCCGTGCCCC GAAGGAAACC GCCGCCATGA CCAAGAGCCG CCAGGACGAC GACGTGACGG GTCAGCCCTA CCTGGCCGCG CACCTGCACC GCATCGGCAC GCCCGCCAGC GCCGAAGCAC TGTTCAAGGT GGCCGAGTTG CCCGTCGCCG ACTTCTACAA GCAACTCGCT TGGGAGGTGG CGCAAGGCCA CGTGAAGGAC AACCAGACCA CGCTGGAGCC CGGGCATGCG GCTGGATAA
|
Protein sequence | MIDSAPQASD LPAGWDVASF GELNSFSGST VNPATRPDEV FELYSVPSFP TKHPEQLPGR AIGSTKQTVR PGDVLVCKIN PRINRVWTVG TRRDHEQIAS SEWIGFRSDA MVPRFAKHYF SEPSFRSLLC SEVSGVGGSL TRAQPSRVAK YPVLVAPLAE QARIADQLEA LLARIQACQD RLEAIPALLK RFRKLVLSSA LSGDLTEVWR AEQGVGLDTW SARTIADVAE VGTGSTPLRS NSNFYAETGT PWVTSAATSR PYIDSADQYV TKAAIDAHRL RVYRPGTLII AMYGEGKTRG QVSELRIDAT INQACAAITV DEQQANAAFV KLALLSQYEQ TRALAEGGAQ PNLNLSKVRG IPLRLPEGPE QAQIVHRVGE LFAFADTIDS RVAAATGKTR KLPSLTLAKA FRGDLVPQDP TDEPASVLLA RIAAQRAAPP HAASATTPRR GRPPRAPKET AAMTKSRQDD DVTGQPYLAA HLHRIGTPAS AEALFKVAEL PVADFYKQLA WEVAQGHVKD NQTTLEPGHA AG
|
| |