Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2373 |
Symbol | |
ID | 7094295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011667 |
Strand | + |
Start bp | 35503 |
End bp | 36504 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643701061 |
Product | restriction endonuclease-like protein |
Protein accession | YP_002364202 |
Protein GI | 217980152 |
COG category | [V] Defense mechanisms |
COG ID | [COG3440] Predicted restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0000067766 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCTTG CTCAGTCAGA TCTCGCTACG CATATGACAA TAATCTGCCT AACACCTCAG CCAGACTGGG ATGCTCCGAT CTTCAAGATT CTGGCGAACA ACGACACCGG GAGTGCTCCG GGGCATCAGG GCGGGATTGT TATCCCGAAG GATCTGCGTT CTTTCTTTCC AGGGCTTGTG GACAACACGT CGCACTACCG GCCAACGGTT GATCAACGTA TTGATGCCCA GCTTTTTGAC GGAGACAAAT TTCTAGCGAC AGTGAACACT CGCTACCAGT ATCAGACATG GGGCGGCGCG CGCAGTCCAG AGTCGCGTTT GACGGATCAG CTCTCTACTC TCAGAAATCG CGCAAGCGGT GGCGACATCC TACTCATCCA GCGGAATATC AGCACTCTCG ATCAGTACCG CCTCGTACTA GTACGTCAGT CGAGCCCTGA TTTTGCGCTG GTCATGCGCC TTGCGGCGGG GAGGCGCTGG GGTGTGCTTT ACCCAGAACG AGTTCCCCTT GCAGACGATG ATCTAACAGA TGCGTTTAAG GAAGAACTCG AGCGTGAGGG CAAGCCTTTC AAACTCATTG ACGACGAAGC TGGCACAACA ACCGTAACTG TGAAGAAGGT TGCCCGGGCG CTTGCGTTTA GGACGATCGT TATTGCGCTT TACGACGAAC GCTGCGCAGT TTGTGGAGAG GGGCTGAAGT CACCTGCTGG GGCTACTGAG GTTGAAGCTG CTCATGTTGT CCCCCGCTCT CAGTTCGGTG CGGATGATGC CCGAAACGGT GTTTCACTGT GTAAGGCACA TCACTGGGCG TTCGATAGAG GCCTGTTCGG CGTAGGGGAT GATCGCACTG TCGTTGTTCC AAGCTCAGTA CGGTCACTTG TGCAGAACAA AAGCATCTCC ACGTTTTTGG GTAGGCGAAT CAGGGAGGCC AGTGACCCTA GGCGCTCTGT TCATCCTGAT GCTTTTGCGT GGCACCGCAA GAATCTGCTC CTCAGCGGCT AG
|
Protein sequence | MVLAQSDLAT HMTIICLTPQ PDWDAPIFKI LANNDTGSAP GHQGGIVIPK DLRSFFPGLV DNTSHYRPTV DQRIDAQLFD GDKFLATVNT RYQYQTWGGA RSPESRLTDQ LSTLRNRASG GDILLIQRNI STLDQYRLVL VRQSSPDFAL VMRLAAGRRW GVLYPERVPL ADDDLTDAFK EELEREGKPF KLIDDEAGTT TVTVKKVARA LAFRTIVIAL YDERCAVCGE GLKSPAGATE VEAAHVVPRS QFGADDARNG VSLCKAHHWA FDRGLFGVGD DRTVVVPSSV RSLVQNKSIS TFLGRRIREA SDPRRSVHPD AFAWHRKNLL LSG
|
| |