Gene Information |
Locus tag | Tmz1t_2222 |
Symbol | |
ID | 7083654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2503120 |
End bp | 2504457 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699242 |
Product | peptidase M48 Ste24p |
Protein accession | YP_002355858 |
Protein GI | 217970624 |
COG category | [R] General function prediction only |
COG ID | [COG4784] Putative Zn-dependent protease |
TIGRFAM ID | |
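
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2503120
end_bp = 2504457
protein_length_aa = 445

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons. The last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)
```

Under these assumptions the computed gene length and protein length agree with the "Gene Length" and "Protein Length" rows of the table.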
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTCT TCGACGACCT CGCCCGCAGC CTCGAACGCA ACCGCATCAC CCGCCGCCAG GCGCTGTGGC TGCTCGGCGC GGGCGCGGCG GCCGGCCTGT CGGGCTGCGC CACCTCCCCG GTCACCGGCG AGACCATCCT CGTCGGCATG AGCGAGGCGC AGGAGAAGCA GACCGACGCC CAGGTCGCGC CGCACCAGTT CTCGCAGGAC CTCGGCGCCA TCCAGGACGA GGCGGTCAAC CGCTACGTCG CCGGCATCGG CCAGCGCATG GGCACGCTCA CCCACCGTCC GCAGATGCCC TACTCCTACC GCGTGCTCAA CGCCAACTAC GTCAACGCCT ACACCTTCCC CGGCGGCGCG ATGGGCGTGA CGCGCGGCAT CCTCGCCGAC CTCGACGACG AGGCCCAGCT CGCCGCCCTG CTCGGCCACG AGCTCGGCCA CGTCAACGCC CGCCACGCCG CCCAGCGCCA GGGCCAGAAC CTGGTCGCGC AGGCGGCGCT CGCCGGGCTC AACGTGGCGG CACAGAGCTC CGACTGGGGC GGGCTGATGA GCATGGGCGG ACAGATCGGC GCCAGCGCCC TGCTCGCCGG CTACTCGCGC GAGCACGAGC GCGAGGCCGA TGCGCTCGGG CAGGAATATC TCGTCAAGGC CGGCTACCCG GCGACCGGCA TGGTGCGCCT GCACCAGTTG CTGGTTGCCG AGGAAAAATC CGCCCCCTCG CTGCTGCAGA CGATGTTCTC CACCCACCCG ATGAGCAGCG AGCGCATGCA GGCCGCGCAG GCCGCGGCCG ACGCGCGCTA CCGCATCAGC AACAGCCTGG ACGCCCGCCG CGAGCGCTTC ATGGACAGCA CCGCCAGCCT GCGCCGCATC CGCCCCACCA TCGACGCCTG CAAGAACGGC GAAACCGCGA TGGCCGCCAG GCAGTACCCC AAGGCGCAGG CCGAATTCCA GACCGCGCTG GCCAGGACCC CGCGCGACTA CGCCAGCAAC CTGCGCATGG CCCAGTGCCT GCAGGCCCAG GGCCAGACCG CGAAGGCGGT GGACTACGCC GACAACGCGC GCGAGATCTA CCCGCAGGAG GCGCAGGCCT ACAAGCTCGC CGGCGTGCTC GCCCTGCAGC AGCGCGACGC CGGCCGCGCC TACCAGAACC TCGACCGCTT CGACCGCCTG CTCCCCGGCG ACGCCGGCAT CACCTTCCTG AAGGGCATCT CGCTCGAAGG CATGGGCAAC CGCCAGGCCG CCGCCCAGCA CTACGCCGCC TACCTGCGCC AGAGCCAGCA GGGCAACGCC GCGCAGTACT CGTACAACCG GCTCAAGGCC TGGGGGATGG TGAAGTAG
Protein sequence | MSFFDDLARS LERNRITRRQ ALWLLGAGAA AGLSGCATSP VTGETILVGM SEAQEKQTDA QVAPHQFSQD LGAIQDEAVN RYVAGIGQRM GTLTHRPQMP YSYRVLNANY VNAYTFPGGA MGVTRGILAD LDDEAQLAAL LGHELGHVNA RHAAQRQGQN LVAQAALAGL NVAAQSSDWG GLMSMGGQIG ASALLAGYSR EHEREADALG QEYLVKAGYP ATGMVRLHQL LVAEEKSAPS LLQTMFSTHP MSSERMQAAQ AAADARYRIS NSLDARRERF MDSTASLRRI RPTIDACKNG ETAMAARQYP KAQAEFQTAL ARTPRDYASN LRMAQCLQAQ GQTAKAVDYA DNAREIYPQE AQAYKLAGVL ALQQRDAGRA YQNLDRFDRL LPGDAGITFL KGISLEGMGN RQAAAQHYAA YLRQSQQGNA AQYSYNRLKA WGMVK
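
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch rather than the database's own pipeline: it builds the codon table by hand from the standard NCBI codon ordering (translation table 11 assigns the same amino acids to internal codons as the standard code and differs only in permitted start codons), and the helper names `clean`, `gc_percent`, and `translate` are hypothetical. Only the first and last few codons of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)

def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 18 bases of the gene sequence shown above.
print(translate("ATGAGCTTCT TCGACGACCT CGCCCGCAGC"))  # MSFFDDLARS
print(translate("TGGGGGATGG TGAAGTAG"))               # WGMVK
```

The two printed fragments correspond to the first ten and last five residues of the protein sequence in the record. Applying `gc_percent` to the full gene sequence would be the analogous check for the "GC content" row.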