Gene Information |
Locus tag | Tmz1t_2987 |
Symbol | |
ID | 7874377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3240741 |
End bp | 3241796 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699908 |
Product | peptidase U32 |
Protein accession | YP_002889963 |
Protein GI | 237653649 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
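
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming one trailing stop codon and no splicing."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Start bp / End bp fields of this record.
length = gene_length_bp(3240741, 3241796)
protein = protein_length_aa(length)
print(length, protein)  # prints "1056 351"
```

Both results agree with the Gene Length (1056 bp) and Protein Length (351 aa) fields listed in the record.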
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.654873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCG TCGCCCCCAT CCGGCAACTC GACGAGATCG CCGCGCTGGC GCGCGCCGGG GCGGACGAAC TCTATTGCGG GGTGACGCCG CGCGAGTGGG CCGAGCGCTT CGGTGGCGCC AGCGCCAACC GCCGTCCCGG AGGCAACCTC CCCTCGCTGG CCGCGCTCGC CGAGGCGGTG GCGCTCGCCC ACCGCAATGG CGTACAGCTC TCCCTGGTGC TCAACGCACA GCAGTATTCG GTCGAACAGA TCGAGTTCGC GCTCGCGATC GCGCACCGCT ACGTCGACAT GGGGGGCGAT GCGGTGATCG CGAGCGACCC CGGCCTGCTC CTGGCGCTGG CCGAAGCCGA GCCGGAGCTG CGCATCCACG TCAGCTCGGT GGCCACCTGC CGCAATGCCG ACGGCGCCCG CCTCTACCGC GAGCTCGGCG CGCGCCGGCT GATCCTGCCG CGCGACATCA CCCTCGACGA GGCCGCGGAG ATCGCCGCCG CAGTGCCCGA CCTGGAGATC GAGGCCTTCG TGCTCAACGA CGGCTGCGTC TACGAGGAAG GCAGCTGCAA CACCCTGCAC CTCCCGGGCG CGCTCGGCGG GCCGATCTGC CTGGACCGCT ACGCCTACGC GCACCGCCAC CGCGACGGCC GGCCGCTCTC GGCCGCGCTC GCGGCCCGCC TGCAGGAGAA CGACGAGGCC TATCGGCGCT GGCTGTGGTA CCGCTTCTCC TGCGGCTTCA CCACCACCGC CGACGGCCTG CCCTTCGGCC CCTGCGGCCT GTGCGCGATC CCGGCGTTCG GGCGCGGCGG CATCCACGCG CTCAAGATCG CCGGCCGCGA GGGTCCGCCC GAGCGCAAGC TCGCCAGCGT GCGCATGGTC CGGCGGATCC TCGACGCCCA CGACAACGGC GAAGCCCCCG CGGCGGTGAT GGCCCGTGCG CGCAACCTGC GGCCTGCGCA CGAACACTGC GCGACCGGCT TCATGTGCTA CTACCCGGAG GTCGTCTCCC GCGCATCCGA AGCGGCGCAG CCGCTGTGCG ACGGTCGCGC AGCAGGTGCT CAGTAG
Protein sequence | MKIVAPIRQL DEIAALARAG ADELYCGVTP REWAERFGGA SANRRPGGNL PSLAALAEAV ALAHRNGVQL SLVLNAQQYS VEQIEFALAI AHRYVDMGGD AVIASDPGLL LALAEAEPEL RIHVSSVATC RNADGARLYR ELGARRLILP RDITLDEAAE IAAAVPDLEI EAFVLNDGCV YEEGSCNTLH LPGALGGPIC LDRYAYAHRH RDGRPLSAAL AARLQENDEA YRRWLWYRFS CGFTTTADGL PFGPCGLCAI PAFGRGGIHA LKIAGREGPP ERKLASVRMV RRILDAHDNG EAPAAVMARA RNLRPAHEHC ATGFMCYYPE VVSRASEAAQ PLCDGRAAGA Q
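
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a definitive tool: it removes the block spacing, computes the GC fraction, and checks for a terminal stop codon. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11. Only the first two and last two blocks of the gene sequence are used here for brevity.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS_TABLE_11


# First two and last two blocks of the gene sequence in this record.
head = clean_sequence("ATGAAGATCG TCGCCCCCAT")
tail = clean_sequence("AGCAGGTGCT CAGTAG")

print(len(head), gc_fraction(head))
print(head.startswith("ATG"), ends_with_stop(tail))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.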