Gene Information |
Locus tag | Tmz1t_0717 |
Symbol | |
ID | 7083946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 798928 |
End bp | 799944 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697743 |
Product | peptidase U32 |
Protein accession | YP_002354385 |
Protein GI | 217969151 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
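The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of any database tooling: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows above.
length = gene_length(798928, 799944)
print(length)                  # 1017
print(protein_length(length))  # 338
```

Under those assumptions the results agree with the Gene Length (1017 bp) and Protein Length (338 aa) rows.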
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTGT CTTCCCCGAT CGAACTCGTC TGCCCCGCGG GCAGCCTCCC GGCCCTCAAG ACGGCCATCG ATCACGGCGC CGACTGCGTC TATCTCGGTT TCAAGGACGC CACCAACGCA CGCAACTTTT CCGGGCTCAA CTTCGATCCG GCGCAGGTGC GCGAGGGTGT CGCCTATGCG CACGCACGCA AGCGCAAGGT GCTGCTGGCG CTCAACACCT ATCCCCAGAC CGACAACTGG TCGTCCTGGA CACAGGCGAT CGACCGCGCC GCCGACTACG GCGTTGACGC GGTGATCCTC GCCGATCCCG GGCTCATGGC CTACGCCGCA AAGACCCACC CGCAGCTGCG CCTGCACCTG TCGGTGCAGG GCTCGGCGAC CAGCTACGAG GCGATCAACT GGTACCGCGA ACGCTTCGGC ATCCAGCGCG CGGTGCTGCC GCGCGTGCTG TCGATGGCCC AGGTCGAGAG CGTGGTCGCC AACACCCAAG TCGAGATCGA GGTGTTCGGC TTCGGCGGCC TGTGCGTGAT GGTCGAGGGC CGCTGCGCAC TGTCGGCCTA CGCCACCGGC GAGTCGCCCA ACTGCAACGG CGCCTGCTCG CCGGGCAAGC ACGTGCGCTG GGAAGACACC CCGCGCGGCA TGGAGACGCG GCTCAACGGC ATCCTCATCG ACCGCTTCTC CGAAGGCGAG CGTGCCGGCT ATCCCACCCT GTGCAAGGGT CGCTTCGAGG TCAACGACGA GACCTACTAC GCCCTCGAGG AGCCCACCAG CCTCAACACG CTCGAGTTGC TGCCCGAACT CGCGCGCATC GGCATCCGCG CGATCAAGAT CGAGGGTCGC CAGCGCAGCC CGGCCTACGT CGCCCAGGTC ACCAAGGTGT GGCGTGCCGC GCTCGACAAG TTGAACAGCG GCGGCGCCGC AGGTTTCAGC GTGCTGCCGG CGTGGATGGC CGAACTCAAC AAGGTCTCCG AAGGCCAGAG CCACACCCTC GGCGCCTACT ACCGCCCGTG GAAGTGA
|
Protein sequence | MSVSSPIELV CPAGSLPALK TAIDHGADCV YLGFKDATNA RNFSGLNFDP AQVREGVAYA HARKRKVLLA LNTYPQTDNW SSWTQAIDRA ADYGVDAVIL ADPGLMAYAA KTHPQLRLHL SVQGSATSYE AINWYRERFG IQRAVLPRVL SMAQVESVVA NTQVEIEVFG FGGLCVMVEG RCALSAYATG ESPNCNGACS PGKHVRWEDT PRGMETRLNG ILIDRFSEGE RAGYPTLCKG RFEVNDETYY ALEEPTSLNT LELLPELARI GIRAIKIEGR QRSPAYVAQV TKVWRAALDK LNSGGAAGFS VLPAWMAELN KVSEGQSHTL GAYYRPWK
|
| |
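The sequence rows can be checked against the GC content and protein sequence fields in a similar way. This is a minimal sketch with hypothetical helper names. It assumes the sequence is pasted in the space-separated 10-base blocks used above and is already in coding orientation (the record starts with ATG and ends with TGA). The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11 differs only in the start codons it permits. Only the first two blocks of the gene sequence are used for the demonstration.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGTCCGTGT CTTCCCCGAT"   # first two blocks of the gene sequence
print(translate(prefix))           # MSVSSP
print(gc_fraction(prefix))         # 0.55
```

The translated prefix matches the first six residues of the protein sequence row. Applying the same functions to the full gene sequence would be expected to give a 338-residue protein and a GC fraction near the listed 67%, although that run is not shown here.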