Gene Information |
Locus tag | Tmz1t_0105 |
Symbol | |
ID | 7085203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 121251 |
End bp | 122720 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697152 |
Product | protease Do |
Protein accession | YP_002353801 |
Protein GI | 217968567 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
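The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 121251
end_bp = 122720

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the gene length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1470 489
```

Both results agree with the Gene Length (1470 bp) and Protein Length (489 aa) rows.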
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAC TCCTGACCGT CACCGCCGTC GCGCTCGCGC TCGGCACCCT CTTTCCCCAA GCCTCGGCGC TGAGCGCCGC GCACGCCCAG GCGGCCGCCC CGTCGATGGT CTCGCCCAAC CGCTTCGGCC TGCCCGACTT CGGCGACCTG GTCGAGCAGG TGGGCCCGGC GGTGGTCAAT ATCCGCGTGG TGCAGCGCGC GGCCCAGGGC ACGGCGGGCG AGAACCCGCT CGCCAACGAC CCCTTCTACG ACTTCCTGCG CCGCTTCGGC GTGCCCATGC CGGGCTTCCC CGGCCAGCCC GGCATGGGGC CGCGCGCGCG CCAGGGCATC GGCTCGGGCT TCATCGTCAG CAAGGACGGC TACGTGCTCA CCAACGCCCA CGTGGTCGCC GGCGAGGATG GCGACGCCGC GCTCTCCGAG GTCACCGTCA CCCTGATCGA CAAGCGCGAG TTCAAGGCCA AGGTGGTCGG CATCGACCGT CGTACCGACG TCGCCCTGCT CAAGCTCGAC GCCAGCGGCC TGCCCGCGGT GAAGATCGGC AACCCCGACC AGGCCCGCGT GGGCGAATGG GTGGTGGCGA TGGGCTCGCC CTTCGGCTTC GACAACACGG TCACCGCCGG CATCATCTCG GCCAAGGCGC GCCGCCTGCC CGACGAGACC TACGTGCCTT TCATCCAGAC CGACGTGGCG ATCAACCCGG GCAACTCCGG CGGTCCGCTG TTCAACCTCG CCGGCGAGGT GATCGGCATC AACTCGCAGA TCTACTCGCG CTCGGGCGGC TTCATGGGGA TCTCGTTCGC GATCCCGATC GACGTCGCGA TGAACATCAA GGACCAGCTC GTCAGCCACG GCCGCGTGCA GCGCGGCCGC CTCGGCATCG CCATCCAGAA CGTCGACAAG GACCTCGCCC AATCCTTCGG CCTCACCGAT CCGCGCGGCG CGCTGGTCGC CAGCGTCGAG CCGGGCAGCG CCGCCGACAA GGCCGGGCTG CAGGCGGGCG ACGTGGTGCT CGCGGTCGAT GGCCGCCGCA TCGACGACTC TGCCGAGCTG CCGCGGGTGA TCGGCGAGAA ACGCCCGGGC ACGCGCGTGA AGCTGGAGCT GTGGCGCGAC GGCCGCAGCC GCGAGGTCGC CGCCACGCTG GACGAGCTCA AGGCCGAGAC CGTGGCCGGC AGCGCACCGG CGCCCAGCCA GGTGGAGCGT ATCGGCGAGC AGTTCGGACT GAGCGCGCGT GCGCTCACCG CGGAAGAGAC CGCCCGCCTG CAGCTGCGCG GCGGGGTGGT GGTGGAGGGC GCCGACGGCG CCGCGGCCCG CGCCGGCCTG CAGCGCGGCG ACGTCATCCT GGCGATCAAC AACCAGCCGG TCGCCAGCGT CGCAGACCTG CGCGCCCAGC TCGAGCGCGC CGGCAAGCGC TTCGCGCTGC TGATCCAGCG TGGCGAGGCA CGCATCTTCG TGCCGGTACG CCTGGAGTGA
|
Protein sequence | MRKLLTVTAV ALALGTLFPQ ASALSAAHAQ AAAPSMVSPN RFGLPDFGDL VEQVGPAVVN IRVVQRAAQG TAGENPLAND PFYDFLRRFG VPMPGFPGQP GMGPRARQGI GSGFIVSKDG YVLTNAHVVA GEDGDAALSE VTVTLIDKRE FKAKVVGIDR RTDVALLKLD ASGLPAVKIG NPDQARVGEW VVAMGSPFGF DNTVTAGIIS AKARRLPDET YVPFIQTDVA INPGNSGGPL FNLAGEVIGI NSQIYSRSGG FMGISFAIPI DVAMNIKDQL VSHGRVQRGR LGIAIQNVDK DLAQSFGLTD PRGALVASVE PGSAADKAGL QAGDVVLAVD GRRIDDSAEL PRVIGEKRPG TRVKLELWRD GRSREVAATL DELKAETVAG SAPAPSQVER IGEQFGLSAR ALTAEETARL QLRGGVVVEG ADGAAARAGL QRGDVILAIN NQPVASVADL RAQLERAGKR FALLIQRGEA RIFVPVRLE
|
| |
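The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard one. To my knowledge, translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs mainly in which start codons are permitted, which does not matter here because the gene starts with ATG.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGCGCAAAC TCCTGACCGT"
print(translate(first_blocks))  # MRKLLT
```

The output matches the first six residues of the protein sequence (MRKLLTVTAV ...). Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content rows.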