Gene Information |
Locus tag | Tmz1t_0916 |
Symbol | |
ID | 7085019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1003950 |
End bp | 1005101 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697939 |
Product | HtrA2 peptidase |
Protein accession | YP_002354579 |
Protein GI | 217969345 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family [TIGR02038] periplasmic serine peptidase DegS |
|
|
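The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1003950
END_BP = 1005101
GENE_LENGTH_BP = 1152
PROTEIN_LENGTH_AA = 383


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1152
    print(coding_codons(GENE_LENGTH_BP))   # 383
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.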
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGCT TGTGGCTCAT CTTCGCGCAG GCGGTGACCG TCAGCGTCGC CGTGCTGTTC GTGCTCAACA CCCTCAAGCC GGAATGGCTG CGCAAGGCCA CGCCCGCGGC GGTGGTGGCG ATCCTCGAGG CACCTGCGGG CAGCGAGCGT GCGCCGGCGG CAGGCTCCTA CGCCCCGGCT GCACAGCGTT CGATGCCGGC GGTAGTGCAC ATCTTCACCA GCAAGGGCGG TCGCGGCCAG CGCCACCCGC TGCTCGACGA CCCGCTGTTC CGCCACTTCT TCGGGGAGCG CCCGGACAGC GCACCGAGCG AATCCGGCCT CGGCTCGGGC GTCATCGTCA GCCCCGACGG CTACGTACTC ACCAACAACC ACGTCATCGA GACCGCCGAC GCGATCGAGG TCGCGCTCAA CGACGGCCGC AAGTTCGCCG CCCGCCTGGT CGGGCGCGAC CCCGAGACCG ACCTCGCGGT GCTGCGCATC GACGGCGCCG AGGCGCTTCC GGCGATCACC TTCCCGGCCG CCGACAGCCT CGCGGTGGGC GACGTGGTGC TGGCGATCGG CAACCCCTTC GGCGTCGGCC AGACGGTCAC CATGGGCATC GTCTCGGCGC TCGGTCGCAG CCAGCTCGGC ATCAACACCT TCGAGAACTA CATCCAGACC GACGCCGCGA TCAACCCGGG CAACTCGGGT GGCGCGCTGG TCGACAGCGC GGGCAGCCTG GTCGGCATCA ACACGGCGAT CTACTCGCGC TCGGGCGGTT CGCTCGGCAT CGGCTTCGCA ATCCCGGTCT CGATCGCGCG CGACGTGCTC GAGCAGATCG TCGCCACCGG CGAGGTCGTG CGCGGCTGGG TGGGCGTGGA GATCCAGGAC CTCACACCCG AACTGGCCGC CTCGTTCGGC TATCGCGACG CCGGCGGCGC GCTCATCGCC GGCGTACTGC GCGGCAGCCC GGCAGATCGG GCGGGCGTGC GTCCGGGCGA CGTCCTGGTC GCGCTCGACG GCGAGAGCGT GCGCGACCCG CGCGCGATGC TCGACATGGT CGCGGCGCTC TCACCCGGCA AGCGTGCGGT GTTCAGGGTG CGGCGCGGCG CCGAGGCGCT CGACCTCGAG GTGGAAGTGG GCCGCCGCCC GATCCCGCCC GCCGCGCGCT GA
|
Protein sequence | MRRLWLIFAQ AVTVSVAVLF VLNTLKPEWL RKATPAAVVA ILEAPAGSER APAAGSYAPA AQRSMPAVVH IFTSKGGRGQ RHPLLDDPLF RHFFGERPDS APSESGLGSG VIVSPDGYVL TNNHVIETAD AIEVALNDGR KFAARLVGRD PETDLAVLRI DGAEALPAIT FPAADSLAVG DVVLAIGNPF GVGQTVTMGI VSALGRSQLG INTFENYIQT DAAINPGNSG GALVDSAGSL VGINTAIYSR SGGSLGIGFA IPVSIARDVL EQIVATGEVV RGWVGVEIQD LTPELAASFG YRDAGGALIA GVLRGSPADR AGVRPGDVLV ALDGESVRDP RAMLDMVAAL SPGKRAVFRV RRGAEALDLE VEVGRRPIPP AAR
|
| |
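The sequence fields can be checked against each other in the same spirit. This is an illustrative sketch only: it strips the 10-base spacing used in the record, computes a GC fraction, and translates with the amino-acid assignments of translation table 11 (which match the standard table; alternative start codons are not handled here). The helper names and the `EXCERPT` constant are hypothetical. The gene sequence as listed begins with ATG and ends with TGA, so it is treated as already being in coding orientation.

```python
# First 30 bases of the Gene sequence field, with the record's spacing kept.
EXCERPT = "ATGCGCCGCT TGTGGCTCAT CTTCGCGCAG"

# Codon table built from the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(EXCERPT))  # MRRLWLIFAQ
    print(round(gc_fraction(EXCERPT), 3))
```

The translated excerpt matches the first block of the Protein sequence field. Running the same functions on the full Gene sequence string would give values to compare with the GC content and Protein Length fields.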