Gene Information |
Locus tag | Tmz1t_2317 |
Symbol | |
ID | 7085304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2607541 |
End bp | 2609004 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643699338 |
Product | protease Do |
Protein accession | YP_002355952 |
Protein GI | 217970718 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
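The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this explicitly) and that the CDS length includes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2607541, 2609004)
print(length)                  # 1464
print(protein_length(length))  # 487
```

Under these assumptions both results agree with the Gene Length (1464 bp) and Protein Length (487 aa) rows of the record.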
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCCG AAGTCCGCAT GCAAACGAGC GCCGCGAGGG CGCTCGCCGT GGTGCTCTGT CTCCTGCTGA TGCTCTGCGC GCCGAGGGGC TTCGCGGCCG GCACGGCAGA GCGGGCGATG CTCCCCGATT TCACCCGCCT CGTGGCGACA CAGGGCGCCG CCGTGGTCAA TATCAGCGCC ACCCAGGTGG CTGCGCAGCC GCAGAAGCAA CCCTTCCGTC TTCCGGAACT CGACGAGTCG GATCCGATGT TCGAGTTCTT CCGCAAGTTC ATCCCGCGCA TGCCCGAGTA CCCGGGGGCG GAGCCCGACG ACAAGTCGCT CGGCTCGGGC TTCATCATCA GCGCCGACGG TTTCATCCTC ACCAACGCGC ATGTGGTGGA GGCCGCAGAG AGCATCGTGG TCCGCCTCGC CGACAAGCGC GAGTTCGACG CCACGGTGAT CGGCGCCGAT GCGCGCAGCG ACGTCGCGCT GATCCGCATC GAGGCCAAGG ACCTGCCTCA TGTCGTGCTC GGCGACCCCG AGGCGCTCGC GGTGGGCGAG TGGGTGCTGG CGATCGGTTC GCCCTTCGGC TTCGAGCAGT CGGTCACCGC CGGCATCGTC AGCGCCAAGG GGCGCAGTCT GCCCGACGAG AACTTCGTGC CCTTCATCCA GACCGACGTC GCGATCAATC CGGGAAATTC GGGCGGGCCG CTGTTCAACC TGCGTGGCGA GGTGATCGGC ATCAATTCGC AGATCTACAG TCGTACGGGC GGGTTCATGG GTCTGTCCTT CGCGATCCCG ATCGATGTCG CGATGGACGT GCAGCAGCAA CTGCGCGAGA AGGGCAGGGT GGAGCGCGGG CGCATCGGGG TGTCGATCCA GGAGATCACC CGCGACCTCG CCGACAGTTT CGGGCTGCCG CGCCCTGCGG GTGCGCTGGT GAGCAGTGTG GAGGCCGGTG GTCCGGCGGC GCTCGGCGGC GTCGTCCAGG GCGACGTGAT CGTGCGCTTC AACCAGCGCA ACGTCGAGAA TTCGGCCGAC CTTCCGCGCA TCGTCGCGGC GGCGCGCCCG GGCAGCAAGG TCGAGGTCGA GATCTATCGC GACGGGGCGC CGCGTTCCCT GAGCTTGACG CTGGGCGAAT GGCGCGACCC GGAGGAAGAG GTCGAGCCCG TGGCGGTCGG TCTGGCCACG GGCGCGACCA ACCGCCTCGG CCTCGAACTC GTCGCGCCCA CGGCGCAGCA GCGGCGCGAG CGCGGGCTTG CGCACGGCCT GTTGGTGCAG CGTGCCGAAA AGTCCGCCGC GCGGGCCCAG ATCGTTCCCG GCGACCTCGT GCTGGCGATC GTCGTCGAAG GGCGCCAGGC CAGGCTCGAT CGCATCGAGG ATTTCGAACG CGTGGTCGCC GCACTCAAGC CCGGGCAGCA GGTCACCCTG CTGGTCGGGC GCGGCGAGAG CGCGTCCTAT GTCAGCCTGC GCGCCGACAA GTGA
|
Protein sequence | MNPEVRMQTS AARALAVVLC LLLMLCAPRG FAAGTAERAM LPDFTRLVAT QGAAVVNISA TQVAAQPQKQ PFRLPELDES DPMFEFFRKF IPRMPEYPGA EPDDKSLGSG FIISADGFIL TNAHVVEAAE SIVVRLADKR EFDATVIGAD ARSDVALIRI EAKDLPHVVL GDPEALAVGE WVLAIGSPFG FEQSVTAGIV SAKGRSLPDE NFVPFIQTDV AINPGNSGGP LFNLRGEVIG INSQIYSRTG GFMGLSFAIP IDVAMDVQQQ LREKGRVERG RIGVSIQEIT RDLADSFGLP RPAGALVSSV EAGGPAALGG VVQGDVIVRF NQRNVENSAD LPRIVAAARP GSKVEVEIYR DGAPRSLSLT LGEWRDPEEE VEPVAVGLAT GATNRLGLEL VAPTAQQRRE RGLAHGLLVQ RAEKSAARAQ IVPGDLVLAI VVEGRQARLD RIEDFERVVA ALKPGQQVTL LVGRGESASY VSLRADK
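The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and the sketch does not handle alternative start codons, which table 11 would read as M at the first position.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAACCCCG AAGTCCGCAT GCAAACGAGC"))  # MNPEVRMQTS
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content rows to be checked in the same way.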
|
| |