Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0347 |
Symbol | |
ID | 5171309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 331803 |
End bp | 333182 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640562851 |
Product | protease Do |
Protein accession | YP_001243952 |
Protein GI | 148269492 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.342826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAT TCTTTCTGAC CATCGCTGTG GTACTGATGC TCACCAGCGT CTTTCCATAC GTCAATCCCG ACTACGAAAG CCCCATCGTT AACGTGGTCG AAGCGTGTGC ACCGGCCGTT GTGAAGATAG ATGTTGTGAA GACGGTGAAG ACTTCCTTCT TCGATCCTTA TTTTGAGCAG TTCTTCAAAA AGTGGTTCGG TGAACTTCCA CCCGGTTTCG AAAGGCAGGT TGCCAGCCTC GGATCGGGCT TCATCTTCGA CCCAGAAGGT TACATACTCA CAAACTACCA CGTGGTCGGT GGAGCGGACA ACATCACTGT GACCATGCTC GACGGAAGCA AATACGACGC GGAGTACATA GGAGGAGATG AAGAGCTCGA CATCGCGGTC ATAAAGATCA AAGCCAGCGA TAAGAAATTC CCCTATCTGG AATTCGGTGA TTCTGACAAG GTAAAGATCG GCGAATGGGC GATAGCCATA GGGAACCCCC TCGGCTTCCA GCACACGGTG ACTGTGGGAG TGGTCAGCGC TACCAACAGG AGGATTCCAA AGCCAGATGG AAGTGGTTAC TACGTGGGAC TCATCCAGAC CGACGCGGCG ATCAACCCGG GAAACAGCGG AGGACCCCTT CTGAACATAC ACGGTGAAGT GATAGGCATA AATACGGCCA TCGTCAACCC TCAGGAAGCT GTGAACCTCG GTTTTGCCAT CCCGATCAAC ACGGTGAAGA AGTTCCTCGA CACCATACTC ACCCAAAAGA AGGTTGAGAA GGCTTACCTC GGTGTGACGG TCATGAACCT CACCGAAGAA ACGGCGAAAG CTCTGGGGCT TGAATCTACG AGTGGTGCTC TGATAACGAG TGTTCAGAAA GGATCTCCCG CTGAAAAAGC CGGGCTCAAA GAAGGGGACG TGATCCTCAA GGTTGACGAT CAGGATGTGA GAAGTCACGA AGAGTTGGTT TCCATCATAC ACACTTACAA ACCGGGAGAC ACAGCCGTTC TCACCATAGA GAGAAAGGGA AAGATCATGA AAGTTCAGGT GACGTTCGGT TCCTCATCTG AAGAAGAGAA AACAACGACT GGTGAGGAAA AAATCGATGT TCTTGGCATC ACAGTGTCGA ACATCACGCC GGCCGACAGG GAGACTTATT CGATCCCGGA AGAGATAAAC GGGGTCATAG TGAAGGAGAG CACCGGGAAG TTCGGTCTAC AGAAGGGAGA CGTAATCACC ACAGTGTACG TGAACGGTAA AAGGTATGAT ATAAACTCTG TAGAGGATCT CAAGAAAGTT ACATCCATTG TTAAAAATGG TGACTACATC GCCCTTTACA TTTACAGGAA TGGAGCGAAA GTCTTCGTGA GCTTCATCTA TCAGAGATAG
|
Protein sequence | MKKFFLTIAV VLMLTSVFPY VNPDYESPIV NVVEACAPAV VKIDVVKTVK TSFFDPYFEQ FFKKWFGELP PGFERQVASL GSGFIFDPEG YILTNYHVVG GADNITVTML DGSKYDAEYI GGDEELDIAV IKIKASDKKF PYLEFGDSDK VKIGEWAIAI GNPLGFQHTV TVGVVSATNR RIPKPDGSGY YVGLIQTDAA INPGNSGGPL LNIHGEVIGI NTAIVNPQEA VNLGFAIPIN TVKKFLDTIL TQKKVEKAYL GVTVMNLTEE TAKALGLEST SGALITSVQK GSPAEKAGLK EGDVILKVDD QDVRSHEELV SIIHTYKPGD TAVLTIERKG KIMKVQVTFG SSSEEEKTTT GEEKIDVLGI TVSNITPADR ETYSIPEEIN GVIVKESTGK FGLQKGDVIT TVYVNGKRYD INSVEDLKKV TSIVKNGDYI ALYIYRNGAK VFVSFIYQR
|
| |