Gene Information |
Locus tag | TRQ2_0365 |
Symbol | |
ID | 6091770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 351856 |
End bp | 353235 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642487543 |
Product | protease Do |
Protein accession | YP_001738404 |
Protein GI | 170288166 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAT TCTTTCTGAC CATCGCTGTG GCACTGATAC TCACCAGCGT CTTTCCATAC GTCAACCCCG ACTACGAAAG TCCCATCGTT AATGTGGTCG AAGTGTGTGC ACCGGCCGTT GTGAAGATAG ACGTTGTGAA GACGGTGAAG ACTTCCTTCT TCGATCCTTA TTTTGAGCAG TTCTTCAAAA AGTGGTTCGG TGAACTTCCA CCCGGTTTCG AAAGGCAGGT TGCCAGCCTC GGATCGGGCT TCATCTTCGA CCCAGAAGGT TACATACTCA CCAACTACCA CGTGGTAGGT GGAGCGGACA ACATCACTGT GACCATGCTC GACGGAAGCA AATACGACGC GGAGTACATA GGAGGAGATG AAGAGCTCGA CATCGCGGTC ATAAAGATCA AAGCCAGCGA TAAGAAATTC CCCTATCTGG AATTCGGTGA TTCTGACAAA GTGAAGATCG GTGAATGGGC GATAGCCATA GGAAACCCCC TCGGCTTCCA GCACACGGTG ACTGTGGGAG TGGTCAGCGC TACCAACAGG AGGATTCCAA AGCCAGATGG AAGTGGTTAC TACGTGGGAC TCATCCAGAC CGACGCGGCG ATCAACCCGG GAAACAGTGG AGGACCCCTT TTGAACATAC ACGGTGAAGT GATAGGCATA AACACCGCTA TCGTCAATCC TCAGGAAGCT GTGAACCTCG GCTTTGCCAT CCCGATCAAC ACGGTGAAGA AGTTCCTTGA CACCATACTC ACCCAAAAGA AGGTTGAAAA AGCCTACCTC GGCGTGACGG TCATGACCCT CACCGAAGAA ACGGCGAAAG CTCTGGGGCT TGAATCTACG AGTGGTGCTC TGATAACGAG TGTTCAGAAA GGATCTCCTG CTGAAAAGGC CGGGCTCAAA GAAGGGGACG TGATCCTCAA GGTTGACGAT CAGGACGTGA GAAGTCACGA AGAACTGGTC TCCATCATAC ACACTTACAA ACCGGGAGAC ACTGCCGTTC TCACTATAGA GAGAAAGGGA AAGATCATGA AAGTTCAGGT GACGTTCGGT TCCTCATCTG AAGAAGAGAA AACAACGACT GGTGAGGAAA AAATCGATGC CCTTGGTATC ACAGTGTCGA ACATCACGCC GGCCGACAGG GAGACTTATT CGATCCCGGA AGAGATAAAC GGGGTCATAG TGAAGGAGAG CACCGGGAAG TTCGGTCTAC AGAAGGGAGA CGTAATCACC ACAGTGTACG TGAACGGCAA AAGGTATGAT ATAAACTCTG TAGGGGATCT CAAGAAAGTT ACATCCATTG TTAAAAATGG TGACTACATC GCCCTTTACA TTTACAGGAA TGGAGCGAAA GTCTTCGTGA GCTTCATCTA TCAGAGATAG
|
Protein sequence | MKKFFLTIAV ALILTSVFPY VNPDYESPIV NVVEVCAPAV VKIDVVKTVK TSFFDPYFEQ FFKKWFGELP PGFERQVASL GSGFIFDPEG YILTNYHVVG GADNITVTML DGSKYDAEYI GGDEELDIAV IKIKASDKKF PYLEFGDSDK VKIGEWAIAI GNPLGFQHTV TVGVVSATNR RIPKPDGSGY YVGLIQTDAA INPGNSGGPL LNIHGEVIGI NTAIVNPQEA VNLGFAIPIN TVKKFLDTIL TQKKVEKAYL GVTVMTLTEE TAKALGLEST SGALITSVQK GSPAEKAGLK EGDVILKVDD QDVRSHEELV SIIHTYKPGD TAVLTIERKG KIMKVQVTFG SSSEEEKTTT GEEKIDALGI TVSNITPADR ETYSIPEEIN GVIVKESTGK FGLQKGDVIT TVYVNGKRYD INSVGDLKKV TSIVKNGDYI ALYIYRNGAK VFVSFIYQR
|
| |
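The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and sequence. The following is a minimal sketch, not the database's own method: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon, which is consistent with the values listed above. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring the spaces used for 10-base grouping."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(351856, 353235)
print(length)                  # 1380, matches "Gene Length"
print(protein_length(length))  # 459, matches "Protein Length"

# GC content of the first two 10-base groups only, as a small worked example.
print(gc_percent("ATGAAGAAAT TCTTTCTGAC"))  # 30.0
```

Passing the full Gene sequence string to `gc_percent` gives the value to compare with the GC content field. Because the gene lies on the minus strand, the listed sequence is presumably already given in the coding orientation, since it begins with ATG and ends with the stop codon TAG.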