Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_1797 |
Symbol | |
ID | 5297715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | + |
Start bp | 1781355 |
End bp | 1782713 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640770065 |
Product | protease Do |
Protein accession | YP_001307017 |
Protein GI | 150021663 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACAAAGA GATTAGGTAT ATTAATTTTA ACGGTATTGG CTATTTTTTC TTTTGGCTAT GTAAACCCAA ACTACGAAAG TCCTATCACA AAAGTAGTTG ATGAAGCTGC ACCTGCTGTA GTAAAGATTG AAGCAACTGT ATATTCTACA TCGTATATTG ATCCATTCAT CGAGGAATTC TTCAAAAGAT GGTTTGGTGA TATTCCCAAG CAATATCAGC AAAAAGGTAC TAGTTTAGGC TCAGGTTTTA TATTTGAAAA AGAAGGTTAT ATATTAACTA ATTTTCATGT AGTTGATGGT GCTGAAAATA TAAAAGTCAG TTTGTTAGAT GGAAAAGAAT TCAGCGCCGA ATTTATCGGT GGAGACAAAG AACTAGATAT TGCTATACTA AAAATAGATC CAAAAAACCA GGAACTTCCC GTTTTAGAAT TTGGTGATTC CGATAAATTA AAAATTGGTG AATGGGCAAT CGCAATAGGT AATCCACTTG GTTTTCAACA TACCGTTACA GTTGGGGTAA TTTCGGCAAC AGGAAGAAAA ATACCAAAAC CCGATAATGA CGGTTATTAC ACAAATCTAA TTCAAACAGA CGCTGCAATC AACCCAGGAA ATAGCGGAGG CCCACTTTTA AATATACATG GTCAAGTTAT TGGTATTAAT ACTGCAATTA TTGCCCCTTC AGAAGCTATG AATATAGGTT TTGCAATACC AATTAACACA GCAAAAAGAT TTATAGATAG TATAATAAAA ACTGGTAAAG CCGAAAAAGC ATATCTTGGA GTCTATATGC AAACAGTTAC AAAAGAACTT GCAAAAGCAT TAGGGTTAAA AACTGATAAA GGGGTTTTTA TTTCACAAGT TATAAAAGAC TCTCCAGCTG AAAAAGCTGG TTTAAAGGAT GGAGATGTTA TTATAGAAGT TGAAGGTCTT TCTGTAACAT CTGCAAGTGA ATTAAAATCG ATAATCCACA ACTACACTCC CGGTTCAAAA ATAAAGATTA TAGTAAATAG AAAAGGAAAA ATAATTAAAT TTGAAGTGAC TTTAGGTAAA TCAAAAGAAA CTGAAAAAGT AACATCAGCA AAAGAGTTTA TGGGATTAAC TGTAAAAGAC ATTACAAATG CAGATAGAGA AGAATATCAA ATTCCAGAAG AAATTAGTGG AGTAGTTGTA AAAAACAGCA AAATAAGTTA CATTAGTGAA GGATATGTCA TTTTCAGAAT AGCAATCAAC GGACAAAAAT ATGAAATTAG AAATATAAAT GATTGGAACA AAGTAATTTC AAAAATAAAC AAAGGCAATT ATGTTGCTCT CTTCTACTAC TATAAAGGTG CTACGGGAGT ATTTTCATTT GGATATTAA
|
Protein sequence | MTKRLGILIL TVLAIFSFGY VNPNYESPIT KVVDEAAPAV VKIEATVYST SYIDPFIEEF FKRWFGDIPK QYQQKGTSLG SGFIFEKEGY ILTNFHVVDG AENIKVSLLD GKEFSAEFIG GDKELDIAIL KIDPKNQELP VLEFGDSDKL KIGEWAIAIG NPLGFQHTVT VGVISATGRK IPKPDNDGYY TNLIQTDAAI NPGNSGGPLL NIHGQVIGIN TAIIAPSEAM NIGFAIPINT AKRFIDSIIK TGKAEKAYLG VYMQTVTKEL AKALGLKTDK GVFISQVIKD SPAEKAGLKD GDVIIEVEGL SVTSASELKS IIHNYTPGSK IKIIVNRKGK IIKFEVTLGK SKETEKVTSA KEFMGLTVKD ITNADREEYQ IPEEISGVVV KNSKISYISE GYVIFRIAIN GQKYEIRNIN DWNKVISKIN KGNYVALFYY YKGATGVFSF GY
|
| |