Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0830 |
Symbol | |
ID | 4601833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 783524 |
End bp | 784756 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773607 |
Product | hypothetical protein |
Protein accession | YP_920234 |
Protein GI | 119719739 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGTA GGGTCGGCGT CTTCATCGCG TTTAGTGTCT TCCTGCTTCT ATGCGCAGTG GCTTACGCTC AGCCGAGGGT CGTGGTTGTA GACTTGGAGG GCCGTATCGA CGAGGGTGCG TACTATACCG TTAAGAGGGG TCTAGAGGAA GCCGACAAGG GCGTGGTGGT CGTGGTTATC CGTAGTTACG GCGGCTACCT GAAATCCATG GACAAGATAG TCGAGCTACT CGTATCCTCC GAGGCGAGGA CTATCGCGTG GGTCCCCCCG GGGGAGAGGA CCGTGTCGGC GGCAGCCGTG ATCGCTCTTT CAGCCCAGAG GCTCTACATG GGTAAAGGCG CTGTTATCGG GTCGATCAAG CCGTACCCTG ACGACCCTAA GACTGTGGAG TACGTTGTAG CCAGGGTAGC CGCTTTACTC TCGAAGAAGG GGGTTAACGA CTCCAGGGGT CTTGCCAGGA GGCTCGTGGT CGACGCGCAG AGCTTTACGA GCGACGAGGC AGTTGCAACC GGCATAGCGG ATGGAACGGC GGACAGCCTC GCGGAGTTAC TGCAGAAAGA GGGTCTAAGC TTTGCCAGCG TCCACTACGT ATCCGGCGAC TTAGTCAGCG ACTTACTCTC GGTGCTGCTG GACCCAGCCC TAGCGGTGTT GCTGGCACTC CTGGGGGCCA TGCTCCTGCT CTTAGAGTTC AAGGTTACGG GATTCCAGGG ATGGGGGGTG ATAGGGGCAG CCCTCATAGT GATCTCCCTG TACTCCTTCG ATGTTATAGG CGTAAGCCTT AGCACGTTCA TACTGGCGTT GCTAGGCATA ACCCTGATAA TAGTCGAGCT CGCCAAGCCA GGGGTTCAGG TCGCCGGCAT TGCAGGCGTA GCCTTAATAG CCCTCGCTGT GATACTCGAG TATGCTTCGC GCCCCTACCC TGTTTTCACT CCCAACGTCC TGGTAGTAGC GGTTCCCCTG GCGATCCTAG TCCTCCTGCT TGCCGTGGTA ATTTCGAAGG CACTGGAGAC TGTAAGGATG AAGGCTCCGA GCCTTCAGGA AAGGCTTATC GGGAAAATAG GCGTCGCCAA GACGCGGATA GAGCCCGGGA AGAGGGGCGT AGTTTACGTC GACGGGGAGG ACTGGACCGC TACTTCTGAG TACGTAGTGG AGGAAGGAGA GAGCGTAGAG GTTGTCGCTT TGGATGGGTT GTTCCTAAAG GTGAAACCGG TACGGCGGGA GCACTCGCAG TGA
|
Protein sequence | MKGRVGVFIA FSVFLLLCAV AYAQPRVVVV DLEGRIDEGA YYTVKRGLEE ADKGVVVVVI RSYGGYLKSM DKIVELLVSS EARTIAWVPP GERTVSAAAV IALSAQRLYM GKGAVIGSIK PYPDDPKTVE YVVARVAALL SKKGVNDSRG LARRLVVDAQ SFTSDEAVAT GIADGTADSL AELLQKEGLS FASVHYVSGD LVSDLLSVLL DPALAVLLAL LGAMLLLLEF KVTGFQGWGV IGAALIVISL YSFDVIGVSL STFILALLGI TLIIVELAKP GVQVAGIAGV ALIALAVILE YASRPYPVFT PNVLVVAVPL AILVLLLAVV ISKALETVRM KAPSLQERLI GKIGVAKTRI EPGKRGVVYV DGEDWTATSE YVVEEGESVE VVALDGLFLK VKPVRREHSQ
|
| |