Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0989 |
Symbol | |
ID | 4601965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 940254 |
End bp | 941264 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773767 |
Product | hypothetical protein |
Protein accession | YP_920392 |
Protein GI | 119719897 |
COG category | [S] Function unknown |
COG ID | [COG5493] Uncharacterized conserved protein containing a coiled-coil domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.132758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGCGC TGGGTAGGGG GGAGTGGGAG CGGTTGGTTA AGGCTTTGGA GGAGGATAGG GAGCTTAGGT ACGCTTTAGC CGGTTTGCTG GGATTCAGGG ATTTGCTCGA GAAGATGGAC GCAACGCTGA ACGAGATAAG GGCGCTGAGG GAGGGACAGG AGCAACTGTG GAGAAACCAG GAAAAACTGT GGGAAGAAGT GAAGTCCCTC AGAGAGGGGC AGGGAAAGCT ATGGGAAGAA GTTAAGGCGC TGAGAGAGGA TCAGAGGAGG CTGTGGGAGG AAGTTAAGGC TCTCAGAGAA AACCAGGAAA AGCTATGGGA GGAGGTTAGA GCGCTGAGAG AGGACCAGGG GAAGTTGTGG GAAGGCCAGC AGAGGCTCTG GGAGGAGGTC AAGGCACTGA GAGAGGGACA GGAGAAGCTC TGGGAAGAGG TAAGGAAGCT GTGGGAGGAG GTAAAAGCTC TGAGGGAGAA CCAGGAAAAG CTGTGGGAGG AAGTGAAAGC TCTTAGAGAG GGACAGGAGA AACTATGGGA AGAGGTTAAG GCGCTGAGAG AAGAGCAAGG AAAGCTGTGG AAAGAAGTGA AGTCTCTCAG AGAGGAGCAA GGGATCCTCG CGAGAAAGAT GGACTCCTTC GAGAGACGCC TCATAGCGCT GGGCGCCAGG TGGGGCATCG AGTCGGAAGC CGCTTTCAGA GAAGCCATGA GGGGAGTCGT CGAGGAAATA CTAGGCGCAG GCGAAGTCCT CAGGTGGGTC TACTACGACG AAGACGGCGA AGTCCTCGGA TACCCCTCCA GGGTCGAAGC AGACATACTG ATAAAAGACA AGGTACACGT ACTCATCGAA GTAAAACCCA GCGCCTCCAG CGGAGACATA GCAAAGCTCT GGAGGCTCGG ACGCCTATAC GAGAAGAAAA CCGGCACAAA GCCAAGACTA GTCCTCGTAA CACCCTTCAT AGAAGAAGAA GCACTAAAAG CCGCAAAACA ACTCGGAATA GAAGTATACA CGAACACCTA G
|
Protein sequence | MAALGRGEWE RLVKALEEDR ELRYALAGLL GFRDLLEKMD ATLNEIRALR EGQEQLWRNQ EKLWEEVKSL REGQGKLWEE VKALREDQRR LWEEVKALRE NQEKLWEEVR ALREDQGKLW EGQQRLWEEV KALREGQEKL WEEVRKLWEE VKALRENQEK LWEEVKALRE GQEKLWEEVK ALREEQGKLW KEVKSLREEQ GILARKMDSF ERRLIALGAR WGIESEAAFR EAMRGVVEEI LGAGEVLRWV YYDEDGEVLG YPSRVEADIL IKDKVHVLIE VKPSASSGDI AKLWRLGRLY EKKTGTKPRL VLVTPFIEEE ALKAAKQLGI EVYTNT
|
| |