Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0647 |
Symbol | |
ID | 4601513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 598356 |
End bp | 599657 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773419 |
Product | elongation factor 1-alpha |
Protein accession | YP_920052 |
Protein GI | 119719557 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5256] Translation elongation factor EF-1alpha (GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00483] translation elongation factor EF-1 alpha [TIGR00485] translation elongation factor TU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000513743 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAGA AAAAGCCACA CTTAAACCTG GTAGTGATAG GACACATCGA CCACGGAAAA AGCACCCTAA TGGGAAGACT CCTCTACGAG ATAGGCGCGG TAGACCCCAG GCTGATTCAG CAGTACGAGG AGGAAGCGAA AAAGATGGGT AGGGAGACGT GGAAGTACGC TTGGGTTCTA GACAAGCTCA AGGAGGAGAG AGAGAAGGGT ATCACAATCG ACCTCGGCTT CTACAAGTTC GAGACTAAGA AGTACTTCTT CACGCTGATT GACGCGCCGG GTCACAGGGA CTTCGTTAAG AACATGATAA CCGGAGCTAG CCAGGCTGAC GTCGCATTGC TCGTCGTATC TGCTAAGGAG GGTGAATTCG AGGCTGGCAT AAGCCCTGCT GGTCAGACCA GGGAGCACGT CTTCCTGGCG AAGACGATGG GCGTAGACCA GCTGGTCGTG GCTATAAACA AGATGGACAC GGTTAACTAC AGCAAGGAGA GGTACGAGGA AATTAAGAAC CAGCTGATAA GGTTGCTCCG AATGGTCGGC TACAAGGTGG ACGAGATACC GTTCATACCG ACTTCGGCGT GGGAAGGCGT GAACGTGTCC AAGAGGACCC CCGAGAAGAC TCCGTGGTAC GACGGGCCAT GCCTCTACGA GGCGTTCGAC TTCTTCAAGG AGCCTCCGAG GCCCATAGAC AAGCCGCTAA GGATACCCAT ACAGGACGTC TACAGCATTA AAGGAGTAGG CACAGTTCCC GTTGGGAGAG TCGAGACAGG CGTACTCAAA GTTGGAGACA AGATAATCAT CAACCCGCCG AAAGCAGTGG GAGAAGTCAA ATCCATAGAG ACCCACCACA CGCCGCTCCA GGAGGCTATA CCAGGGGACA ACATAGGTTT CAACGTGAAG GGCGTTGAAA AATCTCAGTT GCGGCGTGGC GACGTGGCAG GACATACAAC GAACCCGCCG ACTGTTGCGG AAGAATTCAC AGGTAGGATC TTCGTCCTGT ACCACCCGAC GGCCATCGCG GCAGGCTACA CACCGGTGCT GCACATACAC ACGGCGACCG TCCCGGTAAC GTTTGAGGAG CTACTTCAGA AGCTTGACCC AAGGACGGGT AGCGTTGCAG AGGAGAAGCC GCAGTACATT AAGCAGGGTG ACTCCGCCAT CGTAAGGTTC AAACCGAGGA AGCCGGTCGT CGTGGAGAAG TACTCTGAGT TCCCACCACT AGGCAGGTTC GCCATTAGAG ACTCTGGCCG CACCATTGCT GCCGGAGTAG TAATCGACGT GAAGAAAGCC GAAGGCTATT AA
|
Protein sequence | MSEKKPHLNL VVIGHIDHGK STLMGRLLYE IGAVDPRLIQ QYEEEAKKMG RETWKYAWVL DKLKEEREKG ITIDLGFYKF ETKKYFFTLI DAPGHRDFVK NMITGASQAD VALLVVSAKE GEFEAGISPA GQTREHVFLA KTMGVDQLVV AINKMDTVNY SKERYEEIKN QLIRLLRMVG YKVDEIPFIP TSAWEGVNVS KRTPEKTPWY DGPCLYEAFD FFKEPPRPID KPLRIPIQDV YSIKGVGTVP VGRVETGVLK VGDKIIINPP KAVGEVKSIE THHTPLQEAI PGDNIGFNVK GVEKSQLRRG DVAGHTTNPP TVAEEFTGRI FVLYHPTAIA AGYTPVLHIH TATVPVTFEE LLQKLDPRTG SVAEEKPQYI KQGDSAIVRF KPRKPVVVEK YSEFPPLGRF AIRDSGRTIA AGVVIDVKKA EGY
|
| |