Gene Tpen_0647 details


Gene Information

Locus tag: Tpen_0647
Symbol:
ID: 4601513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 598356
End bp: 599657
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 54%
IMG OID: 639773419
Product: elongation factor 1-alpha
Protein accession: YP_920052
Protein GI: 119719557
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG5256] Translation elongation factor EF-1alpha (GTPase)
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00483] translation elongation factor EF-1 alpha
            [TIGR00485] translation elongation factor TU
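
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 598356
END_BP = 599657
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 599657 - 598356 + 1 gives 1302 bp, and 1302 / 3 - 1 gives 433 aa, which agrees with the listed gene and protein lengths.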


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000513743
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAGA AAAAGCCACA CTTAAACCTG GTAGTGATAG GACACATCGA CCACGGAAAA 
AGCACCCTAA TGGGAAGACT CCTCTACGAG ATAGGCGCGG TAGACCCCAG GCTGATTCAG
CAGTACGAGG AGGAAGCGAA AAAGATGGGT AGGGAGACGT GGAAGTACGC TTGGGTTCTA
GACAAGCTCA AGGAGGAGAG AGAGAAGGGT ATCACAATCG ACCTCGGCTT CTACAAGTTC
GAGACTAAGA AGTACTTCTT CACGCTGATT GACGCGCCGG GTCACAGGGA CTTCGTTAAG
AACATGATAA CCGGAGCTAG CCAGGCTGAC GTCGCATTGC TCGTCGTATC TGCTAAGGAG
GGTGAATTCG AGGCTGGCAT AAGCCCTGCT GGTCAGACCA GGGAGCACGT CTTCCTGGCG
AAGACGATGG GCGTAGACCA GCTGGTCGTG GCTATAAACA AGATGGACAC GGTTAACTAC
AGCAAGGAGA GGTACGAGGA AATTAAGAAC CAGCTGATAA GGTTGCTCCG AATGGTCGGC
TACAAGGTGG ACGAGATACC GTTCATACCG ACTTCGGCGT GGGAAGGCGT GAACGTGTCC
AAGAGGACCC CCGAGAAGAC TCCGTGGTAC GACGGGCCAT GCCTCTACGA GGCGTTCGAC
TTCTTCAAGG AGCCTCCGAG GCCCATAGAC AAGCCGCTAA GGATACCCAT ACAGGACGTC
TACAGCATTA AAGGAGTAGG CACAGTTCCC GTTGGGAGAG TCGAGACAGG CGTACTCAAA
GTTGGAGACA AGATAATCAT CAACCCGCCG AAAGCAGTGG GAGAAGTCAA ATCCATAGAG
ACCCACCACA CGCCGCTCCA GGAGGCTATA CCAGGGGACA ACATAGGTTT CAACGTGAAG
GGCGTTGAAA AATCTCAGTT GCGGCGTGGC GACGTGGCAG GACATACAAC GAACCCGCCG
ACTGTTGCGG AAGAATTCAC AGGTAGGATC TTCGTCCTGT ACCACCCGAC GGCCATCGCG
GCAGGCTACA CACCGGTGCT GCACATACAC ACGGCGACCG TCCCGGTAAC GTTTGAGGAG
CTACTTCAGA AGCTTGACCC AAGGACGGGT AGCGTTGCAG AGGAGAAGCC GCAGTACATT
AAGCAGGGTG ACTCCGCCAT CGTAAGGTTC AAACCGAGGA AGCCGGTCGT CGTGGAGAAG
TACTCTGAGT TCCCACCACT AGGCAGGTTC GCCATTAGAG ACTCTGGCCG CACCATTGCT
GCCGGAGTAG TAATCGACGT GAAGAAAGCC GAAGGCTATT AA
 
Protein sequence
MSEKKPHLNL VVIGHIDHGK STLMGRLLYE IGAVDPRLIQ QYEEEAKKMG RETWKYAWVL 
DKLKEEREKG ITIDLGFYKF ETKKYFFTLI DAPGHRDFVK NMITGASQAD VALLVVSAKE
GEFEAGISPA GQTREHVFLA KTMGVDQLVV AINKMDTVNY SKERYEEIKN QLIRLLRMVG
YKVDEIPFIP TSAWEGVNVS KRTPEKTPWY DGPCLYEAFD FFKEPPRPID KPLRIPIQDV
YSIKGVGTVP VGRVETGVLK VGDKIIINPP KAVGEVKSIE THHTPLQEAI PGDNIGFNVK
GVEKSQLRRG DVAGHTTNPP TVAEEFTGRI FVLYHPTAIA AGYTPVLHIH TATVPVTFEE
LLQKLDPRTG SVAEEKPQYI KQGDSAIVRF KPRKPVVVEK YSEFPPLGRF AIRDSGRTIA
AGVVIDVKKA EGY
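
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the pipeline that produced this record. It assumes the sequence is given 5' to 3' on the coding strand, strips the 10-base spacing used in the listing, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; these are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGTCTGAGA AAAAGCCACA CTTAAACCTG"))
```

Run on the first 30 bases, `translate` returns MSEKKPHLNL, the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare against the listed protein sequence and GC content.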