Gene Tpen_0284 details


Gene Information

Locus tag: Tpen_0284
Symbol:
ID: 4602094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 248050
End bp: 249261
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 55%
IMG OID: 639773040
Product: DNA-directed RNA polymerase, subunit A''
Protein accession: YP_919697
Protein GI: 119719202
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID: [TIGR02389] DNA-directed RNA polymerase, subunit A''
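
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 248050
END_BP = 249261
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record: 249261 - 248050 + 1 = 1212 bp,
# and 1212 / 3 = 404 codons, i.e. 403 residues plus one stop codon.
coordinates_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```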


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGAGC CTATAAGCGA GGAGGAGCTG TGGAACGAGA TCGAGAAGTA CCACGAGCTT 
CTGCCCGCCT CTCTGCGCAG GGAACTCTAC GACAAAATAT TGAAGGCGGG GCTAAGCCGC
TCAGAGGCTC TCTCAGTGAT CAATGAGGCT CTCCGAAGAT ACCTGTCGAG CCTAGTCTCC
CCCGGCGAGG CAGTAGGCAT GGTGGCGGCA CAATCTATAG GCGAGCCCGC CACCCAAATG
ACCTTGAGGA CGTTCCACTT CGCCGGCGTA AGGGAGCTCA ACGTGACCCT GGGGCTTCCG
AGGCTTATCG AAATAGTGGA CTTAAGGCGC GAACCCTCAA CCCCCATAAT GGAAGTGTAC
CTGGAGCCGG AGCACGCCAA AGACCTCGAC TTCGCGTTGA AGATAGCCAG GGAGATCGAG
CTCACCACGA TGGAAAACCT CTGTAGCTCC ATAACCATAG ACTACTTCGA GTTCGCCATA
GTAATGGAGC TAGACCCAGA CATGATGGAG AACAGGGGCG TCACGATGGA TGACGTTATA
AACGCCCTGG AGAAGCTGAA GGGCAAGAAA GGGAAGATAG AAGTGAACGG TAATACCGTT
GTGCTCTACA CGGGGCTCGA GGATGTCACA AAGCTTAGAC GCATGTACGA CAGGGTCCTA
AACCTAAGAA TCAAGGGGTT AAAGGGTATC CGCCACGCCA TAGTCAAGCC GGTCAGGGAC
GAGAAGGGAG AGCTCGTCGA GTACGTGATA CTCACGGAGG GTAGCAACCT GAAGGCCGTT
CTTGGAATAG AGGGGGTAGA CCCGAGGAGA ACAACCACGA ACAACATACT CGAGATATAC
GAGGTGCTAG GCATCGAGGC GGCGCGGGCA GCCATAATAA AGGAGATAAA GAAGGTTCTC
GATGAACACG GACTCGACGT GGACTGGCGC CATATAATGA TGGTCGCCGA CGCGATGACT
TACTCCGGCA AGGTGCGGCA AGTGGGTAGG CACGGAGTAG CCGGGGAGAA GGGGAGCGTG
CTGGCGAGAG CCAGCTTCGA GGTGACGGTG AAGAACCTCG TAGAGGCAGC GCTTCGAGGA
GAGCTCGACG AGCTAAGGGG GGTCATCGAA AACGTGATTA TAGGTAGCAA GCCTATACCG
CTGGGTACTG GCTCAGTAAA GTTAAAAATG AGGTATGAGT TTGGACAGAG CGCGCAGAAA
GAGGTGCAGT AG
 
Protein sequence
MSEPISEEEL WNEIEKYHEL LPASLRRELY DKILKAGLSR SEALSVINEA LRRYLSSLVS 
PGEAVGMVAA QSIGEPATQM TLRTFHFAGV RELNVTLGLP RLIEIVDLRR EPSTPIMEVY
LEPEHAKDLD FALKIAREIE LTTMENLCSS ITIDYFEFAI VMELDPDMME NRGVTMDDVI
NALEKLKGKK GKIEVNGNTV VLYTGLEDVT KLRRMYDRVL NLRIKGLKGI RHAIVKPVRD
EKGELVEYVI LTEGSNLKAV LGIEGVDPRR TTTNNILEIY EVLGIEAARA AIIKEIKKVL
DEHGLDVDWR HIMMVADAMT YSGKVRQVGR HGVAGEKGSV LARASFEVTV KNLVEAALRG
ELDELRGVIE NVIIGSKPIP LGTGSVKLKM RYEFGQSAQK EVQ
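
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal Python sketch under stated assumptions, not the tool that produced this record. It uses the codon assignments of NCBI translation table 11, which are the same as the standard code, and reads the first codon as methionine when it is one of the start codons listed for that table (this gene begins with GTG). The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, as printed in this record.
first_blocks = "GTGAGTGAGC CTATAAGCGA GGAGGAGCTG"
first_residues = translate_cds(first_blocks)  # "MSEPISEEEL"
```

Applied to the first 30 bases, the routine reproduces the first block of the protein sequence, MSEPISEEEL. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the listed protein sequence and GC content to be checked in the same way.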