Gene Tpen_0286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0286 
Symbol 
ID4602096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp251931 
End bp255323 
Gene Length3393 bp 
Protein Length1130 aa 
Translation table11 
GC content56% 
IMG OID639773042 
ProductDNA-directed RNA polymerase subunit B 
Protein accessionYP_919699 
Protein GI119719204 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR03670] DNA-directed RNA polymerase subunit B 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCGC GCGCCGTGCA GGCAAAGCTA TCCAGGGAGG ATAGGTGGAG AGTGGTAAAA 
GCATACGTAG ACGAGCTAGG CCTCGTCAGG CAACACTTAG ACTCCTTCAA CGCTTTTCTA
GAGCGTGGAC TCCAGGAGAT AGTGGACGAG GTCGGCGGTA TCAAGGTTGA GTCGCAGGGT
GTTGAGATAA AGTTTGGGAA AATAGAGGTT GGTCAGCCCA CGTTTAGGGA GGCTGACGGT
AGCGATCTCG CTTTGACACC TATGATAGCA AGGCTGAGAA ACATAACCTA CGCGGCACCG
CTCTACCTTA CAATGACGCT CTACGTCGAC GGGGAGGAGA GGCGGACGGA GTCCGTCTAC
ATAGGTAGTC TGCCCATAAT GGTGAAAAGC AAGAAGTGCG TTCTCTACGG TCTCAAGTCG
GAAGACGAGA TAGTTAAGTA CGGCGAGGAC CCGTACGACC CCGGAGGGTA CTTCATCGTT
AACGGCTCTG AGAGAGTAAT CGTGATGCAG GAGGATCTGT CGGTGAACAG GGTTCTCGTG
GACTACGGCG GCGCTAGCGG CTCCGTCACG CATACAGCAA AGGTTTTCAG CGTTGCCGCT
GGCCAAAGGT CGCCGCTTAC AGTCGAGAGA ACGAAGGATG GGATGATATA CGCCTCCTTC
CCGGCTTGCC CGTCGAAAAT ACCAGTAGTA GTGTTGATGC GCGCGCTGGG TCTAAAGACG
GACCAGGAGA TAGCCTACGC CATAGGCAAC GATCCGATAA TTCAGCAGGA GTTTCTGCCC
GTATTAATGG AGCAATCGAA GATCGCCGCT ACGCCCGAGG AGGCTCTCGA CTACATAGGC
TCCAGGGTGT CCCCGGGCCA GCCGAGAAAC GTGAGGATAG AGAGGGCGCA AGCCGTTCTA
GATGAAAACT TGTTGCCGCA CATAGGTAGA GGCCCCGCTG CTAGGATTTC TAAGGCTTTC
TTCGTGGGGC AGATGGTTTC CCGGTTGCTC GAACTCAAGC TCGGCATGCG CGGGCCCGAT
GATAAGGATC ACCTCGCGAA TAAGAGGATA CGGCAGGCCG GCGAGCTGAT AGCGCAGGTG
TTCAGGAGCG CGTTTAGACA GCTAGTAAAA GAGATGACTT ACTCCATCGA GAGGCACACT
TCCAAGACTC GGGATATCAA CCTGGTGAGT ATAGTCAGGC CGGACATAAT TACCGAGAGG
TTAAACCACG CACTCGCGAC TGGCAACTGG GTGGGCGGCA GGACGGGTGT GAGCCAGATT
CTGGACAGGA CGAACTACCT CTCGACAATC TCCCACTTGA GGAGGGTTGT CTCGCCGCTA
TCCAGGACTC AGCCGCACTT CGAGGCTAGG GAGCTTCACC CGACGCAGTG GGGCAGGCTT
TGCCCCGTAG AAAGCCCCGA GGGGCAGAAC TGCGGCTTGG TAAAGCACTT GGCGCTTCTG
GCTACTCTCT CGAACGGGAC GGACGAGAAG CAGGTGTACG ACCTGCTGGT AGGCAGGCTG
GGAGTAGTCC CCGTGGAGAA GACGGTGGGG AAGAATATCT CGGGGGCTAG GGTCTACCTC
AACGGGCGGC TCATAGGCTA CGTTGAGGAC GGCAAGGGTC TCGCGGAGAC TTTGAGGAAG
CTTCGAAGAG AGGGAAGAAT AAGTCACGAG GTCAACGTTG CCTTCTATTC TCATGAATAC
ACCGTCGGCG GGGTTAAGGG TAGGATAGAG GAGGTTTACG TCAACTGCGA CGCTGGGAGG
ATACGAAGAC CTCTGATAGT AGTCGAAAAC GGTGAACCGA GGCTTAAACA CGAACATGTA
GAGCTGTTGA GGAAAGGCGA GTGGACTTGG AGCGACCTCA TAGAGAACGG CATAGTCGAG
TACCTAGACG CAGAGGAAGA GGAGAACGCC TACATAGCTA CGGATGTGTC CGAGCTAACT
CCTCAGCACA CTCACCTCGA GATTGTCCCG GCGGCGATCC TAGGCATTAT CGCGATGACG
ATACCCTTTA TCGAGTACAA TCAGTCGCCG AGAAACTCGT ATCAGGCGGC GATGGCTAAG
CAGTCCCTGG GAATACCGCA CTACAACTTC AAGCTCCGCA TGGACCCCAG GATGCACGTG
ATGTACTACC CCCAGAAACC GCTCGTGAAG ACTCGCATCT TCGACCTGCT ACCCTTAGAC
AACCTGCCCT ACGGCACAAA CATGGTGGTA GCGGTTCTGA CAGGCGGAGG ATACAACATC
CAGGATGCGG TGGTCATCAA CAAGGCTGCG ATAGAAAGAG GCATGTCGAG GTCCGTCTTC
TTTAGAACAT ACGAAGCCGA GGAGAGGAGG TATCCCGGTG GGCTCGAGGA TAGGTTCGAA
AAGCCCTCCC TAGAAAAGGA CCTTCTAGAC GTTAAGCCTC CTCAGGCTTA CGAGGCTATA
GACCCCGTGG ACGGCATAGC CTACGTGGAG GCAGAGCTCT ACGGCGGTCA AGCTGTGGTG
AGTAGGACGA GCCCGCCGCG CTTCTACACG AGCACCCTGG AACCTAGGGT TATGACCAAG
AGAAAGGACA CCTCCCTACT CCTGCGACAC GGTGAAAAAG GGATAATTGA CCGCGTCTTC
ATAATGGAGA GCCCCGGAGG CATAAAGCTC GCCAAGGTAA GAGTGAGAGA TCTGCGCCCC
ACGGAGCTCG GGGATAAGTT CGCCTCGCGC CATGGGCAGA AAGGCGTCGT AGGGATGCTT
GTACCACAGG AAGATATGCC GTTCACGGAA GAGGGGATAA CCCCTGACCT AATAATCAAC
CCGCACGCTA TTCCTTCGAG GATGACCGTC GGACAGCTAC TCGAGGCGAT AACGGGGAAG
GCTGCCGCGC TCGCCGGTAG GAGGATCGAT GCTACGGCTT TTGAACCGCC GTCGTTAGAT
GAGATAAGAG AGATACTCAG GAGCTATGGC TTCAGGAGCG ACGGGAAAGA GGTTCTCTAC
GACGGGGTTA CCGGGGAGAA GTTGGAGGCC GAGATATTTA TCGGTGTCGT GTACTACGAG
AAGCTACACC ACCTCGTTGC CGACAAGATG CACGCGAGAG CCAGGGGTAG GGTACAGATA
CTAACGCGGC AACCCACGGA GGGTAGAGCG CGGGAAGGAG GTCTGAGGTT CGGCGAGATG
GAGAAGGACT GCCTAGTCGG GCACGGAGCC TCCATGCTCC TTAGAGAGCG TCTCCTCGAA
AGCTCAGACA AGACTACGAT ATGGGTTTGC GAGAACTGCG GCTATATGGG GTGGTTCGAC
GCGAGGAAGA ATACCCCCGT ATGCCCTGTC TGCGGCGATA AGGGAAGGCT TAGCCCCGTC
GAGGTATCCT ATGCGTTTAA GCTACTTTTG CAGGAGCTTA CGGGGCTAGG GCTCTCTGTG
CGACTGATCC TTAAAGACAA AATCCAGTCG TGA
 
Protein sequence
MSSRAVQAKL SREDRWRVVK AYVDELGLVR QHLDSFNAFL ERGLQEIVDE VGGIKVESQG 
VEIKFGKIEV GQPTFREADG SDLALTPMIA RLRNITYAAP LYLTMTLYVD GEERRTESVY
IGSLPIMVKS KKCVLYGLKS EDEIVKYGED PYDPGGYFIV NGSERVIVMQ EDLSVNRVLV
DYGGASGSVT HTAKVFSVAA GQRSPLTVER TKDGMIYASF PACPSKIPVV VLMRALGLKT
DQEIAYAIGN DPIIQQEFLP VLMEQSKIAA TPEEALDYIG SRVSPGQPRN VRIERAQAVL
DENLLPHIGR GPAARISKAF FVGQMVSRLL ELKLGMRGPD DKDHLANKRI RQAGELIAQV
FRSAFRQLVK EMTYSIERHT SKTRDINLVS IVRPDIITER LNHALATGNW VGGRTGVSQI
LDRTNYLSTI SHLRRVVSPL SRTQPHFEAR ELHPTQWGRL CPVESPEGQN CGLVKHLALL
ATLSNGTDEK QVYDLLVGRL GVVPVEKTVG KNISGARVYL NGRLIGYVED GKGLAETLRK
LRREGRISHE VNVAFYSHEY TVGGVKGRIE EVYVNCDAGR IRRPLIVVEN GEPRLKHEHV
ELLRKGEWTW SDLIENGIVE YLDAEEEENA YIATDVSELT PQHTHLEIVP AAILGIIAMT
IPFIEYNQSP RNSYQAAMAK QSLGIPHYNF KLRMDPRMHV MYYPQKPLVK TRIFDLLPLD
NLPYGTNMVV AVLTGGGYNI QDAVVINKAA IERGMSRSVF FRTYEAEERR YPGGLEDRFE
KPSLEKDLLD VKPPQAYEAI DPVDGIAYVE AELYGGQAVV SRTSPPRFYT STLEPRVMTK
RKDTSLLLRH GEKGIIDRVF IMESPGGIKL AKVRVRDLRP TELGDKFASR HGQKGVVGML
VPQEDMPFTE EGITPDLIIN PHAIPSRMTV GQLLEAITGK AAALAGRRID ATAFEPPSLD
EIREILRSYG FRSDGKEVLY DGVTGEKLEA EIFIGVVYYE KLHHLVADKM HARARGRVQI
LTRQPTEGRA REGGLRFGEM EKDCLVGHGA SMLLRERLLE SSDKTTIWVC ENCGYMGWFD
ARKNTPVCPV CGDKGRLSPV EVSYAFKLLL QELTGLGLSV RLILKDKIQS