Gene Information |
Locus tag | Tpen_0286 |
Symbol | |
ID | 4602096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 251931 |
End bp | 255323 |
Gene Length | 3393 bp |
Protein Length | 1130 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773042 |
Product | DNA-directed RNA polymerase subunit B |
Protein accession | YP_919699 |
Protein GI | 119719204 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR03670] DNA-directed RNA polymerase subunit B |
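
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp, end_bp = 251931, 255323
gene_length_bp = 3393
protein_length_aa = 1130


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Because the gene lies on the minus strand, the start and end positions describe its location on the replicon; the strand does not change the length calculation.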
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCGC GCGCCGTGCA GGCAAAGCTA TCCAGGGAGG ATAGGTGGAG AGTGGTAAAA GCATACGTAG ACGAGCTAGG CCTCGTCAGG CAACACTTAG ACTCCTTCAA CGCTTTTCTA GAGCGTGGAC TCCAGGAGAT AGTGGACGAG GTCGGCGGTA TCAAGGTTGA GTCGCAGGGT GTTGAGATAA AGTTTGGGAA AATAGAGGTT GGTCAGCCCA CGTTTAGGGA GGCTGACGGT AGCGATCTCG CTTTGACACC TATGATAGCA AGGCTGAGAA ACATAACCTA CGCGGCACCG CTCTACCTTA CAATGACGCT CTACGTCGAC GGGGAGGAGA GGCGGACGGA GTCCGTCTAC ATAGGTAGTC TGCCCATAAT GGTGAAAAGC AAGAAGTGCG TTCTCTACGG TCTCAAGTCG GAAGACGAGA TAGTTAAGTA CGGCGAGGAC CCGTACGACC CCGGAGGGTA CTTCATCGTT AACGGCTCTG AGAGAGTAAT CGTGATGCAG GAGGATCTGT CGGTGAACAG GGTTCTCGTG GACTACGGCG GCGCTAGCGG CTCCGTCACG CATACAGCAA AGGTTTTCAG CGTTGCCGCT GGCCAAAGGT CGCCGCTTAC AGTCGAGAGA ACGAAGGATG GGATGATATA CGCCTCCTTC CCGGCTTGCC CGTCGAAAAT ACCAGTAGTA GTGTTGATGC GCGCGCTGGG TCTAAAGACG GACCAGGAGA TAGCCTACGC CATAGGCAAC GATCCGATAA TTCAGCAGGA GTTTCTGCCC GTATTAATGG AGCAATCGAA GATCGCCGCT ACGCCCGAGG AGGCTCTCGA CTACATAGGC TCCAGGGTGT CCCCGGGCCA GCCGAGAAAC GTGAGGATAG AGAGGGCGCA AGCCGTTCTA GATGAAAACT TGTTGCCGCA CATAGGTAGA GGCCCCGCTG CTAGGATTTC TAAGGCTTTC TTCGTGGGGC AGATGGTTTC CCGGTTGCTC GAACTCAAGC TCGGCATGCG CGGGCCCGAT GATAAGGATC ACCTCGCGAA TAAGAGGATA CGGCAGGCCG GCGAGCTGAT AGCGCAGGTG TTCAGGAGCG CGTTTAGACA GCTAGTAAAA GAGATGACTT ACTCCATCGA GAGGCACACT TCCAAGACTC GGGATATCAA CCTGGTGAGT ATAGTCAGGC CGGACATAAT TACCGAGAGG TTAAACCACG CACTCGCGAC TGGCAACTGG GTGGGCGGCA GGACGGGTGT GAGCCAGATT CTGGACAGGA CGAACTACCT CTCGACAATC TCCCACTTGA GGAGGGTTGT CTCGCCGCTA TCCAGGACTC AGCCGCACTT CGAGGCTAGG GAGCTTCACC CGACGCAGTG GGGCAGGCTT TGCCCCGTAG AAAGCCCCGA GGGGCAGAAC TGCGGCTTGG TAAAGCACTT GGCGCTTCTG GCTACTCTCT CGAACGGGAC GGACGAGAAG CAGGTGTACG ACCTGCTGGT AGGCAGGCTG GGAGTAGTCC CCGTGGAGAA GACGGTGGGG AAGAATATCT CGGGGGCTAG GGTCTACCTC AACGGGCGGC TCATAGGCTA CGTTGAGGAC GGCAAGGGTC TCGCGGAGAC TTTGAGGAAG CTTCGAAGAG AGGGAAGAAT AAGTCACGAG GTCAACGTTG CCTTCTATTC TCATGAATAC ACCGTCGGCG GGGTTAAGGG TAGGATAGAG GAGGTTTACG TCAACTGCGA CGCTGGGAGG ATACGAAGAC CTCTGATAGT AGTCGAAAAC GGTGAACCGA GGCTTAAACA CGAACATGTA 
GAGCTGTTGA GGAAAGGCGA GTGGACTTGG AGCGACCTCA TAGAGAACGG CATAGTCGAG TACCTAGACG CAGAGGAAGA GGAGAACGCC TACATAGCTA CGGATGTGTC CGAGCTAACT CCTCAGCACA CTCACCTCGA GATTGTCCCG GCGGCGATCC TAGGCATTAT CGCGATGACG ATACCCTTTA TCGAGTACAA TCAGTCGCCG AGAAACTCGT ATCAGGCGGC GATGGCTAAG CAGTCCCTGG GAATACCGCA CTACAACTTC AAGCTCCGCA TGGACCCCAG GATGCACGTG ATGTACTACC CCCAGAAACC GCTCGTGAAG ACTCGCATCT TCGACCTGCT ACCCTTAGAC AACCTGCCCT ACGGCACAAA CATGGTGGTA GCGGTTCTGA CAGGCGGAGG ATACAACATC CAGGATGCGG TGGTCATCAA CAAGGCTGCG ATAGAAAGAG GCATGTCGAG GTCCGTCTTC TTTAGAACAT ACGAAGCCGA GGAGAGGAGG TATCCCGGTG GGCTCGAGGA TAGGTTCGAA AAGCCCTCCC TAGAAAAGGA CCTTCTAGAC GTTAAGCCTC CTCAGGCTTA CGAGGCTATA GACCCCGTGG ACGGCATAGC CTACGTGGAG GCAGAGCTCT ACGGCGGTCA AGCTGTGGTG AGTAGGACGA GCCCGCCGCG CTTCTACACG AGCACCCTGG AACCTAGGGT TATGACCAAG AGAAAGGACA CCTCCCTACT CCTGCGACAC GGTGAAAAAG GGATAATTGA CCGCGTCTTC ATAATGGAGA GCCCCGGAGG CATAAAGCTC GCCAAGGTAA GAGTGAGAGA TCTGCGCCCC ACGGAGCTCG GGGATAAGTT CGCCTCGCGC CATGGGCAGA AAGGCGTCGT AGGGATGCTT GTACCACAGG AAGATATGCC GTTCACGGAA GAGGGGATAA CCCCTGACCT AATAATCAAC CCGCACGCTA TTCCTTCGAG GATGACCGTC GGACAGCTAC TCGAGGCGAT AACGGGGAAG GCTGCCGCGC TCGCCGGTAG GAGGATCGAT GCTACGGCTT TTGAACCGCC GTCGTTAGAT GAGATAAGAG AGATACTCAG GAGCTATGGC TTCAGGAGCG ACGGGAAAGA GGTTCTCTAC GACGGGGTTA CCGGGGAGAA GTTGGAGGCC GAGATATTTA TCGGTGTCGT GTACTACGAG AAGCTACACC ACCTCGTTGC CGACAAGATG CACGCGAGAG CCAGGGGTAG GGTACAGATA CTAACGCGGC AACCCACGGA GGGTAGAGCG CGGGAAGGAG GTCTGAGGTT CGGCGAGATG GAGAAGGACT GCCTAGTCGG GCACGGAGCC TCCATGCTCC TTAGAGAGCG TCTCCTCGAA AGCTCAGACA AGACTACGAT ATGGGTTTGC GAGAACTGCG GCTATATGGG GTGGTTCGAC GCGAGGAAGA ATACCCCCGT ATGCCCTGTC TGCGGCGATA AGGGAAGGCT TAGCCCCGTC GAGGTATCCT ATGCGTTTAA GCTACTTTTG CAGGAGCTTA CGGGGCTAGG GCTCTCTGTG CGACTGATCC TTAAAGACAA AATCCAGTCG TGA
|
Protein sequence | MSSRAVQAKL SREDRWRVVK AYVDELGLVR QHLDSFNAFL ERGLQEIVDE VGGIKVESQG VEIKFGKIEV GQPTFREADG SDLALTPMIA RLRNITYAAP LYLTMTLYVD GEERRTESVY IGSLPIMVKS KKCVLYGLKS EDEIVKYGED PYDPGGYFIV NGSERVIVMQ EDLSVNRVLV DYGGASGSVT HTAKVFSVAA GQRSPLTVER TKDGMIYASF PACPSKIPVV VLMRALGLKT DQEIAYAIGN DPIIQQEFLP VLMEQSKIAA TPEEALDYIG SRVSPGQPRN VRIERAQAVL DENLLPHIGR GPAARISKAF FVGQMVSRLL ELKLGMRGPD DKDHLANKRI RQAGELIAQV FRSAFRQLVK EMTYSIERHT SKTRDINLVS IVRPDIITER LNHALATGNW VGGRTGVSQI LDRTNYLSTI SHLRRVVSPL SRTQPHFEAR ELHPTQWGRL CPVESPEGQN CGLVKHLALL ATLSNGTDEK QVYDLLVGRL GVVPVEKTVG KNISGARVYL NGRLIGYVED GKGLAETLRK LRREGRISHE VNVAFYSHEY TVGGVKGRIE EVYVNCDAGR IRRPLIVVEN GEPRLKHEHV ELLRKGEWTW SDLIENGIVE YLDAEEEENA YIATDVSELT PQHTHLEIVP AAILGIIAMT IPFIEYNQSP RNSYQAAMAK QSLGIPHYNF KLRMDPRMHV MYYPQKPLVK TRIFDLLPLD NLPYGTNMVV AVLTGGGYNI QDAVVINKAA IERGMSRSVF FRTYEAEERR YPGGLEDRFE KPSLEKDLLD VKPPQAYEAI DPVDGIAYVE AELYGGQAVV SRTSPPRFYT STLEPRVMTK RKDTSLLLRH GEKGIIDRVF IMESPGGIKL AKVRVRDLRP TELGDKFASR HGQKGVVGML VPQEDMPFTE EGITPDLIIN PHAIPSRMTV GQLLEAITGK AAALAGRRID ATAFEPPSLD EIREILRSYG FRSDGKEVLY DGVTGEKLEA EIFIGVVYYE KLHHLVADKM HARARGRVQI LTRQPTEGRA REGGLRFGEM EKDCLVGHGA SMLLRERLLE SSDKTTIWVC ENCGYMGWFD ARKNTPVCPV CGDKGRLSPV EVSYAFKLLL QELTGLGLSV RLILKDKIQS
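
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given in coding orientation (it starts with ATG even though the gene is on the minus strand) and uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs mainly in the start codons it permits. The helper names `clean`, `gc_content`, and `translate` are hypothetical and chosen for illustration only.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG order for
# the first, second, and third codon position; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGCTCGC GCGCCGTGCA GGCAAAGCTA"
print(translate(first_blocks))  # MSSRAVQAKL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field listed in the record.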