Gene Pnec_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_0042 
SymbolrpoB 
ID6184223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp44492 
End bp48592 
Gene Length4101 bp 
Protein Length1366 aa 
Translation table11 
GC content47% 
IMG OID641670784 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001796984 
Protein GI171462871 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.24956e-16 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACTATA GCTTCACCGA ACGCAAGCGA GTCCGCAAAA GCTTTGCTAA GCGAGTAAAT 
AACCATCAGG TTCCATACTT GATCGCAACG CAGCTGGAAT CCTACGCTAA ATTTTTACAG
GCTGATAAGC CAGCAATGTC TCGTCTTACT GAGGGACTTC AAGCTGCCTT TACATCAGCA
TTCCCAATTG TGTCTAACAA CGGCTATGCA CGTATGGAAT ACGTGTCTTA CCAGTTATCA
CAGCCACCGT TTGACGTTAA AGAATGTCAA CAACGTGGTT ACACATACCA CTCTGCCTTA
CGCGCAAAAG TTCGCTTGAT TATTTATGAT CGCGAAGCGC CCACCAAGGT TAAAGAGGTA
AAAGAGAGCG AAGTCTACAT GGGTGAAATT CCACTCATGA CAGAAAACGG TTCTTTTGTA
ATTAACGGCA CTGAGCGCGT CATCGTTTCT CAGTTGCATC GTTCCCCAGG CGTGTTCTTT
GAGCACGATA AGGGCAAGAC ACACAGCTCA GGTAAATTGC TGTTCTCAGC ACGCATCATT
CCTTACCGTG GTTCATGGCT CGACTTCGAG TTTGATCCAA AAGATATTCT CTATTTCCGC
ATTGACCGTC GTCGTAAGAT GCCTGTCACC ATTTTGCTTA AAGCAATTGG TTTAAACAAC
GAACAGATTC TTGCGAACTT CTTTAACTTT GACCATTTCT CATTGACTGC TAACGGTGGC
TCAATGGAAT TTGTGCCAGA GCGTTTACGT GGTCAGTTGG CCAGCTTTGA TGTGCTCGAC
GAGAATGGCG TTGTAGTCAT TCAAAAAGAC AAGCGTATCA ATACAAAACA CATCCGCGAA
CTCGAAGCTG CTAAGACAAA AACGATTGCT GTACCAGATG ACTATTTAAT TGGTCGTGTT
GTTGCGCGCA ATATTGTTGA TCCAGACTCT GGTGAAATCT TGGCTTATGC TAATGATGAA
ATCACTGAAG AGTTGTTGGC GACATTGCGC GATGCAGGTA TCAAGCAATT GGAAACCATC
TACACCAATG ATTTAGATTC TGGTGCGTAT ATTTCTCAGA CATTGCGTAC TGATGAAACA
GCAGACCAAA TGGCTGCTCG TATCGCCATC TACCGCATGT TGCGTCCTGG TGAGCCTCCA
ATAGAAGATG CTGTTGAAGT CTTGTTCCAG CGCCTGTTCT ACAGCGAAGA TACTTATGAT
TTGTCACGCG TTGGCCGTAT GAAGGTCAAC AGCCGTTTGA ACCGCCCAGA AATGGAAGGT
CCAATGGTTC TGTCAAGCGA AGATATTCTC GACACCATTA AGTCCCTCGT GGACTTGCGT
AACGGCAAAG GTGAAGTAGA CGACATCGAT CACTTAGGTA ATCGTCGTGT ACGTTGCGTT
GGTGAGTTGG CTGAAAACCA ATTCCGTGCT GGTTTGTCAC GTGTTGAGCG TGCGGTTAAA
GAGCGTCTCG GCCAAGCCGA AACTGAAAAC CTCATGCCGC ATGACTTGAC TAACAGCAAG
CCAATCTCTT CTGCTATTCG TGAGTTCTTC GGTTCTTCAC AGTTGTCCCA GTTTATGGAC
CAAACCAACC CACTCTCAGA GATTACGCAC AAGCGTCGTA TTTCTGCATT GGGACCTGGT
GGTTTGATGC GCGAGCGCGC AGGCTTCGAA GTGCGCGACG TGCATCCAAC CCACTACGGA
CGTGTTTGCC CAATCGAAAC TCCAGAAGGA CCAAACATTG GTTTGATCAA CTCACTCGCA
CTGTTTGCGC GTTTGAATGA GCATGGTTTC CTTGAGACTC CATACCGCAA GGTTTCCAAT
AGCAAGGTAA GCGATGAAGT TGTATACCTC TCTGCGATTG AAGAAGCCAA GTATGTGATT
GCTCAGGCGA ATGCAACGAT TGATAAGAGC GGTAAATTGG CTGACGAATT GGTTTCTGCT
CGTCAAGCTG GTGAAACCAT GATGGTTAGT CCAGAACGCA TTGATTTCAT TGACGTTGCT
CCTAGCCAAA TCGTTTCAGC TGCCGCCTCA CTCGTTCCAT TCCTAGAGCA CGATGATGCG
AACCGTGCGT TGATGGGTGC GAACATGCAG CGCCAAGCGG TTCCTTGCTT GCGTCCAGAT
AAGCCGTTGG TTGGTACTGG TTTAGAGCGT ATTGTTGCGG TTGACTCCGG CACTGTTGTA
TTAGCAGCAC GTGGCGGTAT TGTTGATTAC GTTGATGCAA ATCGTGTTGT GATTCGCGTG
AACGATGATG AGACTACAGC TGGTGAAGTT GGTGTGGATA TTTATAACCT CATTAAATAC
ACCCGCTCAA ACCAAAACAC CAACATCAAC CAACGTCCAA TCGTTAAGGT TGGCGATCGT
GTAGCCCGCG GCGACGTGGT TGCTGACGGC GCATCTACCG ATTTGGGCGA ATTGGCCTTG
GGTCAGAACA TGACTGTGGC ATTTATGCCA TGGAATGGTT ACAACTTCGA AGATTCAATC
TTGATTTCTG AGAAAGTTGT TGCTGATGAT CGCTACACCT CTATTCATAT TGAAGAGTTG
TCAGTGGTTG CGCGTGATAC CAAGCTTGGT TCAGAAGAAA TTACTCGCGA TATTTCCAAT
TTGGCAGAGT CACAACTCTC CCGTTTGGAC GAAAGCGGCA TTGTTTACAT CGGCGCTGAA
GTTGAGGCCG GCGATGTGTT GGTTGGTAAG GTGACTCCAA AGGGCGAGAC AACTCTCACT
CCTGAAGAGA AGCTCCTCCG TGCGATCTTT GGCGAAAAAG CATCTGACGT TAAAGATACC
TCTTTACGCG TTCCATCAGG AATGATCGGT ACTATTATCG ATGTTCAAGT CTTCACTCGT
GAAGGTATTG AGCGCGATGC CCGTGCACAG TCCATCATTC AAGAAGAATT ACAACGCTAT
CGTTTGGGCT TAAACGATCA GTTGCGTATT GTTGAGGGCG ATGCCTTCAT GCGTTTAGAG
AAGTTGTTGA TTGGCAAAGT TGCCAACGGC GGCCCTAAGA AATTAGCTAA AGGCACCAAG
ATCGACAAGG AATACCTTGC TGATTTGGAT AAATACCATC GGTTTGATGT TCGTCCAGCA
GATGATGAAG TTGCCTCACA AGTTGAAGCA ATTAAATCCT CTATCGAAGC GAAACGTAAA
CAGTTTGATG AAGCTTTTGA AGAGAAGCGC ACCAAACTTA CTCAGGGCGA TGATTTGCAG
CCTGGCGTAA CGAAGATGGT TAAGGTGTAC TTGGCTGTTA AACGTCGTTT GCAGCCTGGT
GACAAGATGG CCGGTCGTCA CGGTAACAAG GGCGTGGTTT CTAAAATTGC CCCTGCGGAA
GACATGCCAT TTATGGCTGA CGGACGCCCT GTTGACATCG TCTTGAACCC ATTGGGCGTT
CCTTCCCGTA TGAACGTTGG TCAGATCTTG GAAACCCACT TAGGTTGGGC GGCTCAAGGT
ATTGGTAAGC GTGTTGACGA GATGGTTCGT CAACAAGCCA AACAAGCTGA GTTACGTGAG
TTCCTCAAAC AACTTTACAA CGAAACAGGC CGTATTGAAG ACATTGACAA CTTCACTGAC
GAGCAAATCA CAGTATTGGC TGAAAATCTT CGCCAAGGCT TGCCATTTGC AACTCCAGTG
TTTGACGGTG CTACAGAAGC AGAAATCGGA CGCATGCTCG AGTTGGCTTA TCCAGAAGAA
GTAGCTACTT CTTTGAAGAT GACGCCTTCA CGTCAGCAAA TGATTTTGTG TGACGGTCGT
ACTGGTGATC AGTTTGAGCG CCCAGCAACC GTTGGCGTAA TGCACGTCTT GAAACTCCAC
CATTTGGTTG ATGACAAGAT GCATGCTCGT TCAACTGGAC CTTACTCTTT AGTAACGCAA
CAGCCATTGG GCGGTAAAGC CCAATTCGGT GGTCAGCGCT TTGGTGAGAT GGAAGTCTGG
GCCCTCGAAG CATACGGTGC TTCATATGTC TTGCAGGAAA TGCTGACAGT GAAGTCTGAT
GACGTCGCAG GCCGTACCAA GGTTTACGAA AACATCGTCA AGGGCGAGCA CACGATTGAT
GCTGGCATGC CCGAATCCTT CAACGTGCTG GTAAAAGAAA TCCGTTCGTT GGGTATTGAG
ATTGACATGG AGCGCAACTG A
 
Protein sequence
MNYSFTERKR VRKSFAKRVN NHQVPYLIAT QLESYAKFLQ ADKPAMSRLT EGLQAAFTSA 
FPIVSNNGYA RMEYVSYQLS QPPFDVKECQ QRGYTYHSAL RAKVRLIIYD REAPTKVKEV
KESEVYMGEI PLMTENGSFV INGTERVIVS QLHRSPGVFF EHDKGKTHSS GKLLFSARII
PYRGSWLDFE FDPKDILYFR IDRRRKMPVT ILLKAIGLNN EQILANFFNF DHFSLTANGG
SMEFVPERLR GQLASFDVLD ENGVVVIQKD KRINTKHIRE LEAAKTKTIA VPDDYLIGRV
VARNIVDPDS GEILAYANDE ITEELLATLR DAGIKQLETI YTNDLDSGAY ISQTLRTDET
ADQMAARIAI YRMLRPGEPP IEDAVEVLFQ RLFYSEDTYD LSRVGRMKVN SRLNRPEMEG
PMVLSSEDIL DTIKSLVDLR NGKGEVDDID HLGNRRVRCV GELAENQFRA GLSRVERAVK
ERLGQAETEN LMPHDLTNSK PISSAIREFF GSSQLSQFMD QTNPLSEITH KRRISALGPG
GLMRERAGFE VRDVHPTHYG RVCPIETPEG PNIGLINSLA LFARLNEHGF LETPYRKVSN
SKVSDEVVYL SAIEEAKYVI AQANATIDKS GKLADELVSA RQAGETMMVS PERIDFIDVA
PSQIVSAAAS LVPFLEHDDA NRALMGANMQ RQAVPCLRPD KPLVGTGLER IVAVDSGTVV
LAARGGIVDY VDANRVVIRV NDDETTAGEV GVDIYNLIKY TRSNQNTNIN QRPIVKVGDR
VARGDVVADG ASTDLGELAL GQNMTVAFMP WNGYNFEDSI LISEKVVADD RYTSIHIEEL
SVVARDTKLG SEEITRDISN LAESQLSRLD ESGIVYIGAE VEAGDVLVGK VTPKGETTLT
PEEKLLRAIF GEKASDVKDT SLRVPSGMIG TIIDVQVFTR EGIERDARAQ SIIQEELQRY
RLGLNDQLRI VEGDAFMRLE KLLIGKVANG GPKKLAKGTK IDKEYLADLD KYHRFDVRPA
DDEVASQVEA IKSSIEAKRK QFDEAFEEKR TKLTQGDDLQ PGVTKMVKVY LAVKRRLQPG
DKMAGRHGNK GVVSKIAPAE DMPFMADGRP VDIVLNPLGV PSRMNVGQIL ETHLGWAAQG
IGKRVDEMVR QQAKQAELRE FLKQLYNETG RIEDIDNFTD EQITVLAENL RQGLPFATPV
FDGATEAEIG RMLELAYPEE VATSLKMTPS RQQMILCDGR TGDQFERPAT VGVMHVLKLH
HLVDDKMHAR STGPYSLVTQ QPLGGKAQFG GQRFGEMEVW ALEAYGASYV LQEMLTVKSD
DVAGRTKVYE NIVKGEHTID AGMPESFNVL VKEIRSLGIE IDMERN