Gene RPB_2286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2286 
Symbol 
ID3909667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2638678 
End bp2642880 
Gene Length4203 bp 
Protein Length1400 aa 
Translation table11 
GC content64% 
IMG OID637884183 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_485902 
Protein GI86749406 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.604145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAAG AAATCATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG 
ATCCGGATCT CGATCGCGTC GCCGGAGAAG ATTCTGTCGT GGTCCTACGG CGAGATCAAG
AAGCCGGAAA CCATCAATTA CCGCACCTTC AAGCCCGAGC GCGACGGCCT GTTCTGCGCC
CGCATCTTCG GGCCGATCAA GGATTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG
TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CGCTGTCGCG CGTCCGACGC
GAGCGCATGG GCCACATCGA GCTGGCGGCA CCGGTCGCCC ACATCTGGTT CCTGAAGTCC
TTGCCGTCGC GGATCGGGCA GTTGCTCGAC ATGACGCTGA AGGACCTCGA GCGGATTCTG
TACTTCGAAT ACTACGTCGT GCTGGAGCCG GGCCTGACCG ACCTCAAGGA GCGTCAGCTG
CTGTCGGAGG AGGAGTACCT CCGCGCCCAG GATCAGTACG GCCAGGACAG CTTCACCGCC
ATGATCGGCG CCGAAGCGAT CCGCGAATTG CTGAAGGGAC TCGAGCTCGA GAAGATCGAT
GCGCAGCTGC GCGTCGAGAT GGCCGAGACC GACTCCGACA TCAAGCACAA GAAGCTCGCC
AAGCGGCTGA AGATCGTCGA GGCGTTCCGC TTCTCCGGCA ACAAGCCGGA GTGGATGATC
CTGACCGTGG TGCCGGTGAT TCCGCCGGAC CTGCGGCCGC TGGTGCCGCT CGACGGCGGC
CGGTTCGCGA CCTCGGATCT GAACGACCTG TATCGCCGCG TCATCAACCG TAACAACCGG
TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGGATG
CTGCAGGAAG CCGTCGACGC GTTGTTCGAC AACGGCCGCC GCGGCCGCGT CATCACCGGC
GCCAACAAGC GTCCGTTGAA GTCGCTCGCC GACATGCTGA AGGGCAAGCA GGGTCGGTTC
CGTCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCGGGCC GTTCGGTCAT CGTCGTCGGC
CCCGAGCTGA AGCTGCACCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG
CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGTCTGTCGA CCACGGTGAA GCAGGCGAAG
AAGCTCGTCG AAAAGGAGCG TCCCGAGGTC TGGGACATCC TCGACGAAGT CATTCGCGAG
CATCCGGTGC TGCTGAACCG CGCCCCGACG CTGCATCGCC TCGGCATCCA GGCGTTCGAG
CCGGTGCTGA TCGAGGGCAA GGCGATCCAG CTTCACCCGC TGGTCTGCGC CGCGTTCAAC
GCCGATTTCG ACGGCGACCA GATGGCCGTG CACGTCCCGC TGTCGCTCGA AGCGCAGCTC
GAAGCGCGCG TGCTGATGAT GTCGACCAAC AACATCCTGC ATCCGGCGAA CGGCCAGCCG
ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ACCTGTCGAT CCTGCGGGAA
GGGCTGCCGG GCGAGGGCAA GGTGTTCGGC GACCTCGCCG AGCTGGAGCA CGCGCTGTTC
TCCAAGGTCA TCCACCTGCA CACCAAGATC AAATATCGCT GGGACTCGCT CGACGACGAG
GGCAAGCCGT ACCAGCGGCT GATCGAGACC ACCGCCGGCC GCATCCTGCT CGGCCAGGTT
CTGCCGAAAT CGGTGAAGCT GCCGTTCGAG GTCATCAACA AGCTGATGAC CAAGCGCGAG
ATCTCCAGCG TGATCGATCA GGTCTATCGC CACTGCGGCC AGAAGGAGAC GGTGATCTTC
TGCGACCGCA TCATGGCGCT GGGCTTCTTC AACGCGTTCA AGGCGGGCAT CTCGTTCGGC
AAGGACGACA TGGTCGTGCC GGCCTCGAAG TGGAAGATCG TCGACACCAC GCGTACGCTG
GCGAAGGATT TCGAGCAGCA GTACAACGAC GGTCTGATCA CCCACGGCGA GAAGTACAAC
AAGGTGGTCG ACGCCTGGTC GAAGGCCACC GAGGAAATCG CCAAGGAGAT GATGAAGGAG
ATCTCCGCGG TCCGGAAGAA CGCATCCGGC GCGGAGACCC AGGTCAACTC GATCTACATG
ATGGCCCATT CCGGCGCGCG TGGTTCGCCC GCCCAGATGC GTCAGCTCGC CGGCATGCGC
GGCCTGATGG CCAAGCCGTC GGGTGAGATC ATCGAGACGC CGATCATCTC GAACTTCAAG
GAAGGCCTCT CGGTTCTCGA GTACTTCAAC TCGACCCACG GCGCCCGTAA GGGCCTCGCG
GACACCGCGT TGAAGACCGC GAACTCCGGC TACCTGACCC GCCGTCTGGT CGACGTGGCG
CAGGACTGCA TCATCACCCA GGATGATTGC GGCACCTCGC TCGGCATCAA GATGCGGGCG
ATCATCGACG CCGGCACCGT GGTGGCGTCG CTCGGCTCGC GCATCCTCGG CCGCACCGCG
GGCGAAGACG TGCGCGATCC GCAGACCAAC GAGGTGATCG TCAAGAAGGG CCAGCTGATG
GAGGAGCGCG ACGTCGAGGC GATCCACCAG GCCGGCGTCC AGGAAGTGAA GATCCGCTCG
GCGCTGACCT GCGAACTGGT CAACGGCATC TGCGGCAAGT GCTACGGGCG CGATCTCGCC
CGCGGCACCC CGGTCAACCA CGGCGAGGCG GTCGGTGTCA TCGCCGCGCA GTCGATCGGC
GAACCCGGCA CGCAGCTGAC GATGCGTACC TTCCACATCG GCGGCGCGGC GCAGATCAAC
GAGCAGTCGG TGATCGAGTC GAACTTCGAG GGTAAGGTCG TCATCAAGAA CAAGGCGATC
GCCCGCAACG GCGAGAACCA CAGCGTCGCG ATGGTTCGCA ACATGGTGGT TGCGATCGTC
GATCCGGACG GCACCGAACG GGCGACCCAC CGCATCCAGT ACGGCGCGCG CATGCACGTC
GACGAGGGCG ACACGGTGAA GCGCGGCCAG CGCATCGCCG AGTGGGATCC GTACAGCCGC
CCGGTGCTGA CCGAGGTCGA GGGCACGATC GATTTCGAGG ATCTGGTCGA AGACCAGTCG
ATCTCGGAAA CGCTCGACGA ATCCACCGGC ATCGCCAAGC GTATCGTCAT CGACTGGCGC
TCGACCCGCG GCGGCGCCGA TCTGCGCCCG GCGATCGTGG TCAAGGGCAA GGACGGCAAG
GTGCTGAAGC TGGCGCGCGG CGGTGACGCC CGCTACATGC TGTCGGTCGA CGCCATTCTG
TCGGTCGACG TCGGCGCCAA GGTGAAGCCG GGCGACATCC TCGCGCGTAT CTCGACCGAG
AGCGCGAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGCG TCGCCGAACT GTTCGAGGCC
CGCAAGCCGA AGGACGCGGC GATCATCGCG GAAATCGCCG GCACCATCCG GTTCGGCCGC
GACTACAAGA ACAAGCGTCG GATCTCGATC GAACCGATGG ACAGCGAAGA AGAGGCGCGC
GAGTACCTGA TCCCGAAGGG CAAGCACATC CACCTTCAGG ACGGCGACAT CGTGGAAAAG
GGCGACTTCA TCGTCGAAGG CAACCCGGCG CCGCACGACA TCCTGGCGAT CAAGGGCATC
GAGGAACTCG CTGCCTATCT GGTCAACGAA ATCCAGGAGG TCTACCGGCT CCAGGGCGTG
TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GGTCGAGATC
ACCGACCAGG GCGAGACCGA CATGATCTCG GGCGAACAGA TCGACAAGAT CGAGTTCGAC
CAGCTCAACG CCAAGGCGCG CGACGAGGGC AAGAAGATCG CCACGGGAAC GCCGGTTCTG
CTCGGCATCA CCAAGGCGAG CCTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG
GAGACCACCC GCGTGCTCAC CGAAGCCGCC GTCAACGGCA AGGTGGACCC GCTGGAAGGC
CTCAAGGAAA ACGTCATCGT CGGCCGGCTG ATCCCGGCCG GCACCGGCGC CTCGATGGCC
AAGCTCCGCG AAGTCGCGAT GAAGCGGGAT CGCATGATCC TCGACGAACG CGAGAAGCAG
GCGACCATCG TGCCGCCCGC GGCCCCGGAA GCCGAGCCGC TGGCGCTGCC GCCGGTCGAA
TAA
 
Protein sequence
MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA 
RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS
LPSRIGQLLD MTLKDLERIL YFEYYVVLEP GLTDLKERQL LSEEEYLRAQ DQYGQDSFTA
MIGAEAIREL LKGLELEKID AQLRVEMAET DSDIKHKKLA KRLKIVEAFR FSGNKPEWMI
LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM
LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVIVVG
PELKLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE
HPVLLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCAAFN ADFDGDQMAV HVPLSLEAQL
EARVLMMSTN NILHPANGQP IIVPSQDIVL GLYYLSILRE GLPGEGKVFG DLAELEHALF
SKVIHLHTKI KYRWDSLDDE GKPYQRLIET TAGRILLGQV LPKSVKLPFE VINKLMTKRE
ISSVIDQVYR HCGQKETVIF CDRIMALGFF NAFKAGISFG KDDMVVPASK WKIVDTTRTL
AKDFEQQYND GLITHGEKYN KVVDAWSKAT EEIAKEMMKE ISAVRKNASG AETQVNSIYM
MAHSGARGSP AQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA
DTALKTANSG YLTRRLVDVA QDCIITQDDC GTSLGIKMRA IIDAGTVVAS LGSRILGRTA
GEDVRDPQTN EVIVKKGQLM EERDVEAIHQ AGVQEVKIRS ALTCELVNGI CGKCYGRDLA
RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSVIESNFE GKVVIKNKAI
ARNGENHSVA MVRNMVVAIV DPDGTERATH RIQYGARMHV DEGDTVKRGQ RIAEWDPYSR
PVLTEVEGTI DFEDLVEDQS ISETLDESTG IAKRIVIDWR STRGGADLRP AIVVKGKDGK
VLKLARGGDA RYMLSVDAIL SVDVGAKVKP GDILARISTE SAKTRDITGG LPRVAELFEA
RKPKDAAIIA EIAGTIRFGR DYKNKRRISI EPMDSEEEAR EYLIPKGKHI HLQDGDIVEK
GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKVEI
TDQGETDMIS GEQIDKIEFD QLNAKARDEG KKIATGTPVL LGITKASLQT RSFFSAASFQ
ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KLREVAMKRD RMILDEREKQ
ATIVPPAAPE AEPLALPPVE