Gene RPD_3194 details

Gene Information

Locus tag: RPD_3194
Symbol:
ID: 4023699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3539592
End bp: 3543794
Gene Length: 4203 bp
Protein Length: 1400 aa
Translation table: 11
GC content: 63%
IMG OID: 637963395
Product: DNA-directed RNA polymerase subunit beta'
Protein accession: YP_570321
Protein GI: 91977662
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID: [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form
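
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions about the database's conventions. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3539592
end_bp = 3543794
gene_length_bp = 4203
protein_length_aa = 1400

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length agrees
```

Under these assumptions all three checks agree with the reported values.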


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0226545
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.686552
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAAG AAATCATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG 
ATCCGGATCT CGATCGCGTC GCCGGAGAAG ATTCTGTCGT GGTCCTACGG CGAGATCAAG
AAGCCGGAAA CCATCAACTA CCGCACCTTC AAGCCCGAGC GTGACGGCCT GTTCTGCGCC
CGCATCTTCG GGCCGATCAA GGACTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG
TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CGCTCTCCCG CGTCCGTCGC
GAGCGCATGG GCCACATCGA GCTGGCCGCG CCGGTCGCCC ACATCTGGTT CCTGAAGTCC
TTGCCGTCGC GTATCGGGCA GCTGCTCGAC ATGACGTTGA AGGACCTCGA GCGGATTCTG
TATTTCGAAT ACTACGTCGT GCTCGAGCCG GGCCTGACCG ACCTCAAGGA GCGTCAGCTC
CTGTCCGAGG AAGAGTATCT GCGCGCCCAG GACCAATACG GCCAGGACAG CTTCACCGCC
ATGATCGGCG CCGAAGCGAT CCGCGAGTTG CTGAAGGGCC TCGAGCTCGA AAAGATCGAT
GCGCAGCTGC GCGTCGAGAT GGCCGAGACC GACAGCGACA TCAAGCACAA GAAGCTCGCC
AAGCGGCTGA AGATCGTCGA GGCGTTCCGC TACTCCGGCA ACAAGCCCGA GTGGATGATC
CTGACCGTGG TGCCGGTGAT TCCGCCGGAC CTGCGGCCGC TGGTGCCGCT CGACGGCGGC
CGGTTCGCGA CCTCCGATCT CAACGACCTG TACCGCCGCG TCATCAACCG TAACAACCGC
TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGCATG
CTGCAGGAGG CCGTCGACGC GCTGTTCGAC AACGGCCGCC GCGGCCGCGT CATCACCGGC
GCCAACAAGC GTCCGCTGAA GTCGCTGGCC GACATGCTGA AGGGCAAGCA GGGCCGGTTC
CGTCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCCGGCC GTTCGGTGAT CGTGGTCGGT
CCGGAGCTCA AGCTGCACCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG
CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGTCTGTCGA CCACCGTGAA GCAGGCGAAG
AAGCTGGTCG AGAAGGAGCG TCCCGAGGTC TGGGACATCC TCGACGAGGT GATCCGCGAG
CATCCGGTGC TGCTCAACCG CGCGCCGACG CTGCATCGCC TGGGCATTCA GGCGTTCGAG
CCGGTGCTGA TCGAGGGCAA GGCGATCCAG CTTCACCCGC TGGTCTGCGC CGCGTTCAAC
GCCGACTTCG ACGGCGACCA GATGGCCGTG CACGTCCCGC TGTCGCTCGA AGCGCAGCTG
GAAGCGCGCG TGCTGATGAT GTCTACCAAC AACATCCTGC ATCCCGCGAA CGGCCAGCCG
ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ATCTGTCGAT CCTGCGGGAA
GGGCTGCCGG GCGAGGGCAA GGTGTTCGGC GATCTCGCGG AGCTCGAGCA CGCGCTGCAC
GCCAAGGTCA TCCACCTCCA CACCAAGATC AAGTATCGCT GGGATTCCCT CGACGACGAG
GGCAAGCCGT ACAAGCGTCT GATCGAGACC ACCGCGGGCC GCATCCTGCT CGGTCAGGTT
CTGCCGAAGT CGGTGAAGCT GCCTTACGAG ACCATCAACA AGCTGATGAC GAAGCGCGAG
ATCTCGAGCG TCATCGATCA GGTCTATCGT CACTGCGGCC AGAAGGAGAC GGTGATCTTC
TGCGACCGCA TCATGGCGCT CGGCTTCTTC AACGCGTTCA AGGCGGGCAT TTCGTTCGGC
AAGGACGACA TGGTCGTGCC GGCCTCGAAG TGGAAGATCG TCGACACCAC CCGTACGCTC
GCGAAGGATT TCGAGCAGCA GTACAACGAC GGTCTGATCA CCCACGGCGA GAAGTACAAC
AAGGTGGTCG ACGCCTGGTC GAAGGCCACC GAGGAAATCG CCAAGGAGAT GATGAAGGAA
ATCTCCGCGG TCCGGAAGAA TGCTTCGGGC GCGGAGAGCC AGGTCAACTC GATCTACATG
ATGGCCCACT CCGGTGCGCG TGGTTCGCCG GCGCAGATGC GTCAGCTCGC CGGTATGCGC
GGCCTGATGG CCAAGCCGTC GGGTGAGATC ATCGAGACGC CGATCATCTC GAACTTCAAG
GAAGGCCTCT CGGTTCTCGA ATACTTCAAC TCGACCCACG GCGCCCGCAA GGGTCTCGCC
GACACCGCGT TGAAGACCGC GAACTCGGGT TACCTGACCC GCCGTCTGGT CGACGTGGCG
CAGGACTGCA TCATCACCCA GGATGATTGC GGCACCTCGC TCGGCATCAA GATGCGGGCG
ATCATCGACG CCGGCACCGT GGTGGCGTCG CTCGGCTCCC GCATCCTCGG CCGCACCGCG
GGCGAAGACG TGCGCGATCC GCAGACCAAC GAGGTGATCG TCAAGAAGGG CGAGCTGATG
GAGGAGCGCG ATATCGAAGC GATCCACCAG GCCGGCGTCC AGGAAGTGAA GATCCGCTCG
GCGCTGACCT GCGAGCTGGT CAACGGCATC TGCGGAAAGT GCTACGGGCG CGATCTCGCC
CGCGGCACGC CGGTCAACCA CGGCGAGGCG GTCGGCGTCA TCGCGGCGCA GTCGATCGGC
GAGCCGGGCA CGCAGCTCAC GATGCGTACC TTCCACATCG GCGGTGCGGC GCAGATCAAC
GAGCAATCGG TGATCGAGTC GAACTTCGAG GGTAAGGTCG TCATCAAGAA CAAGGCGATC
GCCCGGAACG GCGAGAACCA CAACGTGGCG ATGGTCCGCA ACATGGTCGT TGCGATCGTC
GACCCGGATG GCACCGAGCG TGCGACCCAT CGCATCCAGT ACGGCGCGCG GATGCACGTC
GACGAGGGCG ATACGATCAA GCGCGGCCAT CGCATCGCCG AGTGGGACCC GTACAGCCGG
CCGGTTCTGA CCGAGGTCGA AGGTACGATC GATTTCGAGG ATCTGATCGA AGATCAGTCG
ATCTCGGAAA CGCTCGACGA ATCGACCGGT ATCGCCAAGC GTATCGTGAT CGACTGGCGC
TCGACCCGCG GCGGCGCGGA TCTGCGTCCG GCGATCGTGA TCAAGGGCAA GGACGGCAAG
GTGCTGAAGC TGGCGCGCGG CGGCGACGCC CGCTACATGC TGTCGGTCGA CGCCATCCTG
TCGGTCGACG TCGGCGCCAA GGTCAAGCCG GGCGACATTC TCGCGCGTAT CTCGACCGAA
AGCGCGAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGCG TCGCGGAACT GTTCGAGGCC
CGCAAGCCGA AGGACGCCGC GATCATCGCC GAAATCGCCG GCACCATCCG GTTCGGACGC
GACTACAAGA ACAAGCGTCG GATCTCGATC GAGCCGATGG ACAAGGAAGA AGAGGCGCGC
GAGTACCTGA TCCCGAAGGG CAAGCACATC CACCTTCAGG ACGGCGACAT CGTCGAAAAG
GGCGACTTCA TCGTCGAAGG CAACCCGGCG CCGCACGACA TCCTGGCGAT CAAGGGCATC
GAGGAACTCG CTGCCTATCT GGTCAACGAA ATCCAGGAGG TCTATCGGCT CCAGGGCGTG
TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GGTCGAGATC
ACCGACCAGG GCGAGACCGA CATGATCTCG GGCGAACAGA TCGACAAGAT CGAGTTCGAC
CAGCTCAACG TCAAAGCGAG GGACGAGGGC AAGAAGATCG CCACGGGAAC GCCGGTTCTG
CTCGGCATCA CCAAAGCGAG CCTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG
GAGACCACCC GCGTGCTCAC CGAAGCCGCC GTCAACGGCA AGGTGGACCC GCTGGAAGGC
CTCAAGGAAA ACGTCATCGT CGGCCGGCTG ATTCCGGCTG GCACCGGCGC CTCGATGGCC
AAGATCCGCG AAGTCGCGAT GAAGCGGGAT CGCATGATCC TCGACGAGCG CGAGAAGCAG
GCGACCATCG TACCGCCGGC CGCTCCGGAA GCCGAGCCGC TGGCGCTGCC GCCGGCCGAG
TAA
 
Protein sequence
MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA 
RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS
LPSRIGQLLD MTLKDLERIL YFEYYVVLEP GLTDLKERQL LSEEEYLRAQ DQYGQDSFTA
MIGAEAIREL LKGLELEKID AQLRVEMAET DSDIKHKKLA KRLKIVEAFR YSGNKPEWMI
LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM
LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVIVVG
PELKLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE
HPVLLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCAAFN ADFDGDQMAV HVPLSLEAQL
EARVLMMSTN NILHPANGQP IIVPSQDIVL GLYYLSILRE GLPGEGKVFG DLAELEHALH
AKVIHLHTKI KYRWDSLDDE GKPYKRLIET TAGRILLGQV LPKSVKLPYE TINKLMTKRE
ISSVIDQVYR HCGQKETVIF CDRIMALGFF NAFKAGISFG KDDMVVPASK WKIVDTTRTL
AKDFEQQYND GLITHGEKYN KVVDAWSKAT EEIAKEMMKE ISAVRKNASG AESQVNSIYM
MAHSGARGSP AQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA
DTALKTANSG YLTRRLVDVA QDCIITQDDC GTSLGIKMRA IIDAGTVVAS LGSRILGRTA
GEDVRDPQTN EVIVKKGELM EERDIEAIHQ AGVQEVKIRS ALTCELVNGI CGKCYGRDLA
RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSVIESNFE GKVVIKNKAI
ARNGENHNVA MVRNMVVAIV DPDGTERATH RIQYGARMHV DEGDTIKRGH RIAEWDPYSR
PVLTEVEGTI DFEDLIEDQS ISETLDESTG IAKRIVIDWR STRGGADLRP AIVIKGKDGK
VLKLARGGDA RYMLSVDAIL SVDVGAKVKP GDILARISTE SAKTRDITGG LPRVAELFEA
RKPKDAAIIA EIAGTIRFGR DYKNKRRISI EPMDKEEEAR EYLIPKGKHI HLQDGDIVEK
GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKVEI
TDQGETDMIS GEQIDKIEFD QLNVKARDEG KKIATGTPVL LGITKASLQT RSFFSAASFQ
ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KIREVAMKRD RMILDEREKQ
ATIVPPAAPE AEPLALPPAE
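
Both sequences are displayed in blocks of ten characters, so the spaces and line breaks must be removed before any computation. The sketch below, which is not part of the original record, shows one way to do that in Python and then translate the first line of the gene sequence and compare it with the first two blocks of the protein sequence. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order. Assumption: translation table 11 uses
# the same codon-to-amino-acid mapping as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks used for display blocking."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two protein blocks from this record.
first_gene_line = "ATGAACCAAG AAATCATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG"
first_protein_blocks = "MNQEIMNLFN PTTPAQVFDQ"

print(translate(first_gene_line) == clean(first_protein_blocks))
print(round(gc_fraction(first_gene_line), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence, protein length, and GC content reported in the Gene Information fields.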