Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3194 |
Symbol | |
ID | 4023699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3539592 |
End bp | 3543794 |
Gene Length | 4203 bp |
Protein Length | 1400 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637963395 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_570321 |
Protein GI | 91977662 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0226545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.686552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAAG AAATCATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG ATCCGGATCT CGATCGCGTC GCCGGAGAAG ATTCTGTCGT GGTCCTACGG CGAGATCAAG AAGCCGGAAA CCATCAACTA CCGCACCTTC AAGCCCGAGC GTGACGGCCT GTTCTGCGCC CGCATCTTCG GGCCGATCAA GGACTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CGCTCTCCCG CGTCCGTCGC GAGCGCATGG GCCACATCGA GCTGGCCGCG CCGGTCGCCC ACATCTGGTT CCTGAAGTCC TTGCCGTCGC GTATCGGGCA GCTGCTCGAC ATGACGTTGA AGGACCTCGA GCGGATTCTG TATTTCGAAT ACTACGTCGT GCTCGAGCCG GGCCTGACCG ACCTCAAGGA GCGTCAGCTC CTGTCCGAGG AAGAGTATCT GCGCGCCCAG GACCAATACG GCCAGGACAG CTTCACCGCC ATGATCGGCG CCGAAGCGAT CCGCGAGTTG CTGAAGGGCC TCGAGCTCGA AAAGATCGAT GCGCAGCTGC GCGTCGAGAT GGCCGAGACC GACAGCGACA TCAAGCACAA GAAGCTCGCC AAGCGGCTGA AGATCGTCGA GGCGTTCCGC TACTCCGGCA ACAAGCCCGA GTGGATGATC CTGACCGTGG TGCCGGTGAT TCCGCCGGAC CTGCGGCCGC TGGTGCCGCT CGACGGCGGC CGGTTCGCGA CCTCCGATCT CAACGACCTG TACCGCCGCG TCATCAACCG TAACAACCGC TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGCATG CTGCAGGAGG CCGTCGACGC GCTGTTCGAC AACGGCCGCC GCGGCCGCGT CATCACCGGC GCCAACAAGC GTCCGCTGAA GTCGCTGGCC GACATGCTGA AGGGCAAGCA GGGCCGGTTC CGTCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCCGGCC GTTCGGTGAT CGTGGTCGGT CCGGAGCTCA AGCTGCACCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGTCTGTCGA CCACCGTGAA GCAGGCGAAG AAGCTGGTCG AGAAGGAGCG TCCCGAGGTC TGGGACATCC TCGACGAGGT GATCCGCGAG CATCCGGTGC TGCTCAACCG CGCGCCGACG CTGCATCGCC TGGGCATTCA GGCGTTCGAG CCGGTGCTGA TCGAGGGCAA GGCGATCCAG CTTCACCCGC TGGTCTGCGC CGCGTTCAAC GCCGACTTCG ACGGCGACCA GATGGCCGTG CACGTCCCGC TGTCGCTCGA AGCGCAGCTG GAAGCGCGCG TGCTGATGAT GTCTACCAAC AACATCCTGC ATCCCGCGAA CGGCCAGCCG ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ATCTGTCGAT CCTGCGGGAA GGGCTGCCGG GCGAGGGCAA GGTGTTCGGC GATCTCGCGG AGCTCGAGCA CGCGCTGCAC GCCAAGGTCA TCCACCTCCA CACCAAGATC AAGTATCGCT GGGATTCCCT CGACGACGAG GGCAAGCCGT ACAAGCGTCT GATCGAGACC ACCGCGGGCC GCATCCTGCT CGGTCAGGTT CTGCCGAAGT CGGTGAAGCT GCCTTACGAG ACCATCAACA AGCTGATGAC GAAGCGCGAG ATCTCGAGCG TCATCGATCA GGTCTATCGT CACTGCGGCC AGAAGGAGAC GGTGATCTTC TGCGACCGCA TCATGGCGCT CGGCTTCTTC AACGCGTTCA AGGCGGGCAT TTCGTTCGGC AAGGACGACA TGGTCGTGCC GGCCTCGAAG TGGAAGATCG TCGACACCAC CCGTACGCTC GCGAAGGATT TCGAGCAGCA GTACAACGAC GGTCTGATCA CCCACGGCGA GAAGTACAAC AAGGTGGTCG ACGCCTGGTC GAAGGCCACC GAGGAAATCG CCAAGGAGAT GATGAAGGAA ATCTCCGCGG TCCGGAAGAA TGCTTCGGGC GCGGAGAGCC AGGTCAACTC GATCTACATG ATGGCCCACT CCGGTGCGCG TGGTTCGCCG GCGCAGATGC GTCAGCTCGC CGGTATGCGC GGCCTGATGG CCAAGCCGTC GGGTGAGATC ATCGAGACGC CGATCATCTC GAACTTCAAG GAAGGCCTCT CGGTTCTCGA ATACTTCAAC TCGACCCACG GCGCCCGCAA GGGTCTCGCC GACACCGCGT TGAAGACCGC GAACTCGGGT TACCTGACCC GCCGTCTGGT CGACGTGGCG CAGGACTGCA TCATCACCCA GGATGATTGC GGCACCTCGC TCGGCATCAA GATGCGGGCG ATCATCGACG CCGGCACCGT GGTGGCGTCG CTCGGCTCCC GCATCCTCGG CCGCACCGCG GGCGAAGACG TGCGCGATCC GCAGACCAAC GAGGTGATCG TCAAGAAGGG CGAGCTGATG GAGGAGCGCG ATATCGAAGC GATCCACCAG GCCGGCGTCC AGGAAGTGAA GATCCGCTCG GCGCTGACCT GCGAGCTGGT CAACGGCATC TGCGGAAAGT GCTACGGGCG CGATCTCGCC CGCGGCACGC CGGTCAACCA CGGCGAGGCG GTCGGCGTCA TCGCGGCGCA GTCGATCGGC GAGCCGGGCA CGCAGCTCAC GATGCGTACC TTCCACATCG GCGGTGCGGC GCAGATCAAC GAGCAATCGG TGATCGAGTC GAACTTCGAG GGTAAGGTCG TCATCAAGAA CAAGGCGATC GCCCGGAACG GCGAGAACCA CAACGTGGCG ATGGTCCGCA ACATGGTCGT TGCGATCGTC GACCCGGATG GCACCGAGCG TGCGACCCAT CGCATCCAGT ACGGCGCGCG GATGCACGTC GACGAGGGCG ATACGATCAA GCGCGGCCAT CGCATCGCCG AGTGGGACCC GTACAGCCGG CCGGTTCTGA CCGAGGTCGA AGGTACGATC GATTTCGAGG ATCTGATCGA AGATCAGTCG ATCTCGGAAA CGCTCGACGA ATCGACCGGT ATCGCCAAGC GTATCGTGAT CGACTGGCGC TCGACCCGCG GCGGCGCGGA TCTGCGTCCG GCGATCGTGA TCAAGGGCAA GGACGGCAAG GTGCTGAAGC TGGCGCGCGG CGGCGACGCC CGCTACATGC TGTCGGTCGA CGCCATCCTG TCGGTCGACG TCGGCGCCAA GGTCAAGCCG GGCGACATTC TCGCGCGTAT CTCGACCGAA AGCGCGAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGCG TCGCGGAACT GTTCGAGGCC CGCAAGCCGA AGGACGCCGC GATCATCGCC GAAATCGCCG GCACCATCCG GTTCGGACGC GACTACAAGA ACAAGCGTCG GATCTCGATC GAGCCGATGG ACAAGGAAGA AGAGGCGCGC GAGTACCTGA TCCCGAAGGG CAAGCACATC CACCTTCAGG ACGGCGACAT CGTCGAAAAG GGCGACTTCA TCGTCGAAGG CAACCCGGCG CCGCACGACA TCCTGGCGAT CAAGGGCATC GAGGAACTCG CTGCCTATCT GGTCAACGAA ATCCAGGAGG TCTATCGGCT CCAGGGCGTG TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GGTCGAGATC ACCGACCAGG GCGAGACCGA CATGATCTCG GGCGAACAGA TCGACAAGAT CGAGTTCGAC CAGCTCAACG TCAAAGCGAG GGACGAGGGC AAGAAGATCG CCACGGGAAC GCCGGTTCTG CTCGGCATCA CCAAAGCGAG CCTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG GAGACCACCC GCGTGCTCAC CGAAGCCGCC GTCAACGGCA AGGTGGACCC GCTGGAAGGC CTCAAGGAAA ACGTCATCGT CGGCCGGCTG ATTCCGGCTG GCACCGGCGC CTCGATGGCC AAGATCCGCG AAGTCGCGAT GAAGCGGGAT CGCATGATCC TCGACGAGCG CGAGAAGCAG GCGACCATCG TACCGCCGGC CGCTCCGGAA GCCGAGCCGC TGGCGCTGCC GCCGGCCGAG TAA
|
Protein sequence | MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS LPSRIGQLLD MTLKDLERIL YFEYYVVLEP GLTDLKERQL LSEEEYLRAQ DQYGQDSFTA MIGAEAIREL LKGLELEKID AQLRVEMAET DSDIKHKKLA KRLKIVEAFR YSGNKPEWMI LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVIVVG PELKLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE HPVLLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCAAFN ADFDGDQMAV HVPLSLEAQL EARVLMMSTN NILHPANGQP IIVPSQDIVL GLYYLSILRE GLPGEGKVFG DLAELEHALH AKVIHLHTKI KYRWDSLDDE GKPYKRLIET TAGRILLGQV LPKSVKLPYE TINKLMTKRE ISSVIDQVYR HCGQKETVIF CDRIMALGFF NAFKAGISFG KDDMVVPASK WKIVDTTRTL AKDFEQQYND GLITHGEKYN KVVDAWSKAT EEIAKEMMKE ISAVRKNASG AESQVNSIYM MAHSGARGSP AQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA DTALKTANSG YLTRRLVDVA QDCIITQDDC GTSLGIKMRA IIDAGTVVAS LGSRILGRTA GEDVRDPQTN EVIVKKGELM EERDIEAIHQ AGVQEVKIRS ALTCELVNGI CGKCYGRDLA RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSVIESNFE GKVVIKNKAI ARNGENHNVA MVRNMVVAIV DPDGTERATH RIQYGARMHV DEGDTIKRGH RIAEWDPYSR PVLTEVEGTI DFEDLIEDQS ISETLDESTG IAKRIVIDWR STRGGADLRP AIVIKGKDGK VLKLARGGDA RYMLSVDAIL SVDVGAKVKP GDILARISTE SAKTRDITGG LPRVAELFEA RKPKDAAIIA EIAGTIRFGR DYKNKRRISI EPMDKEEEAR EYLIPKGKHI HLQDGDIVEK GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKVEI TDQGETDMIS GEQIDKIEFD QLNVKARDEG KKIATGTPVL LGITKASLQT RSFFSAASFQ ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KIREVAMKRD RMILDEREKQ ATIVPPAAPE AEPLALPPAE
|
| |