Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2286 |
Symbol | |
ID | 3909667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2638678 |
End bp | 2642880 |
Gene Length | 4203 bp |
Protein Length | 1400 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637884183 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_485902 |
Protein GI | 86749406 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.604145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAAG AAATCATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG ATCCGGATCT CGATCGCGTC GCCGGAGAAG ATTCTGTCGT GGTCCTACGG CGAGATCAAG AAGCCGGAAA CCATCAATTA CCGCACCTTC AAGCCCGAGC GCGACGGCCT GTTCTGCGCC CGCATCTTCG GGCCGATCAA GGATTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CGCTGTCGCG CGTCCGACGC GAGCGCATGG GCCACATCGA GCTGGCGGCA CCGGTCGCCC ACATCTGGTT CCTGAAGTCC TTGCCGTCGC GGATCGGGCA GTTGCTCGAC ATGACGCTGA AGGACCTCGA GCGGATTCTG TACTTCGAAT ACTACGTCGT GCTGGAGCCG GGCCTGACCG ACCTCAAGGA GCGTCAGCTG CTGTCGGAGG AGGAGTACCT CCGCGCCCAG GATCAGTACG GCCAGGACAG CTTCACCGCC ATGATCGGCG CCGAAGCGAT CCGCGAATTG CTGAAGGGAC TCGAGCTCGA GAAGATCGAT GCGCAGCTGC GCGTCGAGAT GGCCGAGACC GACTCCGACA TCAAGCACAA GAAGCTCGCC AAGCGGCTGA AGATCGTCGA GGCGTTCCGC TTCTCCGGCA ACAAGCCGGA GTGGATGATC CTGACCGTGG TGCCGGTGAT TCCGCCGGAC CTGCGGCCGC TGGTGCCGCT CGACGGCGGC CGGTTCGCGA CCTCGGATCT GAACGACCTG TATCGCCGCG TCATCAACCG TAACAACCGG TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGGATG CTGCAGGAAG CCGTCGACGC GTTGTTCGAC AACGGCCGCC GCGGCCGCGT CATCACCGGC GCCAACAAGC GTCCGTTGAA GTCGCTCGCC GACATGCTGA AGGGCAAGCA GGGTCGGTTC CGTCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCGGGCC GTTCGGTCAT CGTCGTCGGC CCCGAGCTGA AGCTGCACCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGTCTGTCGA CCACGGTGAA GCAGGCGAAG AAGCTCGTCG AAAAGGAGCG TCCCGAGGTC TGGGACATCC TCGACGAAGT CATTCGCGAG CATCCGGTGC TGCTGAACCG CGCCCCGACG CTGCATCGCC TCGGCATCCA GGCGTTCGAG CCGGTGCTGA TCGAGGGCAA GGCGATCCAG CTTCACCCGC TGGTCTGCGC CGCGTTCAAC GCCGATTTCG ACGGCGACCA GATGGCCGTG CACGTCCCGC TGTCGCTCGA AGCGCAGCTC GAAGCGCGCG TGCTGATGAT GTCGACCAAC AACATCCTGC ATCCGGCGAA CGGCCAGCCG ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ACCTGTCGAT CCTGCGGGAA GGGCTGCCGG GCGAGGGCAA GGTGTTCGGC GACCTCGCCG AGCTGGAGCA CGCGCTGTTC TCCAAGGTCA TCCACCTGCA CACCAAGATC AAATATCGCT GGGACTCGCT CGACGACGAG GGCAAGCCGT ACCAGCGGCT GATCGAGACC ACCGCCGGCC GCATCCTGCT CGGCCAGGTT CTGCCGAAAT CGGTGAAGCT GCCGTTCGAG GTCATCAACA AGCTGATGAC CAAGCGCGAG ATCTCCAGCG TGATCGATCA GGTCTATCGC CACTGCGGCC AGAAGGAGAC GGTGATCTTC TGCGACCGCA TCATGGCGCT GGGCTTCTTC AACGCGTTCA AGGCGGGCAT CTCGTTCGGC AAGGACGACA TGGTCGTGCC GGCCTCGAAG TGGAAGATCG TCGACACCAC GCGTACGCTG GCGAAGGATT TCGAGCAGCA GTACAACGAC GGTCTGATCA CCCACGGCGA GAAGTACAAC AAGGTGGTCG ACGCCTGGTC GAAGGCCACC GAGGAAATCG CCAAGGAGAT GATGAAGGAG ATCTCCGCGG TCCGGAAGAA CGCATCCGGC GCGGAGACCC AGGTCAACTC GATCTACATG ATGGCCCATT CCGGCGCGCG TGGTTCGCCC GCCCAGATGC GTCAGCTCGC CGGCATGCGC GGCCTGATGG CCAAGCCGTC GGGTGAGATC ATCGAGACGC CGATCATCTC GAACTTCAAG GAAGGCCTCT CGGTTCTCGA GTACTTCAAC TCGACCCACG GCGCCCGTAA GGGCCTCGCG GACACCGCGT TGAAGACCGC GAACTCCGGC TACCTGACCC GCCGTCTGGT CGACGTGGCG CAGGACTGCA TCATCACCCA GGATGATTGC GGCACCTCGC TCGGCATCAA GATGCGGGCG ATCATCGACG CCGGCACCGT GGTGGCGTCG CTCGGCTCGC GCATCCTCGG CCGCACCGCG GGCGAAGACG TGCGCGATCC GCAGACCAAC GAGGTGATCG TCAAGAAGGG CCAGCTGATG GAGGAGCGCG ACGTCGAGGC GATCCACCAG GCCGGCGTCC AGGAAGTGAA GATCCGCTCG GCGCTGACCT GCGAACTGGT CAACGGCATC TGCGGCAAGT GCTACGGGCG CGATCTCGCC CGCGGCACCC CGGTCAACCA CGGCGAGGCG GTCGGTGTCA TCGCCGCGCA GTCGATCGGC GAACCCGGCA CGCAGCTGAC GATGCGTACC TTCCACATCG GCGGCGCGGC GCAGATCAAC GAGCAGTCGG TGATCGAGTC GAACTTCGAG GGTAAGGTCG TCATCAAGAA CAAGGCGATC GCCCGCAACG GCGAGAACCA CAGCGTCGCG ATGGTTCGCA ACATGGTGGT TGCGATCGTC GATCCGGACG GCACCGAACG GGCGACCCAC CGCATCCAGT ACGGCGCGCG CATGCACGTC GACGAGGGCG ACACGGTGAA GCGCGGCCAG CGCATCGCCG AGTGGGATCC GTACAGCCGC CCGGTGCTGA CCGAGGTCGA GGGCACGATC GATTTCGAGG ATCTGGTCGA AGACCAGTCG ATCTCGGAAA CGCTCGACGA ATCCACCGGC ATCGCCAAGC GTATCGTCAT CGACTGGCGC TCGACCCGCG GCGGCGCCGA TCTGCGCCCG GCGATCGTGG TCAAGGGCAA GGACGGCAAG GTGCTGAAGC TGGCGCGCGG CGGTGACGCC CGCTACATGC TGTCGGTCGA CGCCATTCTG TCGGTCGACG TCGGCGCCAA GGTGAAGCCG GGCGACATCC TCGCGCGTAT CTCGACCGAG AGCGCGAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGCG TCGCCGAACT GTTCGAGGCC CGCAAGCCGA AGGACGCGGC GATCATCGCG GAAATCGCCG GCACCATCCG GTTCGGCCGC GACTACAAGA ACAAGCGTCG GATCTCGATC GAACCGATGG ACAGCGAAGA AGAGGCGCGC GAGTACCTGA TCCCGAAGGG CAAGCACATC CACCTTCAGG ACGGCGACAT CGTGGAAAAG GGCGACTTCA TCGTCGAAGG CAACCCGGCG CCGCACGACA TCCTGGCGAT CAAGGGCATC GAGGAACTCG CTGCCTATCT GGTCAACGAA ATCCAGGAGG TCTACCGGCT CCAGGGCGTG TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GGTCGAGATC ACCGACCAGG GCGAGACCGA CATGATCTCG GGCGAACAGA TCGACAAGAT CGAGTTCGAC CAGCTCAACG CCAAGGCGCG CGACGAGGGC AAGAAGATCG CCACGGGAAC GCCGGTTCTG CTCGGCATCA CCAAGGCGAG CCTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG GAGACCACCC GCGTGCTCAC CGAAGCCGCC GTCAACGGCA AGGTGGACCC GCTGGAAGGC CTCAAGGAAA ACGTCATCGT CGGCCGGCTG ATCCCGGCCG GCACCGGCGC CTCGATGGCC AAGCTCCGCG AAGTCGCGAT GAAGCGGGAT CGCATGATCC TCGACGAACG CGAGAAGCAG GCGACCATCG TGCCGCCCGC GGCCCCGGAA GCCGAGCCGC TGGCGCTGCC GCCGGTCGAA TAA
|
Protein sequence | MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS LPSRIGQLLD MTLKDLERIL YFEYYVVLEP GLTDLKERQL LSEEEYLRAQ DQYGQDSFTA MIGAEAIREL LKGLELEKID AQLRVEMAET DSDIKHKKLA KRLKIVEAFR FSGNKPEWMI LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVIVVG PELKLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE HPVLLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCAAFN ADFDGDQMAV HVPLSLEAQL EARVLMMSTN NILHPANGQP IIVPSQDIVL GLYYLSILRE GLPGEGKVFG DLAELEHALF SKVIHLHTKI KYRWDSLDDE GKPYQRLIET TAGRILLGQV LPKSVKLPFE VINKLMTKRE ISSVIDQVYR HCGQKETVIF CDRIMALGFF NAFKAGISFG KDDMVVPASK WKIVDTTRTL AKDFEQQYND GLITHGEKYN KVVDAWSKAT EEIAKEMMKE ISAVRKNASG AETQVNSIYM MAHSGARGSP AQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA DTALKTANSG YLTRRLVDVA QDCIITQDDC GTSLGIKMRA IIDAGTVVAS LGSRILGRTA GEDVRDPQTN EVIVKKGQLM EERDVEAIHQ AGVQEVKIRS ALTCELVNGI CGKCYGRDLA RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSVIESNFE GKVVIKNKAI ARNGENHSVA MVRNMVVAIV DPDGTERATH RIQYGARMHV DEGDTVKRGQ RIAEWDPYSR PVLTEVEGTI DFEDLVEDQS ISETLDESTG IAKRIVIDWR STRGGADLRP AIVVKGKDGK VLKLARGGDA RYMLSVDAIL SVDVGAKVKP GDILARISTE SAKTRDITGG LPRVAELFEA RKPKDAAIIA EIAGTIRFGR DYKNKRRISI EPMDSEEEAR EYLIPKGKHI HLQDGDIVEK GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKVEI TDQGETDMIS GEQIDKIEFD QLNAKARDEG KKIATGTPVL LGITKASLQT RSFFSAASFQ ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KLREVAMKRD RMILDEREKQ ATIVPPAAPE AEPLALPPVE
|
| |