Gene Information |
Locus tag | RPC_3457 |
Symbol | |
ID | 3972125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3836056 |
End bp | 3840261 |
Gene Length | 4206 bp |
Protein Length | 1401 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637926568 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_533316 |
Protein GI | 90424946 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
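The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3836056
end_bp = 3840261
gene_length_bp = 4206
protein_length_aa = 1401

# Assumption: Start/End coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and its last
# codon is a stop codon that is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, remainder, coding_codons)  # 4206 0 1401
```

Under these assumptions the span matches the stated gene length, the length is a whole number of codons, and the codon count minus the stop matches the stated protein length.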
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.416565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAAG AAATTATGAA TCTGTTTAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG ATCCGCATTT CGATCGCGTC GCCGGAAAAG ATTCTGTCGT GGTCCTACGG CGAGATCAAG AAGCCGGAAA CCATCAATTA CCGCACCTTC AAGCCGGAGC GCGACGGGCT GTTCTGCGCC CGCATTTTCG GGCCGATCAA GGATTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CGTTGTCCCG CGTCCGTCGC GAGCGCATGG GCCACATCGA GCTCGCCGCC CCGGTCGCCC ACATCTGGTT CCTGAAATCC TTGCCGTCGC GGATCGGGCT TCTGCTCGAC ATGACGCTGA AGGATCTCGA ACGGATTCTG TATTTCGAAT ATTACGTCGT CCTCGAGCCG GGTCTGACCG CGCTGAAGGA TCGGCAACTG CTGTCCGAAG AAGAGTATCT GCGCGCGCAG GACGAATACG GCCAGGATTC CTTCACCGCC ATGATCGGCG CCGAAGCGAT CCGCGAGCTG TTGAAGGGCT TGGAGCTGGA GAAGCTCGAG GCCTCGCTGC GCGTCGAGAT GCAGGAAACC GAATCCGACA TCAAGCACAA GAAGCTCGCC AAGCGCTTGA AGATCGTCGA GGCGTTCCGC TTCTCCGGCA ACAAGCCGGA ATGGATGATC CTGACCGTGG TGCCGGTGAT TCCGCCGGAC CTGCGCCCCT TGGTGCCGCT CGACGGCGGC CGCTTTGCGA CCTCCGATCT GAATGATCTC TATCGCCGCG TCATCAACCG TAACAACCGC TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGCATG CTGCAGGAGG CCGTCGACGC ACTGTTCGAC AACGGTCGCC GCGGCCGCGT CATCACCGGC GCCAACAAGC GTCCGCTGAA GTCGCTCGCC GACATGCTGA AGGGCAAGCA GGGTCGGTTC CGGCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCCGGCC GTTCGGTGAT CACCGTGGGT CCGGAACTGC GGCTGCATCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGGCTGTCGA CCACGGTCAA GCAGGCCAAG AAGCTCGTGG AAAAAGAGCG TCCCGAGGTT TGGGACATCC TCGACGAGGT GATCCGCGAA CACCCGATCC TGTTGAACCG CGCGCCGACG CTGCATCGGC TCGGCATTCA GGCGTTCGAG CCGGTGTTGA TCGAGGGCAA GGCGATCCAG CTTCACCCGC TGGTTTGCGC CGCGTTCAAC GCCGACTTCG ACGGCGACCA GATGGCGGTG CACGTGCCGC TGTCGCTGGA AGCCCAGCTC GAAGCCCGCG TCTTGATGAT GTCGACCAAC AACATCCTGC ATCCGGCCAA TGGTCTGCCG ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ACCTGTCGAT CCTGCGTGAA GGATTGCCGG GCGAGGGCAA GCTGTTCGGC GAGGCCGCCG AGATCGAGCA CGCGCTGCAC GCCAAGGTCA TCCACCTGCA CACCAAGATC AAGTATCGCT GGGAAGGCCT CGACGAGAAT GGCAAGCAGG TCAGCCGCTG GTACGAGACC ACGGCCGGAC GCACCATGCT CGGCCAGGTG CTGCCGAAGT CGGTGAAGAT GCCGTTCGAC GTCATCAACA AGCTGATGAC CAAGAAGGAA 
ATCTCCGGCG TCATCGATCA GGTGTATCGC CACTGCGGTC AGAAAGAGAC CGTGATGTTC TGCGACCGCA TCATGGCGCT CGGCTTCTAC AACGCGTTCA AGGCCGGCAT TTCGTTCGGC AAGGACGACA TGGTGGTGCC GGCGTCGAAG TGGAAGACCG TCGAGGATAC CCGCACGCTC GCCAAGGAAT TCGAGCAGCA GTACAATGAC GGCCTGATCA CCCACGGCGA AAAGTACAAC AAGGTGGTCG ACGCCTGGTC GAAGTGCACC AAGAAGATCT CGGAAGACAT GATGACGGAA ATCTCCGCCG TCAAAAAGAA TCCGAAGGGC GGCGAAGCCC AGATCAACTC GATCTTCATG ATGTCGAACT CCGGCGCCCG TGGTTCGCAG GACCAGATGC GCCAGCTCGC CGGCATGCGC GGCCTGATGG CCAAGCCGTC GGGCGAGATC ATCGAGACGC CGATCATCTC GAACTTCAAG GAAGGCCTCT CGGTTCTCGA ATACTTCAAC TCGACCCACG GCGCCCGTAA GGGTCTGGCC GACACCGCGT TGAAGACCGC GAACTCCGGC TACCTGACGC GGCGTCTGGT CGACGTCGCG CAGGACTGCA TCATCACCGC GGACGATTGC GGCACCAAGC TCGGCATCAA GATGCGCGCC ATCATCGACG CCGGCACCGT GGTGGCGTCG TTGGCCTCGC GGATTCTCGG CCGCACCGCG GGCGAGGATC TGCGCGATCC GTTGACCAAC AAGGTGGTGG TGAAGCGCGG CACGCTGATG GAAGAGAGCC ACGTCGACGC GCTGCAGCAG GCCGGCATCC AGGAAGTGAA GATCCGCTCG GCTTTGACCT GCGAACTGGT CAACGGCATC TGCGGCAAGT GCTACGGGCG CGATCTCGCC CGCGGCACCC CGGTCAACCA CGGCGAGGCT GTCGGCGTCA TCGCCGCGCA GTCGATCGGC GAGCCCGGCA CCCAGCTGAC GATGCGCACG TTCCACATCG GCGGCGCGGC GCAGATCAAC GAGCAGTCGT TCATCGAGTC GAACTTCGAC GGCAAGGTGA CGATCAAGAA CAAGGCGATC GCCAAGAACG GCGAGGGCCA TCTGGTGGCG ATGGTGCGCA ACATGGTCGT TGCGGTCACC GACGCCGACG GCACCGAACG CGCCACCCAT CGCATCCAGT ACGGCGCGCG GATGCGCGTC GACGAAGGCG ACATGGTGAA GCGCGGCCAG CGCATCGCCG AGTGGGATCC CTACACCCGC CCGGTGCTGA CCGAAGTGGA AGGCATCATC GGCTTCGAGG ATCTGGTCGA AGGCCAGTCG ATCTCGGAGA CGCTGGACGA ATCCACCGGC ATCGCCAAGC GCGTGGTGAT CGACTGGCGC AGCCAGCGCG GTGGCGCTGA CCTGCGTCCG GCGATCGTCA TCAAGGGCAA GGACGGCAAG ATCCTCAAGC TCGCGCGTGG CGGCGAAGCC CGCTACATGC TGTCGGTCGA CGCCATTCTG TCGGTCGACG TCGGCGCCAA GGTGAAGACC GGCGACATCC TCGCCCGTAT CTCCACCGAA AGCGCCAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGGG TGGCGGAACT GTTCGAGGCC CGCAAGCCGA AGGACGCCGC CATCATCGCG GAAATCTCCG GCACCATCCG GTTCGGACGC GACTACAAGA ACAAGCGTCG GATTTCGATC GAGCCGGTGG ACACCACCGA GGAGACCCGC GAGTACTTGA TCCCGAAGGG CAAGCACATC CATCTGCAGG ACGGCGACAT CGTCGAAAAG GGCGATTTCA 
TCGTCGAAGG CAATCCGGCG CCGCACGACA TTCTGGCGAT CAAGGGCATC GAGGAACTCG CTGCCTATCT GGTCAACGAA ATCCAGGAGG TCTATCGGCT GCAGGGCGTG TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GGTCGAGGTC ACCGATCAGG GCGAGACCGA CATGATTTCG GGCGAGCAGA TCGACAAGAT CGAATTCGAC CAGATCAACG CCAAGGCCAA GGAAGAGGGC AAGAAGATCG CCACCGGCAC CCCGGTGCTG CTCGGCATCA CCAAGGCCTC TCTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG GAGACCACCC GCGTGCTCAC CGAAGCCGCC GTCAACGGCA AGGTGGATCC GCTGGAAGGC CTCAAGGAGA ACGTCATCGT CGGCCGGCTG ATCCCGGCGG GCACCGGCGC CTCGATGGCC AAGATCCGCG AAGTGGCGGT GAAGCGCGAC AAGCTGATTC TCGACGAGCG CGAGAAGCAG GCGACCATCG TGCCGAGCGC CCCGGAACCG GAACCGCTGG CGCTGCCGAC CCCCGAACAG AGCTAA
|
Protein sequence | MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS LPSRIGLLLD MTLKDLERIL YFEYYVVLEP GLTALKDRQL LSEEEYLRAQ DEYGQDSFTA MIGAEAIREL LKGLELEKLE ASLRVEMQET ESDIKHKKLA KRLKIVEAFR FSGNKPEWMI LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVITVG PELRLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE HPILLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCAAFN ADFDGDQMAV HVPLSLEAQL EARVLMMSTN NILHPANGLP IIVPSQDIVL GLYYLSILRE GLPGEGKLFG EAAEIEHALH AKVIHLHTKI KYRWEGLDEN GKQVSRWYET TAGRTMLGQV LPKSVKMPFD VINKLMTKKE ISGVIDQVYR HCGQKETVMF CDRIMALGFY NAFKAGISFG KDDMVVPASK WKTVEDTRTL AKEFEQQYND GLITHGEKYN KVVDAWSKCT KKISEDMMTE ISAVKKNPKG GEAQINSIFM MSNSGARGSQ DQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA DTALKTANSG YLTRRLVDVA QDCIITADDC GTKLGIKMRA IIDAGTVVAS LASRILGRTA GEDLRDPLTN KVVVKRGTLM EESHVDALQQ AGIQEVKIRS ALTCELVNGI CGKCYGRDLA RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSFIESNFD GKVTIKNKAI AKNGEGHLVA MVRNMVVAVT DADGTERATH RIQYGARMRV DEGDMVKRGQ RIAEWDPYTR PVLTEVEGII GFEDLVEGQS ISETLDESTG IAKRVVIDWR SQRGGADLRP AIVIKGKDGK ILKLARGGEA RYMLSVDAIL SVDVGAKVKT GDILARISTE SAKTRDITGG LPRVAELFEA RKPKDAAIIA EISGTIRFGR DYKNKRRISI EPVDTTEETR EYLIPKGKHI HLQDGDIVEK GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKVEV TDQGETDMIS GEQIDKIEFD QINAKAKEEG KKIATGTPVL LGITKASLQT RSFFSAASFQ ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KIREVAVKRD KLILDEREKQ ATIVPSAPEP EPLALPTPEQ S
|
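As a quick sanity check on the two sequence fields, the first 30 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the method used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard compact amino-acid string. Translation table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (assumption: the
# amino-acid assignments of table 11 equal those of the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence shown above.
gene_start = "ATGAACCAAG AAATTATGAA TCTGTTTAAT"
print(translate(gene_start))  # MNQEIMNLFN
```

The same functions can be applied to the full gene sequence after pasting it in as a single string; if the record is internally consistent, `translate` should return the listed protein sequence with its spaces removed, and `gc_fraction` should be close to the stated GC content.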