Gene Rpal_3687 details


Gene Information

Locus tag: Rpal_3687
Symbol: rpoB
ID: 6411363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3942983
End bp: 3947107
Gene Length: 4125 bp
Protein Length: 1374 aa
Translation table: 11
GC content: 64%
IMG OID: 642713567
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_001992662
Protein GI: 192292057
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR02013] DNA-directed RNA polymerase, beta subunit
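
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3942983
end_bp = 3947107
reported_gene_length = 4125     # bp
reported_protein_length = 1374  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```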


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.335052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCAGC AGACGTTCAC CGGTCGCAAA CGCGTTCGCA AGTTTTTCGG TCACATCCGG 
GAAGTCGCGG AGATGCCGAA CCTCATCGAG GTTCAGAAGG CATCTTACGA CCAGTTCCTG
ATGGTCGCCG AGCCTCCCGG AGGACGGCCG GACGAGGGCC TGCAGGCGGT GTTCCGGTCG
GTCTTCCCGA TTTCGGACTT CTCCAACGCC TCGATGCTCG AATTCGTTCG CTACGAATTC
GAGCCGCCGA AGTACGACGT CGACGAGTGC CGCCAGCGCG GCATGACCTA TGCTGCGCCG
CTGAAGGTGA CGCTGCGCCT CATCGTGTTC GATATCGACG AGGAAACCGG CGCCCGCTCC
GTGAAGGACA TCAAGGAGCA GGATGTCTAC ATGGGTGACA TCCCGTTGAT GACGATGAAT
GGTACCTTCA TCGTCAACGG TACCGAGCGC GTCATCGTCT CGCAGATGCA CCGGTCGCCG
GGTGTGTTCT TCGACCACGA TAAGGGCAAG ACCCACTCGT CGGGCAAGCT GCTGTTTGCC
GCCCGCATCA TCCCGTATCG CGGCTCCTGG CTCGACATCG AGTTCGACGC CAAGGACATC
GTCTATGCGC GTATCGACCG TCGCCGCAAG CTGCCGGTGA CGTCGCTGAT GTTCGCCCTC
GGCCTCGACG GCGAAGAGAT CCTGTCGACC TTCTACAACA AGATCCTCTA CAAGCGGACC
AAGGAAGGCT GGCGCGTTCC GTTCGACGTC AACCGCTTCC GTGGCTACTC GACCGTCAAC
GACCTGATCG ACGCCGACAC CGGCAAGGTC GTGCTCGAGG CCGGCAAGAA GCTGACTGTG
CGTGCAGCCC GTCAGCTGCA GGAAAAGGGC CTCAAGGCGT TGCGGATGTC CGACGAGGAG
CTCGTTGGCA ACTATCTGGC CGAGGATCTG GTCAACCCGA AGACGGGTGA GATCTATGCG
GAAGCCGGTG AGGAAATCAC CGACAAGACG CTGAAGATGT TGAACGAGCA GGGCTACAAG
GAGCTGCCGC TGCTCGACAT CGACCATGTC AACGTCGGCC CGTACATCCG CAACACGCTG
AACGCCGACA AGAACATGAC GCGCGAAGAC GCGCTGTTCG ACATCTACCG GGTGATGCGT
CCGGGCGAGC CGCCGACGCT CGACTCCGCG CAGAACATGT TCCAGTCGCT GTTCTTCGAC
GCCGAGCGCT ACGACCTGTC GGCCGTGGGC CGCGTCAAGA TGAACATGCG CCTCGACCTC
GATGCGCCGG ACACCCATCG CACGCTGCGC AAGGAAGACA TCCTGGCGGT GATCAAGACC
CTGGTCGGCC TGCGGGACGG CAAGGGCGAG ATCGACGACA TCGACCACCT TGGCAACCGC
CGTGTGCGTT CGGTCGGCGA GCTGATGGAG AACCAGTACC GCATCGGCCT GCTCCGCATG
GAGCGCGCCA TCAAGGAGCG GATGAGCTCG GTCGACATCG ACACCGTGAT GCCGCAGGAC
CTGATCAACG CCAAGCCGGC GGCGGCGGCG GTGCGCGAGT TCTTCGGCTC GTCGCAGCTC
TCGCAGTTCA TGGACCAGAC CAACCCGCTG TCGGAGATCA CCCACAAGCG GCGCCTGTCG
GCGCTCGGCC CGGGCGGTCT GACCCGTGAG CGTGCCGGCT TCGAAGTTCG CGACGTGCAT
CCGACCCACT ACGGCCGTAT CTGCCCGATC GAGACGCCGG AAGGTCCGAA CATCGGTCTG
ATCAACTCGC TGGCGACCTT CGCCCGCGTG AACAAGTACG GCTTCGTCGA GACTCCGTAC
CGCAAGGTCA AGGAAGGCCG CGTCACCGAC GAGGTGGTGT ATCTGTCGGC GATGGAAGAG
GGCCGTTACG CGGTGGCGCA GGCCAACGTG TCGCTCGATG CCAAGGGCAA GTTCACCGAC
GATCTGGTGG TCTGCCGCGC CGGCGGCACC CGCGACGTTG TGCCGATGCC GGCCGACCAG
GTCGACTACA TGGACGTGTC GCCGAAGCAG CTGGTCTCGG TCGCCGCGGC GCTGATCCCG
TTCCTCGAGA ACGACGACGC CAACCGCGCG CTGATGGGCT CGAACATGCA GCGCCAGGCG
GTGCCGCTGG TTCGCGCCGA GGCGCCGTTC GTCGGCACCG GCATGGAAGG CGTCGTCGCC
CGCGACTCGG GCGCGGCGAT CGCCGCGCGC CGCACCGGCA TCATCGACCA GATCGACGCG
ACGCGTATCG TCATCCGCGC CACTGAGGAT CTCGATCCGA CCAAGTCGGG CGTCGATATC
TACCGGCTGA TGAAGTACCA GCGCTCCAAC CAGTCGACCT GCATCAATCA GCGTCCGCTG
GTGAAGGTCG GTGACCACGT CAAGAAGGGC GACATCATCG CCGACGGTCC GTCGACCGAT
CTCGGTGAGC TCGCGCTCGG CCGCAACGTG CTCGTCGCGT TCATGCCGTG GAACGGCTAC
AACTTCGAAG ACTCGATCCT GCTCTCCGAG CGGATCGTGA AGGAAGACGT CTTCACCTCG
ATCCACATCG AGGAATTCGA GGTGATGGCC CGCGACACCA AGCTCGGCCC CGAGGAAATC
ACCCGCGATA TTCCGAACGT TTCGGAAGAA GCGCTGAAGA ACCTCGACGA AGCCGGCATC
GTCTACATCG GCGCCGAAGT CCGCGCTGGC GACATCCTGG TCGGCAAGAT CACGCCGAAG
GGCGAAAGCC CGATGACGCC GGAAGAGAAG CTGCTGCGCG CCATCTTCGG TGAAAAGGCC
TCGGACGTTC GCGACACCTC GCTGCGGGTG CCGCCGGGCG TGCAGGGCAC CATCGTCGAA
GTCCGCGTGT TCAACCGCCA CGGCGTCGAC AAGGACGAGC GTGCGCTGGC GATCGAGCGG
GAGGAGATCG AGCGCCTCGC CAAGGACCGC GACGACGAGC AGGCGATTCT CGACCGTAAC
GTTTACGGCC GTCTCGCCGA CCTGCTCGAC GGTCGTCAGG GCATTGCCGG TCCGAAGGGC
TTCAAGAAGG ACACCAAGAT CACCCGTGCG GTGCTCGAGG AGTATCCGAA GTCGCAGTGG
TGGCTGTTCG CTGCCCCGAA CGACAAGCTG ATGGCCGAAA TCGAGGCCAT GCGGAAGCAG
TACGACGAGT CGAAGAAGGG CCTCGAACAG CGCTTCCTCG ACAAGGTCGA GAAGCTGCAG
CGCGGCGACG AATTGCCGCC CGGCGTGATG AAGATGGTCA AGGTCTTCGT CGCGGTGAAG
CGCAAGATCC AGCCGGGCGA CAAGATGGCC GGCCGCCACG GCAACAAGGG TGTGGTGTCG
AAGATCGTTC CGATCGAGGA CATGCCGTTC CTCGAGGACG GCACTCACGC CGACATCGTG
CTGAACCCGC TCGGCGTGCC GAGCCGCATG AACGTCGGTC AGATCCTCGA GACGCATCTC
GGCTGGGCGT GCGCCGGTCT CGGCAAGCGC ATCGGTGAGA CGATCGACGC CTACTACCAG
AGCCAGGATC TCAAGCCGCT GCGCGAGACC CTGCGGAAGA TCTACGGCGA GGACGAGACC
ATCAAGTCGC TCGACGATGG TGAGCTGCTC GAACTCGGCC GCAATCTCAG CCACGGCGTG
CCGATTGCGA CCCCGGTGTT CGACGGTGCC AAGGAGGCCG ACATCGAAGA GATGCTGAAG
CTCGCGGGCT TCGACGCTTC GGGTCAGTCG ACTGTGTACG ACGGCCGCAC CGGCGATCAG
TTCGATCGTC GCGTCACCGT CGGCTACATC TACATGCTGA AGCTGCATCA CCTCGTGGAC
GACAAGATCC ACGCCCGGTC GATCGGTCCG TACTCGCTCG TCACCCAGCA GCCGCTGGGC
GGTAAGGCGC AGTTCGGCGG CCAGCGCTTC GGCGAAATGG AGGTGTGGGC GCTCGAAGCT
TACGGCGCGG CCTACACGCT GCAGGAAATG CTGACGGTGA AGTCGGACGA CGTCGCCGGC
CGCACCAAGG TGTACGAGGC GATCGTGCGC GGCGACGACA CGTTCGAGGC CGGTATTCCG
GAATCGTTCA ACGTGCTCGT CAAGGAAATG CGCTCGCTCG GCCTCAACGT CGACCTGCAC
AACTCCAAGC TGGCGGCGCC GCCTCCGGCC GAGGCTGCCG AGTAA
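
The gene sequence is laid out in blocks of ten bases, so it has to be joined into one string before a figure such as GC content can be computed from it. The following is a minimal sketch of that calculation; the helper names `clean_sequence` and `gc_content` are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not expected to equal the whole-gene value reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGGCGCAGC AGACGTTCAC CGGTCGCAAA CGCGTTCGCA AGTTTTTCGG TCACATCCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full sequence text to `clean_sequence` in the same way gives the input needed for a whole-gene figure.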
 
Protein sequence
MAQQTFTGRK RVRKFFGHIR EVAEMPNLIE VQKASYDQFL MVAEPPGGRP DEGLQAVFRS 
VFPISDFSNA SMLEFVRYEF EPPKYDVDEC RQRGMTYAAP LKVTLRLIVF DIDEETGARS
VKDIKEQDVY MGDIPLMTMN GTFIVNGTER VIVSQMHRSP GVFFDHDKGK THSSGKLLFA
ARIIPYRGSW LDIEFDAKDI VYARIDRRRK LPVTSLMFAL GLDGEEILST FYNKILYKRT
KEGWRVPFDV NRFRGYSTVN DLIDADTGKV VLEAGKKLTV RAARQLQEKG LKALRMSDEE
LVGNYLAEDL VNPKTGEIYA EAGEEITDKT LKMLNEQGYK ELPLLDIDHV NVGPYIRNTL
NADKNMTRED ALFDIYRVMR PGEPPTLDSA QNMFQSLFFD AERYDLSAVG RVKMNMRLDL
DAPDTHRTLR KEDILAVIKT LVGLRDGKGE IDDIDHLGNR RVRSVGELME NQYRIGLLRM
ERAIKERMSS VDIDTVMPQD LINAKPAAAA VREFFGSSQL SQFMDQTNPL SEITHKRRLS
ALGPGGLTRE RAGFEVRDVH PTHYGRICPI ETPEGPNIGL INSLATFARV NKYGFVETPY
RKVKEGRVTD EVVYLSAMEE GRYAVAQANV SLDAKGKFTD DLVVCRAGGT RDVVPMPADQ
VDYMDVSPKQ LVSVAAALIP FLENDDANRA LMGSNMQRQA VPLVRAEAPF VGTGMEGVVA
RDSGAAIAAR RTGIIDQIDA TRIVIRATED LDPTKSGVDI YRLMKYQRSN QSTCINQRPL
VKVGDHVKKG DIIADGPSTD LGELALGRNV LVAFMPWNGY NFEDSILLSE RIVKEDVFTS
IHIEEFEVMA RDTKLGPEEI TRDIPNVSEE ALKNLDEAGI VYIGAEVRAG DILVGKITPK
GESPMTPEEK LLRAIFGEKA SDVRDTSLRV PPGVQGTIVE VRVFNRHGVD KDERALAIER
EEIERLAKDR DDEQAILDRN VYGRLADLLD GRQGIAGPKG FKKDTKITRA VLEEYPKSQW
WLFAAPNDKL MAEIEAMRKQ YDESKKGLEQ RFLDKVEKLQ RGDELPPGVM KMVKVFVAVK
RKIQPGDKMA GRHGNKGVVS KIVPIEDMPF LEDGTHADIV LNPLGVPSRM NVGQILETHL
GWACAGLGKR IGETIDAYYQ SQDLKPLRET LRKIYGEDET IKSLDDGELL ELGRNLSHGV
PIATPVFDGA KEADIEEMLK LAGFDASGQS TVYDGRTGDQ FDRRVTVGYI YMLKLHHLVD
DKIHARSIGP YSLVTQQPLG GKAQFGGQRF GEMEVWALEA YGAAYTLQEM LTVKSDDVAG
RTKVYEAIVR GDDTFEAGIP ESFNVLVKEM RSLGLNVDLH NSKLAAPPPA EAAE
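
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). The function name `translate` and the table construction are illustrative, not taken from the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCGCAGC AGACGTTCAC CGGTCGCAAA"))  # MAQQTFTGRK
```

The printed residues match the first block of the protein sequence, and the last three codons of the gene (GCC GAG TAA) give the final residues AE followed by a stop.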