Gene Rsph17025_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2541 
Symbol 
ID5083994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2579147 
End bp2583394 
Gene Length4248 bp 
Protein Length1415 aa 
Translation table11 
GC content65% 
IMG OID640484103 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_001168734 
Protein GI146278575 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGG AACTCTCGAC CAATCCGTTC AACCCGGTTG CGCCGGTCAA GACCTTCGAC 
GAGATCAAGA TCTCGCTCGC CTCGCCGGAA CGGATCCTCT CGTGGTCCTA CGGCGAGATC
AAGAAGCCCG AGACCATCAA CTACCGGACC TTCAAGCCGG AGCGGGACGG CCTGTTCTGC
GCGCGGATCT TCGGGCCGAT CAAGGACTAC GAGTGCCTCT GCGGCAAGTA CAAGCGCATG
AAGTATCGCG GCGTCGTCTG CGAGAAGTGC GGCGTGGAAG TGACGCTCCA GAAGGTGCGG
CGCGAGCGGA TGGGCCATAT CGAGCTCGCC GCTCCGGTCG CGCACATCTG GTTCCTGAAG
TCGCTGCCGA GCCGCATCGG CCTCATGCTC GACATGACGC TGCGGGATCT TGAGCGCATC
CTGTATTTCG AGAACTACGT CGTCATCGAA CCGGGCCTGA CCGACCTCAC TTACGGCCAG
CTGATGACCG AGGAAGAGTT CCTCGACGCG CAGGACCAGT ACGGCGCCGA CGCCTTCACC
GCCAACATCG GCGCCGAGGC GATCCGCGAG ATGCTGTCGG CCATCGACCT CGAGCAGACC
GCCGAGACGC TCCGCGAGGA GTTGAAGGAG GCGACGGGGG AACTCAAGCC GAAGAAGATC
ATCAAGCGGC TGAAGATCGT TGAATCCTTC CTCGAGTCGG GCAACCGGCC GGAGTGGATG
ATCCTGACCG TGCTGCCGGT GATCCCGCCG GAACTGCGCC CGCTGGTCCC GCTGGACGGC
GGCCGGTTCG CCACGTCCGA CCTCAACGAC CTCTACCGTC GCGTGATCAA CCGGAACAAC
CGCCTCAAGC GGCTGATCGA GCTGCGCGCG CCGGACATCA TCGTGCGCAA CGAAAAGCGG
ATGCTGCAAG AGGCGGTTGA CGCGCTGTTC GACAACGGCC GTCGCGGCCG CGTCATCACG
GGCACCAACA AGCGCCCGCT GAAGTCGCTG TCGGACATGC TGAAGGGCAA GCAGGGCCGG
TTCCGCCAGA ACCTGCTCGG CAAGCGCGTG GACTTCTCGG GCCGTTCGGT CATCGTGACC
GGCCCCGAGC TGAAGCTGCA CCAGTGCGGC CTGCCGAAGA AGATGGCGCT CGAACTGTTC
AAGCCGTTCA TCTACTCGCG GCTGGAGGCG AAGGGGCTTT CCAGCACCGT GAAGCAGGCG
AAGAAGCTGG TCGAGAAGGA GCGTCCGGAG GTCTGGGACA TCCTCGACGA GGTGATCCGC
GAACATCCGG TGCTGCTGAA CCGTGCGCCG ACGCTGCACC GTCTGGGCAT CCAGGCGTTC
GAGCCGATCC TGATCGAAGG CAAGGCGATC CAGCTTCACC CGCTGGTCTG CTCGGCCTTC
AACGCCGACT TCGACGGTGA CCAGATGGCC GTCCACGTTC CGCTTTCGCT GGAAGCCCAG
CTGGAAGCGC GCGTGCTGAT GATGTCCACG AACAACGTGC TGTCGCCCGC CAACGGCGCA
CCGATCATCG TGCCGTCGCA GGACATGGTG CTCGGGCTCT ATTACACCAC GATGGAGCGC
CGCGGCATGA AGGGCGAGGG CATGGCCTTC TCGTCCGTCG AAGAGGTCGA ACATGCCCTT
GCCGCCGGCG AGGTGCACCT GCACGCGACC ATCACCGCCC GGATCAAGCA GATCGACGAC
GAGGGCAACG AGGTCGTCAA GCGCTACCAG ACCACTCCCG GCCGCCTGCG GCTGGGCAAC
CTGCTGCCGC TCAACGCCAA GGCCCCGTTC GAGCTGGTGA ACCGGCTCCT GCGGAAGAAG
GACGTGCAGA ACGTCATCGA CACCGTCTAC CGCTACTGCG GCCAGAAGGA GTCGGTGATC
TTCTGCGACC AGATCATGGG CATGGGCTTC CGCGAGGCGT TCAAGGCCGG CATCTCGTTC
GGCAAGGACG ACATGCTGAT CCCGGACACC AAGTGGCCGA TCGTGAACGA GGTTCGCGAT
CAGGTGAAGG AGTTCGAACA GCAGTACATG GACGGCCTGA TCACACAGGG CGAGAAGTAC
AACAAGGTCG TCGATGCCTG GTCCAAATGC TCGGACAAGG TGGCGGGCGA GATGATGGCC
GAAATCTCGG CGGTGCGCTA CGACGACGCC GGTGCCGAGA AAGAGCCGAA CTCGGTCTAT
ATGATGTCCC ACTCCGGCGC GCGGGGTTCG CCGGCGCAGA TGAAGCAGCT CGGCGGGATG
CGCGGCCTGA TGGCCAAGCC GAACGGCGAA ATCATCGAGA CGCCGATCAT CTCGAACTTC
AAGGAAGGTC TGACCGTTCT TGAATACTTC AACTCGACCC ACGGCGCCCG GAAGGGTCTG
GCCGACACCG CGCTCAAGAC GGCGAACTCG GGCTACCTGA CCCGCCGTCT GGTGGACGTG
GCGCAGGACT GCATCGTGCG CAGCCACGAC TGCGGCACCG AGAACGCGAT CACCGCCTCG
GCCGCGGTGA ATGATGGCGA AGTGGTCAGC CCGCTCTCCG AGCGCGTGCT GGGCCGTGTC
GCGGCCGAGG ACATCCTGGT CCCCGGCACT GACGAGGTCG TGGTCGCCCG CGGCGAGCTG
ATCGACGAGC GTCGCGCCGA TCTGATCGAC CAGGCGAACG TGGCGCTGGT GCGCATCCGC
AGCCCGCTGA CCTGCGAGGC CGAGGAAGGC GTCTGCGCCA TGTGCTACGG GCGCGACCTT
GCCCGCGGCA CGCTGGTCAA CATCGGCGAG GCGGTGGGCA TCATCGCCGC GCAGTCGATC
GGTGAACCGG GCACGCAGCT GACGATGCGG ACCTTCCACA TCGGCGGCAT CGCGCAGGGT
GGCCAGCAGT CGTTCCTTGA AGCGAGCCAG GAAGGCCGGA TCGAGTTCCG CAACCCGAAC
CTGCTCGAGA ACGCCAACGG CGAACAGATC GTCATGGGCC GCAACATGCA GCTCGCGATC
ATCGACGAGG CGGGGCAGGA GCGGGCGACG CACAAGCTGA CCTACGGCGC CAAGGTCCAT
GTGAAGGACG GCCAGTCGGT GAAGCGCGGC ACGCGGCTCT TCGAATGGGA CCCCTACACC
CTGCCGATCA TCGCCGAAAA GGGCGGTGTG GCGCGGTTCG TGGATCTCGT CTCGGGGATC
TCGGTGCGCG AGGATACCGA CGAAGCCACC GGCATGACCC AGAAGATCGT GTCGGACTGG
CGCTCGACCC CGAAGGGCGG CGACCTGAAG CCCGAGATCA TCATCATGGA CCCCGACACG
GGCAATCCGA TGCGGAACGA GGCGGGCAAC CCGATCTCGT ATCCGATGTC GGTGGAGGCC
ATCCTCTCGG TCGAGGACGG CCAGACCGTG CGGGCCGGCG ACGTGGTGGC GCGTATCCCG
CGCGAAGGTG CCCGGACCAA GGACATCACC GGGGGTCTTC CCCGCGTGGC GGAACTGTTC
GAGGCCCGTC GCCCGAAGGA TCACGCGATC ATCGCCGAGA ACGACGGCTA TGTGCGCTTC
GGCAAGGACT ACAAGAACAA GCGCCGCATC ACGATCGAGC CGGTGGACGA GACGCTGAAC
TCGGTCGAGT ACATGGTGCC CAAGGGCAAG CACATCCCGG TGCAGGAAGG CGACTTCGTG
CAGAAGGGTG ACTACATCAT GGACGGCAAC CCGGCTCCGC ACGACATCCT GCGGATCATG
GGGGTCGAGG CGCTGGCGAA CTACATGATC GACGAGGTGC AGGAGGTCTA CCGACTGCAG
GGCGTGAAGA TCAACGACAA GCACATCGAG GTGATCGTGC GGCAGATGCT GCAGAAATAC
GAGATCCTCG ATTCGGGCGA GACCACGCTG CTCAAGGGCG AGCATGTGGA CAAGGCCGAG
CTTGACGAGA CCAACGAGAA GGCGATCCAG CACGGCATGC GTCCGGCTCA TGCCGAACCG
ATCCTGCTCG GGATCACCAA GGCGTCGCTG CAGACCCGCA GCTTCATCTC GGCGGCCTCG
TTCCAGGAGA CGACGCGCGT GCTCACCGAA GCCTCGGTGC AGGGCAAGCG CGACAAGCTT
GTCGGTCTGA AGGAGAACGT GATCGTCGGC CGGCTGATCC CGGCCGGGAC GGGCGGGGCG
ACCTCGCGCG TCAAGAAGAT CGCGCACGAT CGCGACCAGA CCGTGATCGA CGCTCGCCGC
GCCGAAGCCG AGTCCGCTGC GGCGCTCGCC GCGCCCACGG ACGAGGTGAT CGACCTCGGC
CCCGAGGATT CGGGTCTGGT GGAAACGGTG GAGAGCCGCA AGGAGTGA
 
Protein sequence
MNQELSTNPF NPVAPVKTFD EIKISLASPE RILSWSYGEI KKPETINYRT FKPERDGLFC 
ARIFGPIKDY ECLCGKYKRM KYRGVVCEKC GVEVTLQKVR RERMGHIELA APVAHIWFLK
SLPSRIGLML DMTLRDLERI LYFENYVVIE PGLTDLTYGQ LMTEEEFLDA QDQYGADAFT
ANIGAEAIRE MLSAIDLEQT AETLREELKE ATGELKPKKI IKRLKIVESF LESGNRPEWM
ILTVLPVIPP ELRPLVPLDG GRFATSDLND LYRRVINRNN RLKRLIELRA PDIIVRNEKR
MLQEAVDALF DNGRRGRVIT GTNKRPLKSL SDMLKGKQGR FRQNLLGKRV DFSGRSVIVT
GPELKLHQCG LPKKMALELF KPFIYSRLEA KGLSSTVKQA KKLVEKERPE VWDILDEVIR
EHPVLLNRAP TLHRLGIQAF EPILIEGKAI QLHPLVCSAF NADFDGDQMA VHVPLSLEAQ
LEARVLMMST NNVLSPANGA PIIVPSQDMV LGLYYTTMER RGMKGEGMAF SSVEEVEHAL
AAGEVHLHAT ITARIKQIDD EGNEVVKRYQ TTPGRLRLGN LLPLNAKAPF ELVNRLLRKK
DVQNVIDTVY RYCGQKESVI FCDQIMGMGF REAFKAGISF GKDDMLIPDT KWPIVNEVRD
QVKEFEQQYM DGLITQGEKY NKVVDAWSKC SDKVAGEMMA EISAVRYDDA GAEKEPNSVY
MMSHSGARGS PAQMKQLGGM RGLMAKPNGE IIETPIISNF KEGLTVLEYF NSTHGARKGL
ADTALKTANS GYLTRRLVDV AQDCIVRSHD CGTENAITAS AAVNDGEVVS PLSERVLGRV
AAEDILVPGT DEVVVARGEL IDERRADLID QANVALVRIR SPLTCEAEEG VCAMCYGRDL
ARGTLVNIGE AVGIIAAQSI GEPGTQLTMR TFHIGGIAQG GQQSFLEASQ EGRIEFRNPN
LLENANGEQI VMGRNMQLAI IDEAGQERAT HKLTYGAKVH VKDGQSVKRG TRLFEWDPYT
LPIIAEKGGV ARFVDLVSGI SVREDTDEAT GMTQKIVSDW RSTPKGGDLK PEIIIMDPDT
GNPMRNEAGN PISYPMSVEA ILSVEDGQTV RAGDVVARIP REGARTKDIT GGLPRVAELF
EARRPKDHAI IAENDGYVRF GKDYKNKRRI TIEPVDETLN SVEYMVPKGK HIPVQEGDFV
QKGDYIMDGN PAPHDILRIM GVEALANYMI DEVQEVYRLQ GVKINDKHIE VIVRQMLQKY
EILDSGETTL LKGEHVDKAE LDETNEKAIQ HGMRPAHAEP ILLGITKASL QTRSFISAAS
FQETTRVLTE ASVQGKRDKL VGLKENVIVG RLIPAGTGGA TSRVKKIAHD RDQTVIDARR
AEAESAAALA APTDEVIDLG PEDSGLVETV ESRKE