Gene Rpal_3686 details

Gene Information

Locus tag: Rpal_3686
Symbol:
ID: 6411362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3938638
End bp: 3942840
Gene Length: 4203 bp
Protein Length: 1400 aa
Translation table: 11
GC content: 63%
IMG OID: 642713566
Product: DNA-directed RNA polymerase subunit beta'
Protein accession: YP_001992661
Protein GI: 192292056
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID: [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form
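
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3938638, 3942840)
    print(length, protein_length(length))  # prints: 4203 1400
```

Under these assumptions the coordinates reproduce the listed 4203 bp, and 4203 / 3 = 1401 codons, one of which is the stop codon, gives the listed 1400 aa.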


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.152394
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAGG AAATTATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG 
ATCCGGATCT CGATCGCGTC GCCGGAGAAG ATTCTGTCGT GGTCGTACGG CGAGATCAAG
AAGCCGGAAA CGATCAACTA CCGCACCTTC AAGCCCGAGC GCGACGGCCT GTTCTGCGCC
CGCATCTTCG GGCCGATCAA GGACTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG
TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CTCTGTCGCG CGTCCGTCGC
GAGCGCATGG GCCACATCGA GCTGGCCGCG CCGGTCGCCC ACATCTGGTT CCTGAAGTCC
TTGCCGTCGC GTATCGGGCA GCTGCTCGAT ATGACGCTGA AGGACCTCGA GCGCATCCTG
TACTTCGAAT ACTACGTGGT GCTGGAGCCG GGCCTCACCG ACCTCAAGGA GCGTCAGCTC
CTGTCGGAGG AGGAGTACCT GCGCGCCCAG GATCAGTACG GCCAGGATTC GTTCACCGCC
ATGATCGGCG CCGAAGCGAT CCGTGAGCTG CTGAAGGGGC TCGAACTCGA AAAGATCGAC
GCGCAGCTGC GCGCCGAGAT GGCCGAGACC GACTCTGACA TCAAGCACAA GAAGCTCGCC
AAGCGCCTGA AGATCGTCGA AGCGTTCCGC TATTCCGGCA ACAAGCCGGA GTGGATGATC
CTCACGGTCG TGCCGGTGAT CCCGCCGGAT CTGCGCCCGC TGGTGCCGCT CGACGGCGGC
CGGTTCGCGA CCTCGGACCT CAACGACCTG TATCGCCGCG TCATCAACCG TAACAACCGC
TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGCATG
CTGCAGGAGG CTGTGGACGC GCTGTTCGAC AACGGTCGCC GCGGCCGCGT CATCACCGGC
GCCAACAAGC GTCCGCTGAA GTCGCTCGCC GATATGCTGA AGGGCAAGCA GGGCCGGTTC
CGTCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCGGGCC GTTCGGTGAT CGTGGTCGGT
CCCGAGCTCA AGCTGCATCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG
CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGTCTGTCGA CCACCGTCAA GCAGGCCAAG
AAGCTGGTCG AGAAGGAGCG GCCGGAGGTT TGGGACATCC TCGACGAGGT GATCCGCGAA
CATCCGGTGC TGCTAAACCG CGCCCCGACG CTGCATCGTC TCGGCATTCA GGCGTTCGAG
CCGGTGCTGA TCGAAGGCAA GGCGATCCAG CTGCATCCGC TGGTGTGCTC GGCGTTCAAC
GCCGACTTCG ACGGCGACCA GATGGCCGTG CACGTTCCGC TGTCGCTCGA AGCGCAGCTG
GAAGCGCGCG TCCTGATGAT GTCGACCAAC AACATCCTGC ATCCGGCGAA CGGCCAGCCG
ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ACCTGTCGAT CATGCGGGAA
GGCCTGCCGG GCGAGGGCAA AGTGTTTGCC GACCTCGCCG AGCTCGAGCA CGCGCTGTAC
TCCAAGGTCA TCCACCTCCA CACCAAGATC AAGTATCGCT GGCATTGGGT GAACGAGGAA
GGCGAGAACA CCGTCCGTCT GCTGGAGACC ACCGCCGGCC GCATCCTGCT TGGGCAGGTG
CTGCCGAAGT CGCCGAAGCT GCCGTTCGAC GTCATCAACA AGCTGATGAC CAAGCGCGAG
ATCTCCGGCG TCATCGACCA AGTCTATCGC CACTGCGGTC AGAAGGAGAC GGTGATCTTC
TGCGACCGGA TCATGGCGCT CGGCTTCTTC AACGCGTTCA AGGCCGGCAT CTCGTTCGGT
AAGGACGACA TGGTCGTGCC GGGCTCGAAG TGGAAGATCG TCGACTCGAC CCGTACGCTG
GCGAAGGACT TCGAGCAGCA GTACAACGAC GGCCTCATCA CCCACGGCGA GAAGTACAAC
AAGGTGGTCG ACGCCTGGTC GAAGGCCACC GAAGAAATCG CCAAGGAGAT GATGAAGGAG
ATCTCCGCGG TTCGGAAGGC GCCTGACGGC TCCGAACAGC AGGTCAACTC GATCTACATG
ATGGCCCACT CCGGTGCGCG TGGTTCGCCC GCGCAGATGC GTCAGCTCGC CGGTATGCGC
GGCCTGATGG CCAAGCCGTC GGGTGAAATC ATCGAGACGC CGATCATTTC CAACTTCAAG
GAAGGTCTGT CGGTTCTCGA GTACTTCAAC TCGACCCACG GCGCCCGTAA GGGTCTGGCC
GACACCGCGC TCAAGACCGC GAACTCGGGT TACCTGACCC GTCGTCTGGT CGACGTGGCG
CAGGACTGCA TCATCACGCA GGCTGACTGC GGCACCTCGC TCGGCATCAA GATGCGGGCG
ATCGTCGACG CCGGCACCGT GGTCGCCTCG CTGGGCAGCC GTATTCTCGG CCGCACCGCG
GGCGAGGACG TGCGCGACCC GGCCACCAAC GAGATCATCG TCAAGCGCGG TGATCTGATG
GAGGAGCGGG ACGTCGAGGC GATCCACCAG GCCGGCGTGC AGGAAGTGAA GATCCGCTCG
GCGCTGACCT GCGAGCTGGT CAACGGCATC TGCGGCAAGT GCTACGGGCG CGATCTTGCC
CGCGGTACTC CGGTCAACCA CGGCGAAGCG GTCGGCGTCA TCGCGGCGCA GTCGATCGGT
GAGCCGGGCA CCCAGCTGAC GATGCGTACC TTCCACATCG GCGGTGCGGC GCAGATCAAC
GAGCAGTCGG TGATCGAGTC GAACTTCGAC GGCAAGATCG TCATCAAGAA CCGCGCCATC
GCCCGTAACG GCGAAGGCCA CAATGTTGCG ATGGTCCGCA ACATGGTGAT CGCGATCGTC
GATCCGGACG GCACCGAGCG TGCGACCCAT CGCATCCAGT ACGGCGCGCG CGTGCACGTC
GACGAGGGCG ATATGGTCAA GCGTGGCCAG CGTATCGCCG AGTGGGATCC GTACACTCGT
CCGATCCTCA CCGAGGTCGA GGGTACCATC GACTTCGAAG ATCTGATCGA GGATCAGTCG
ATCTCCGAAA CGCTCGACGA GTCGACCGGT ATTGCCAAGC GTATCGTCAT CGATTGGCGC
TCGACCCGCG GCGGCGCGGA CCTGCGTCCG GCGATCGTGA TCAAGGGCAA GGATGGCAAG
GTGCTGAAGC TGGCGCGTGG CGGCGACGCC CGCTACATGC TCTCGGTCGA CGCCATTCTG
TCGGTCGACG TCGGCGCCCA GGTCAAGCCC GGCGACATCC TCGCGCGTAT CTCGACCGAG
AGCGCCAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGGG TGGCGGAGCT GTTCGAGGCG
CGGCGGCCGA AGGATGCGGC GATCATCGCC GAGATCGCCG GCACTATCCG GTTCGGTCGC
GACTACAAGA ACAAGCGCCG GCTCTCGATC GAGCCGCTCG ACAAGAACGA GGAAGCGCGC
GAGTACCTGA TCCCGAAGGG CAAGCACATC CACTTGCAGG ACGGCGACGT CGTCGAAAAG
GGCGACTTCA TCGTCGAGGG CAATCCGGCG CCGCACGACA TCCTGGCGAT CAAGGGCATC
GAGGAACTCG CGGCCTATCT CGTCAACGAA ATCCAGGAGG TCTATCGACT CCAGGGCGTG
TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GATCGAGATC
ACCGACCAGG GCGATACCGA CATGATCTCG GGCGAGCAGG TCGACAAGAT CGAGTTCAAC
GCGCTCAACG CCAAGGCGGT CGAGGAGGGC AAGAAGCCGG CAACCGGCAA TCCTGTGCTG
CTCGGCATCA CCAAGGCCAG CTTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG
GAGACCACCC GGGTGCTCAC CGAAGCCGCG GTCAACGGCA AGGTGGATCC GCTGGAAGGC
CTCAAGGAAA ACGTCATCGT CGGCCGGCTG ATCCCGGCGG GCACCGGCGC CTCGATGGCC
AAGATCCGCG AAGTGGCGGT GAAGCGCGAC CGGCTGATCC TCGACGAGCG CGAGAAGCAG
GCGGCGATCG TTCCGGCCGC TGCGCCGGAA GCCGAACCGC TGTCGCTGCC GCCGGCAGAG
TAA
 
Protein sequence
MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA 
RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS
LPSRIGQLLD MTLKDLERIL YFEYYVVLEP GLTDLKERQL LSEEEYLRAQ DQYGQDSFTA
MIGAEAIREL LKGLELEKID AQLRAEMAET DSDIKHKKLA KRLKIVEAFR YSGNKPEWMI
LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM
LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVIVVG
PELKLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE
HPVLLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCSAFN ADFDGDQMAV HVPLSLEAQL
EARVLMMSTN NILHPANGQP IIVPSQDIVL GLYYLSIMRE GLPGEGKVFA DLAELEHALY
SKVIHLHTKI KYRWHWVNEE GENTVRLLET TAGRILLGQV LPKSPKLPFD VINKLMTKRE
ISGVIDQVYR HCGQKETVIF CDRIMALGFF NAFKAGISFG KDDMVVPGSK WKIVDSTRTL
AKDFEQQYND GLITHGEKYN KVVDAWSKAT EEIAKEMMKE ISAVRKAPDG SEQQVNSIYM
MAHSGARGSP AQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA
DTALKTANSG YLTRRLVDVA QDCIITQADC GTSLGIKMRA IVDAGTVVAS LGSRILGRTA
GEDVRDPATN EIIVKRGDLM EERDVEAIHQ AGVQEVKIRS ALTCELVNGI CGKCYGRDLA
RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSVIESNFD GKIVIKNRAI
ARNGEGHNVA MVRNMVIAIV DPDGTERATH RIQYGARVHV DEGDMVKRGQ RIAEWDPYTR
PILTEVEGTI DFEDLIEDQS ISETLDESTG IAKRIVIDWR STRGGADLRP AIVIKGKDGK
VLKLARGGDA RYMLSVDAIL SVDVGAQVKP GDILARISTE SAKTRDITGG LPRVAELFEA
RRPKDAAIIA EIAGTIRFGR DYKNKRRLSI EPLDKNEEAR EYLIPKGKHI HLQDGDVVEK
GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKIEI
TDQGDTDMIS GEQVDKIEFN ALNAKAVEEG KKPATGNPVL LGITKASLQT RSFFSAASFQ
ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KIREVAVKRD RLILDEREKQ
AAIVPAAAPE AEPLSLPPAE
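
The gene and protein sequences above can be related with a short sketch that strips the 10-base block spacing, computes GC content, and translates codons. It is a minimal illustration, not the database's own method. The codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares for codon-to-residue assignments. The sketch does not handle alternative start codons, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence listed above.
    first_line = "ATGAACCAGG AAATTATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG"
    print(translate(first_line))  # prints: MNQEIMNLFNPTTPAQVFDQ
```

The translated prefix matches the first two 10-residue blocks of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.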