Gene RPD_0178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0178 
Symbol 
ID4020635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp201922 
End bp203562 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content64% 
IMG OID637960356 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_567319 
Protein GI91974660 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.570253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCA CGCAACGCCT CGAATTTCGC CAATCGCAGT CGCTGGTGAT GACGCCGCAG 
CTGATGCAGG CGATCAAGCT GCTGCAACTG TCCAATTTGG ATCTCGCGGT GTTCGTCGAG
GACGAGCTCG AGAAGAACCC GCTGCTGGAC CGCGCCAGCG ACAACGCCGA ACCGCTGGTT
GCCGGCGAAG CGTCGATGGA CCGCGCCGAG AATGCGGGTG ACGAGTTCGG CGGCAGCGAG
GCCGGCGGCG AGGCATCGGA CTTCGCCGAC AACGCAGGCG GCGATTCCTT CGAGCCGGGC
AGCGAAGAAT GGATGCACCG CGATCTCGGC AGCCGCAGCG AGATCGAGCA GACACTCGAT
ACCGGCATGG AGAACGTGTT CCCGGAGGAG CCGGCCGAGG CTGCGGCCCG CGCCGCTCAG
GACGCGGCGC CGGCGTCATA TACCGAGTGG GGCGGCGGCG CCTCCAGCGA CGAGGGCTAC
AATCTCGAAG CCTTCGTTGC GGCCGAGACC TCGCTGGCCG ACCATCTCAG CGAGCAGCTC
GCAGTCGCGC TGTCCTCACC ATCGGAACGC ATGATCGGGC AATATCTGAT CGACCTCGTC
GACGATGCGG GCTACCTGCC AGCCGATCTC GGTGAGGCCG CCGAGCGTCT CGGAACGACC
CAGACCGAAG TCGAAGCCGT GGTCGGCGTG CTGCAAACCT TCGACCCGCC GGGCATCTGC
GCGCGTTCTT TGGCCGAATG CCTCGCCATT CAATTGCGTG AACTCGACCG GTTCGACCCG
GCGATGCAGG CGCTGATCGA GCACCTCGAT CTCTTGGCCA AGCGCGACGT CGTCAGCTTG
CGCAAGATTT GCGGCGTCGA TGACGAGGAC CTCGCCGACA TGATCGGCGA AATCCGTCAC
CTCGATCCGA AACCTGGCCT GAAGTTCAAC TCCTCTCGCG TGCAGACCGT CGTGCCCGAT
GTGTTCGTGC GCCCCGGCCC CGACGGAGGC TGGCTGGTCG AACTCAACAG CGACACGCTG
CCGAAGGTGC TGGTCAACCA GTCCTATTAC TCGGAGTTGT CGAAGACGAT CCGCAAGGAT
GGCGACAAAT CCTACTTCTC CGACTGCCTG CAGACCGCAA CCTGGCTGGT GCGCGCGCTC
GACCAGCGCG CTCGCACCAT TTTGAAGGTC GCGACCGAGA TCGTGCGCCA GCAGGACGGC
TTCTTCACCC ACGGCGTCGC GCATCTGCGA CCACTGAATC TGAAAGCGGT GGCCGACGCG
ATCCAGATGC ACGAGTCGAC GGTGTCGCGC GTGACCGCCA ACAAATATAT GGCGACCAAC
CGTGGCACGT TCGAACTGAA GTATTTCTTT ACCGCGTCAA TCGCCTCCGC CGATGGCGGC
GAGGCGCATT CCGCCGAAGC CGTTCGTCAT CACATCCGGC AATTGATCGA TGGCGAGGAT
CCGTCAGCAA TTCTGTCGGA CGACACGATC GTGGAGAGGC TGCGCGGAGC CGGCATCGAT
ATCGCACGTC GTACCGTCGC GAAATATCGC GAGGCGATGC GCATCCCCTC GTCGGTGCAA
CGGCGACGCG ACAAGCACAG CATGCTCGGC ACGGCCCTGA CGGCGCCAGC CGATCGGTCC
CGCGACACCG CTCCGGCTTG A
 
Protein sequence
MALTQRLEFR QSQSLVMTPQ LMQAIKLLQL SNLDLAVFVE DELEKNPLLD RASDNAEPLV 
AGEASMDRAE NAGDEFGGSE AGGEASDFAD NAGGDSFEPG SEEWMHRDLG SRSEIEQTLD
TGMENVFPEE PAEAAARAAQ DAAPASYTEW GGGASSDEGY NLEAFVAAET SLADHLSEQL
AVALSSPSER MIGQYLIDLV DDAGYLPADL GEAAERLGTT QTEVEAVVGV LQTFDPPGIC
ARSLAECLAI QLRELDRFDP AMQALIEHLD LLAKRDVVSL RKICGVDDED LADMIGEIRH
LDPKPGLKFN SSRVQTVVPD VFVRPGPDGG WLVELNSDTL PKVLVNQSYY SELSKTIRKD
GDKSYFSDCL QTATWLVRAL DQRARTILKV ATEIVRQQDG FFTHGVAHLR PLNLKAVADA
IQMHESTVSR VTANKYMATN RGTFELKYFF TASIASADGG EAHSAEAVRH HIRQLIDGED
PSAILSDDTI VERLRGAGID IARRTVAKYR EAMRIPSSVQ RRRDKHSMLG TALTAPADRS
RDTAPA