Gene RPB_0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0455 
Symbol 
ID3909800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp502023 
End bp502922 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content65% 
IMG OID637882342 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_484077 
Protein GI86747581 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.676672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.248951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGTG CAGCTACGCT ACCCGTTCTC AACGGTGAAT CCGGACTCGC TCGGTATCTC 
GCGGAAATCC GCAAGTTCCC GATGCTCGAG CCGCAACAGG AATACATGTT CGCCAAGCGC
TGGCGCGAGC ACGATGATCG CGACGCCGCG CATCACCTCG TCACCAGCCA TCTGCGGCTC
GTCGCCAAGA TCGCCATGGG CTATCGCGGC TACGGCCTGC CGATCTCCGA GGTCGTCTCG
GAAGGCAATG TCGGCCTGAT GCAGGCGGTG AAGCGGTTCG AGCCGGACAA AGGCTTCCGC
CTCGCCACCT ACGCGATGTG GTGGATCAAG GCGTCGATTC AAGAATACAT CCTGCGTTCG
TGGTCGCTCG TGAAGATGGG CACCACCGCG AACCAGAAGA AGCTGTTCTT CAATCTGCGC
AAGGCGAAGA GCAAGATCTC GGCGCTGGAC GAGGGTGATA TGCACCCCGA CCAGGTCAAG
CTGATCGCCA AGCGGCTCGG CGTCACCGAG CAGGACGTGA TCGACATGAA TCGCCGCCTC
GGTGGCGACG CGTCGCTCAA CGCCCCGATC CGCGACGACG GCGAGCCCGG CGAATGGCAG
GACTGGCTGG TCGACCAGTC GCCGAATCAG GAAGCCGTGA TGGCCGAGCA CGAGGAGCTC
GATCATCGCC GCGCCGCGCT GAACGGTGCG ATCGGCGTGC TCAACCCGCG CGAACGGCGG
ATCTTCGAGG CGCGCCGCCT CGCCGACGAG CCGATGACGC TGGAAGACCT CGCCGCCGAG
TTCGGCGTCT CGCGCGAGCG CGTCCGCCAG ATCGAGGTGC GTGCCTTCGA GAAGGTGCAG
AGCGCCGTCA AGGGCACCAT CGCGCGTCAG GAACAGGCGG CGCTCGAAGC CGCCCACTGA
 
Protein sequence
MARAATLPVL NGESGLARYL AEIRKFPMLE PQQEYMFAKR WREHDDRDAA HHLVTSHLRL 
VAKIAMGYRG YGLPISEVVS EGNVGLMQAV KRFEPDKGFR LATYAMWWIK ASIQEYILRS
WSLVKMGTTA NQKKLFFNLR KAKSKISALD EGDMHPDQVK LIAKRLGVTE QDVIDMNRRL
GGDASLNAPI RDDGEPGEWQ DWLVDQSPNQ EAVMAEHEEL DHRRAALNGA IGVLNPRERR
IFEARRLADE PMTLEDLAAE FGVSRERVRQ IEVRAFEKVQ SAVKGTIARQ EQAALEAAH