Gene RPD_0360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0360 
Symbol 
ID4020826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp426165 
End bp427064 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content65% 
IMG OID637960545 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_567499 
Protein GI91974840 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000866831 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCCGTG CAGCTACGCT ACCCGTTCTC AACGGTGAAT CCGGACTCGC TCGGTACCTC 
GCGGAAATCC GCAAGTTCCC GATGCTCGAG CCCCAGCAGG AATACATGTT CGCCAAGCGC
TGGCGCGAAC ACGACGATCG CGACGCCGCG CATCATCTCG TCACCAGCCA TCTGCGGCTC
GTGGCCAAGA TCGCCATGGG CTATCGCGGC TATGGTCTGC CGATTTCCGA AGTTGTCTCG
GAAGGCAATG TCGGTCTGAT GCAGGCGGTG AAGCGGTTCG AGCCCGACAA GGGCTTCCGC
CTCGCCACCT ACGCGATGTG GTGGATCAAG GCGTCGATTC AAGAGTACAT TCTGCGTTCG
TGGTCGCTCG TGAAGATGGG CACCACCGCG AACCAGAAGA AGCTGTTCTT CAATCTGCGC
AAGGCCAAGA GCAAGATCTC GGCGCTGGAC GAGGGTGATA TGCACCCCGA CCAGGTCAAG
CTGATCGCCA CCCGCCTCGG CGTCACCGAG CAGGACGTGA TCGACATGAA CCGCCGCCTC
GGCGGCGACG CGTCGCTCAA CGCTCCGATC CGGGACGACG GCGAGCCCGG CGAATGGCAG
GACTGGCTGG TCGACCAGTC GCCGAGCCAG GAAGCCGTGA TGGCCGAGCA CGAAGAGCTC
GATCATCGCC GCGCCGCGCT GAACGGCGCG ATCGGCGTGC TCAACCCGCG CGAACGGCGC
ATCTTCGAGG CGCGCCGCCT CGCCGACGAG CCGATGACAC TGGAAGATCT CGCCGCCGAG
TTCGGCGTCT CGCGCGAGCG GGTGCGTCAG ATCGAGGTGC GCGCCTTCGA AAAGGTGCAG
AGCGCGGTGA AGGGCACCAT CGCGCGTCAG GAGCAGGCGG CTCTCGAGGC CGCCCACTGA
 
Protein sequence
MARAATLPVL NGESGLARYL AEIRKFPMLE PQQEYMFAKR WREHDDRDAA HHLVTSHLRL 
VAKIAMGYRG YGLPISEVVS EGNVGLMQAV KRFEPDKGFR LATYAMWWIK ASIQEYILRS
WSLVKMGTTA NQKKLFFNLR KAKSKISALD EGDMHPDQVK LIATRLGVTE QDVIDMNRRL
GGDASLNAPI RDDGEPGEWQ DWLVDQSPSQ EAVMAEHEEL DHRRAALNGA IGVLNPRERR
IFEARRLADE PMTLEDLAAE FGVSRERVRQ IEVRAFEKVQ SAVKGTIARQ EQAALEAAH