Gene Rsph17029_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2254 
SymbolrpoH2 
ID4896822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2387324 
End bp2388202 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content66% 
IMG OID640112848 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_001044129 
Protein GI126463015 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.142039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.411948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTGG ACGGATATAC CGATCAAACG ATGTCGCGCC AAGCGATGAG GGCGGAGCTT 
CTGGACGCAG AGACGGAGCT GCGCCTCGCC TATGCCTGGC GGGACCAGCG CGACGAGCGG
GCGCTCCACC GTCTGATCAC CGCCTACATG CGCCTTGCGA TCTCGATGGC CTCGAAATAC
CGCCGCTACG GCGCGCCGAT GAACGACCTG ATTCAGGAGG CGTCGCTCGG GCTGATGAAG
GCCGCCGACA AGTTCGACCC GGACCGCGGC GTGCGCTTCT CGACCTATGC CGTCTGGTGG
ATCAAGGCCT CGATCCAGGA TTACGTCATG CGGAACTGGT CTCTGGTGCG GACCGGCTCC
ACCTCCAGCC AGAAGGCGCT GTTCTTCAAC ATGCGGCGGG TGCAGGCCCG GCTCGAGCGC
GAGGCCTCGC AGCGGGGCGA GGCGCTCGAT GCGCATCAGC TGCGCGAGAT GGTGGCTCAT
GACGTGGGCG TGCCGCTCTC GGATGTCGAG ATGATGGAGG GGCGGCTCTC GGGCTCGGAC
TTCTCGCTCA ACGCCACCCA GTCGGCCGAT GACGAGGGGC GGGAGTGGAT CGACGCGCTC
GAGGACGATT CGGCACAGGC GGCCGAGGTG GTGGAGGGCT CGCTCGATGC CGCGCGGCTG
CGGGGTTGGC TGGTGTCGGC CATGCAGCAG CTGAACGCGC GCGAGCGGTT CATCGTGACC
GAACGCAAGC TGCGCGACGT GCCCCGGACG CTCGAAAGCC TCGGCGAAGA ACTGAAGCTT
TCCAAGGAGC GGGTGCGGCA GCTTGAAGCA GCGGCCTTTG CGAAGATGCG GCGGAGCCTT
GAAGCCCAGT CGCGGGAGGT GCATCACTTC CTCCTATGA
 
Protein sequence
MALDGYTDQT MSRQAMRAEL LDAETELRLA YAWRDQRDER ALHRLITAYM RLAISMASKY 
RRYGAPMNDL IQEASLGLMK AADKFDPDRG VRFSTYAVWW IKASIQDYVM RNWSLVRTGS
TSSQKALFFN MRRVQARLER EASQRGEALD AHQLREMVAH DVGVPLSDVE MMEGRLSGSD
FSLNATQSAD DEGREWIDAL EDDSAQAAEV VEGSLDAARL RGWLVSAMQQ LNARERFIVT
ERKLRDVPRT LESLGEELKL SKERVRQLEA AAFAKMRRSL EAQSREVHHF LL