Gene TM1040_0456 details


Gene Information

Locus tag: TM1040_0456
Symbol: rpoH2
ID: 4078338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 473334
End bp: 474212
Gene Length: 879 bp
Protein Length: 292 aa
Translation table: 11
GC content: 61%
IMG OID: 638005752
Product: RNA polymerase factor sigma-32
Protein accession: YP_612451
Protein GI: 99080297
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
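The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with exactly one stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends with one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(473334, 474212)
    print(length)                  # 879
    print(protein_length(length))  # 292
```

Under these assumptions the computed values agree with the Gene Length (879 bp) and Protein Length (292 aa) fields of the record.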


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.369824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGG ACGAGACCAC CAAGAACCCC CTCCCGAAGC AGGCCATGAA AGCCGAGCTT 
CTGGATGCAG AGACGGAATT GGCGTTGGCC TACGCATGGC GCGATGAGAA GGACGTGCAG
GCGCTCCATC GTCTGATCAC AGCCTACATG CGGCTCGCAG TGTCGATGGC GTCAAAATTC
CGCCGCTATG GCGCCCCGAT GAATGATCTC ATCCAAGAGG CAGGGCTTGG CCTGATGAAG
GCGGCAGAGA AATTTGATCC CGACCGCGGC GTGCGGTTCT CGACCTACGC GGTCTGGTGG
ATCAAGGCGT CCATTCAGGA CTACGTGATG CGCAACTGGT CGATGGTGCG CACTGGCTCG
ACCTCCTCGC AGAAGTCCCT GTTCTTCAAC ATGCGCCGTG TGCAGGCGCA ACTCGAGCGC
GAGGCAGCCG GCGAAGGCGT GGAACTCGAC CGTCACAAGC TGATGCAGAT GGTTGCAGAA
GAGATCGGCG TGCCGCTGCG CGATGTCGAG ATGATGGACG GCCGCCTCGC GGGTGCTGAT
TTCTCGCTGA ACGCAGTGCA GTCGGCAGAC GAGGACGGGC GCGAGTGGAT CGACGCCTTG
GAAGATGACA GCGAACAGGC TGCGGACCGC GTGGAGGCCG ATCATGATCG TCGCCAGCTG
CGCGAGTGGC TGCTCTCGGC GCTGAATGGT CTCAATGAGC GCGAGCGCTT TATCGTGCGC
GAGCGCAAAC TGAGAGAAGA GGCGCGCACT CTGGAGAGCC TCGGCAATGA ATTGGGACTG
TCAAAAGAGC GCGTGCGTCA GCTGGAAGCC GCCGCCTTCT CAAAGATGAG AAAGAGCCTT
GAAGGTCAGT CCCGTGAGGT GCAGAACTTC CTTGCATGA
 
Protein sequence
MALDETTKNP LPKQAMKAEL LDAETELALA YAWRDEKDVQ ALHRLITAYM RLAVSMASKF 
RRYGAPMNDL IQEAGLGLMK AAEKFDPDRG VRFSTYAVWW IKASIQDYVM RNWSMVRTGS
TSSQKSLFFN MRRVQAQLER EAAGEGVELD RHKLMQMVAE EIGVPLRDVE MMDGRLAGAD
FSLNAVQSAD EDGREWIDAL EDDSEQAADR VEADHDRRQL REWLLSALNG LNERERFIVR
ERKLREEART LESLGNELGL SKERVRQLEA AAFSKMRKSL EGQSREVQNF LA
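The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11 (which match the standard genetic code; the tables differ only in permitted start codons) and strips the spaces used to group the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments are
# those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = ("ATGGCACTGG ACGAGACCAC CAAGAACCCC "
                  "CTCCCGAAGC AGGCCATGAA AGCCGAGCTT")
    print(translate(first_line))   # MALDETTKNPLPKQAMKAEL
    # Last nine bases of the gene sequence, ending in the TGA stop codon.
    print(translate("CTTGCATGA"))  # LA
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the reported GC content.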