Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0456 |
Symbol | rpoH2 |
ID | 4078338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 473334 |
End bp | 474212 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005752 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_612451 |
Protein GI | 99080297 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.369824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGG ACGAGACCAC CAAGAACCCC CTCCCGAAGC AGGCCATGAA AGCCGAGCTT CTGGATGCAG AGACGGAATT GGCGTTGGCC TACGCATGGC GCGATGAGAA GGACGTGCAG GCGCTCCATC GTCTGATCAC AGCCTACATG CGGCTCGCAG TGTCGATGGC GTCAAAATTC CGCCGCTATG GCGCCCCGAT GAATGATCTC ATCCAAGAGG CAGGGCTTGG CCTGATGAAG GCGGCAGAGA AATTTGATCC CGACCGCGGC GTGCGGTTCT CGACCTACGC GGTCTGGTGG ATCAAGGCGT CCATTCAGGA CTACGTGATG CGCAACTGGT CGATGGTGCG CACTGGCTCG ACCTCCTCGC AGAAGTCCCT GTTCTTCAAC ATGCGCCGTG TGCAGGCGCA ACTCGAGCGC GAGGCAGCCG GCGAAGGCGT GGAACTCGAC CGTCACAAGC TGATGCAGAT GGTTGCAGAA GAGATCGGCG TGCCGCTGCG CGATGTCGAG ATGATGGACG GCCGCCTCGC GGGTGCTGAT TTCTCGCTGA ACGCAGTGCA GTCGGCAGAC GAGGACGGGC GCGAGTGGAT CGACGCCTTG GAAGATGACA GCGAACAGGC TGCGGACCGC GTGGAGGCCG ATCATGATCG TCGCCAGCTG CGCGAGTGGC TGCTCTCGGC GCTGAATGGT CTCAATGAGC GCGAGCGCTT TATCGTGCGC GAGCGCAAAC TGAGAGAAGA GGCGCGCACT CTGGAGAGCC TCGGCAATGA ATTGGGACTG TCAAAAGAGC GCGTGCGTCA GCTGGAAGCC GCCGCCTTCT CAAAGATGAG AAAGAGCCTT GAAGGTCAGT CCCGTGAGGT GCAGAACTTC CTTGCATGA
|
Protein sequence | MALDETTKNP LPKQAMKAEL LDAETELALA YAWRDEKDVQ ALHRLITAYM RLAVSMASKF RRYGAPMNDL IQEAGLGLMK AAEKFDPDRG VRFSTYAVWW IKASIQDYVM RNWSMVRTGS TSSQKSLFFN MRRVQAQLER EAAGEGVELD RHKLMQMVAE EIGVPLRDVE MMDGRLAGAD FSLNAVQSAD EDGREWIDAL EDDSEQAADR VEADHDRRQL REWLLSALNG LNERERFIVR ERKLREEART LESLGNELGL SKERVRQLEA AAFSKMRKSL EGQSREVQNF LA
|
| |