Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3881 |
Symbol | rpoH2 |
ID | 5832292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4308481 |
End bp | 4309353 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369671 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_001641324 |
Protein GI | 163853281 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.382692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0609598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGA TCGCGGGGAT GCGGCGTCAG TTCATTCGAA CAGCGATGGA GGCGCCGTTC CTTGCGAGGG ATGAAGAGCG TGGTTTGGCG GTGGCCTGGA AGGAAGCGCG CGACGAGCGC GCCCTGCACC GTCTGATCTC CGCGCATATG CGCCTCGTGA TCGCGCTTGC CGGCCGGTTT CGCCATTACG GCCTGCCGAT GGCCGATCTC GTGCAAGAGG GCCATGTCGG CCTGATGGAG GCCGCCGCCC GGTTTGAGCC GGAGCGCGAA GTCCGCTTCT CCACCTACGC CACGTGGTGG ATCCGCGCCT CGATCCAGGA CTACATCCTG CGCAACTGGT CGATCGTGCG CGGCGGAACG AGTTCCGCGC AGAAGGCCCT GTTCTTCAAC CTGCGCCGCC TGCGCGCCCG GCTGATGCAA TCGACCGAGG AGCAGGTCGG CTCTGAGATC CACGGACGGA TCGCCACTGC GATCGGCGTC TCACGTGAGG ACGTAGCGCT CATGGATGCC CGCCTGTCGG GCCCCGACAT GTCGCTGAAC GCTCCGGTCG GTGAGGAGAG CGAGGCGTCC TCCGAGCGCA TGGACTTCCT GGTCGACAAC GCCGCCCTGC CCGATGAAAC GGTCTCGGCC CTGGTCGATG GCGAACGGCG GCTCATCTGG CTGCGGCAGG CCCTGACGGT GCTCTCCGAA CGCGAGCTCC GGATCCTGCG GGAACGGCGC CTCGCCGAGG ATCAGGCGAC CCTGGAAGCG CTGGGCCACC GCCTCGGCAT CTCCAAAGAG CGCGTCCGGC AGATCGAGAA CCGCGCCCTG GAGAAGCTGC GTCGGGCGCT CGCCGAGAAG TTCCCGCAGG CACCGAGCAG CGTCTACGCC TGA
|
Protein sequence | MAEIAGMRRQ FIRTAMEAPF LARDEERGLA VAWKEARDER ALHRLISAHM RLVIALAGRF RHYGLPMADL VQEGHVGLME AAARFEPERE VRFSTYATWW IRASIQDYIL RNWSIVRGGT SSAQKALFFN LRRLRARLMQ STEEQVGSEI HGRIATAIGV SREDVALMDA RLSGPDMSLN APVGEESEAS SERMDFLVDN AALPDETVSA LVDGERRLIW LRQALTVLSE RELRILRERR LAEDQATLEA LGHRLGISKE RVRQIENRAL EKLRRALAEK FPQAPSSVYA
|
| |