Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_4806 |
Symbol | rpoH2 |
ID | 6135725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5282744 |
End bp | 5283625 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641644943 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_001771570 |
Protein GI | 170742915 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0484047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000291044 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGAAA TCGCAGGAAT TCGTCGGCAG TTCGTCCGCA TGGCGATGGA CGCGCCCTTC CTCGAACGCG AGGAGGAGCG CGGGCTCGCG GTGCGGTGGA AGGACAAGCG CGACGAGCAG GCCCTGCACC GCCTGATCTC CGCCCATATG CGCCTCGTCA TCGCGCTGGC GGGCCGCTTC CGCCATTACG GCCTGCCCAT GGCCGACCTC GTCCAGGAGG GCCATGTCGG GCTGATGGAG GCGGCCGCCC GCTTCGAGCC GGAGCGGGAC GTCCGCTTCT CCACCTACGC GACGTGGTGG ATCCGGGCGT CGATCCAGGA CTACATCCTG CGCAACTGGT CGATCGTGCG GGGCGGCACC TCCTCGGCGC AGAAGGCCCT GTTCTTCAAC CTGCGGCGCC TGCGCGCCCG CCTGATGCAA TCGACCGACG AGCGGGTCGG CGACGAGATC CACCGGCGCA TCGCCACGGC GATCGGCGTC TCGCGGGAGG ACGTGGCGCT GATGGATGCG CGCCTGTCGG GCCCCGACAT GTCGCTCAAC GCCCCGATCA CCGACGAGGG CGACGCGTCG GCGGAGCGGG TGGATTTCCT GGTCGACACC TCGCCCCTGC CCGACGAGGC GGTCTCGGAC GCGGTGGACG GCGAGCGGCG CCTCACCTGG CTCAGGCAGG CCCTCACCGT CCTGTCGGAG CGCGAGCTGC GCATCCTGCA CGAGCGGCGG CTGGCCGAGG ATCAGGCGAC CCTGGAGGCC CTGGGCCACC GGCTCGGCAT CTCGAAGGAG CGCGTGCGCC AGATCGAGAA CCGCGCCCTG GAGAAGCTCC GGCGCGCGCT GGCCGAGCGC TTCCCGCAGC AGCCGGGCGG CGGCATGAGC GCGATCATCT GA
|
Protein sequence | MAEIAGIRRQ FVRMAMDAPF LEREEERGLA VRWKDKRDEQ ALHRLISAHM RLVIALAGRF RHYGLPMADL VQEGHVGLME AAARFEPERD VRFSTYATWW IRASIQDYIL RNWSIVRGGT SSAQKALFFN LRRLRARLMQ STDERVGDEI HRRIATAIGV SREDVALMDA RLSGPDMSLN APITDEGDAS AERVDFLVDT SPLPDEAVSD AVDGERRLTW LRQALTVLSE RELRILHERR LAEDQATLEA LGHRLGISKE RVRQIENRAL EKLRRALAER FPQQPGGGMS AII
|
| |