Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4142 |
Symbol | rpoH2 |
ID | 8014936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4227594 |
End bp | 4228457 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826712 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_002977922 |
Protein GI | 241206826 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000322567 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.133894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACA TGTCTGCAGA TCGGCGCATG ATCAAAATCG CGATGGCCGC GCCTTATCTC GCCCGTCAGG AAGAGCACGA TCTCGCCACC CGCTGGAAGG ATCATGACGA CCGCGGCGCG CGCAACCAGA TTGCCATGGC CCATATGCGC CTCGTCATAT CCATGGCCGG CAAGTTCCGC AATTTCGGTC TGCCGATGAG CGATCTGGTC CAGGAGGGCT ATGTCGGTCT GCTCGAGGCC GCCGCCCGCT TCGAGCCGGA ACGCGACGTG CGCTTTTCCA CCTATGCAAG TTGGTGGATC AGGGCATCTA TCCAGGATTA TATCCTGCGC AACTGGTCGA TCGTGCGCGG CGGTACGAGT TCGGCGCAAA AAGCGCTGTT CTTCAATCTG CGCCGTCTGC GCGCCAAGCT CGCCAAGGGA GACACGCAGC TGACGCTGCA ATCCATTCAC CAGGAAATCG CCGCGGCCCT CGGCGTCAGC CTCTCGGATG TACAGACGAT GGATGCTAGG CTTTCCGGCA ACGACGCCTC GCTGCAGGCG CCTTCGGTCT CCGGCGATGC CGAGAGCGCG GAAAAGATGG ACTTCCTCGT CAGCGACGAT CCCCTGCCGG ACGAGCAGGT GTCCAACATG ATCGACGGCG AGCGCCGCCG CGTCTGGCTC GCCTCGGCGC TGAAACATCT CAACGAACGC GAGATGAAGA TCATCAGCGC CCGGCGTCTG GCGGAAGACG GTGCCACGCT CGAAGAGCTC GGTGCCGATC TCGGTATTTC CAAGGAGCGC GTGCGTCAGA TCGAAAGCCG AGCGATGGAG AAGCTTCGCA GCGCGCTCGT CAGCGCCGAT CCGCATATGG CGGCCTACGC CTGA
|
Protein sequence | MKNMSADRRM IKIAMAAPYL ARQEEHDLAT RWKDHDDRGA RNQIAMAHMR LVISMAGKFR NFGLPMSDLV QEGYVGLLEA AARFEPERDV RFSTYASWWI RASIQDYILR NWSIVRGGTS SAQKALFFNL RRLRAKLAKG DTQLTLQSIH QEIAAALGVS LSDVQTMDAR LSGNDASLQA PSVSGDAESA EKMDFLVSDD PLPDEQVSNM IDGERRRVWL ASALKHLNER EMKIISARRL AEDGATLEEL GADLGISKER VRQIESRAME KLRSALVSAD PHMAAYA
|
| |