Gene Mext_3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3881 
SymbolrpoH2 
ID5832292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4308481 
End bp4309353 
Gene Length873 bp 
Protein Length290 aa 
Translation table11 
GC content67% 
IMG OID641369671 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_001641324 
Protein GI163853281 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0609598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA TCGCGGGGAT GCGGCGTCAG TTCATTCGAA CAGCGATGGA GGCGCCGTTC 
CTTGCGAGGG ATGAAGAGCG TGGTTTGGCG GTGGCCTGGA AGGAAGCGCG CGACGAGCGC
GCCCTGCACC GTCTGATCTC CGCGCATATG CGCCTCGTGA TCGCGCTTGC CGGCCGGTTT
CGCCATTACG GCCTGCCGAT GGCCGATCTC GTGCAAGAGG GCCATGTCGG CCTGATGGAG
GCCGCCGCCC GGTTTGAGCC GGAGCGCGAA GTCCGCTTCT CCACCTACGC CACGTGGTGG
ATCCGCGCCT CGATCCAGGA CTACATCCTG CGCAACTGGT CGATCGTGCG CGGCGGAACG
AGTTCCGCGC AGAAGGCCCT GTTCTTCAAC CTGCGCCGCC TGCGCGCCCG GCTGATGCAA
TCGACCGAGG AGCAGGTCGG CTCTGAGATC CACGGACGGA TCGCCACTGC GATCGGCGTC
TCACGTGAGG ACGTAGCGCT CATGGATGCC CGCCTGTCGG GCCCCGACAT GTCGCTGAAC
GCTCCGGTCG GTGAGGAGAG CGAGGCGTCC TCCGAGCGCA TGGACTTCCT GGTCGACAAC
GCCGCCCTGC CCGATGAAAC GGTCTCGGCC CTGGTCGATG GCGAACGGCG GCTCATCTGG
CTGCGGCAGG CCCTGACGGT GCTCTCCGAA CGCGAGCTCC GGATCCTGCG GGAACGGCGC
CTCGCCGAGG ATCAGGCGAC CCTGGAAGCG CTGGGCCACC GCCTCGGCAT CTCCAAAGAG
CGCGTCCGGC AGATCGAGAA CCGCGCCCTG GAGAAGCTGC GTCGGGCGCT CGCCGAGAAG
TTCCCGCAGG CACCGAGCAG CGTCTACGCC TGA
 
Protein sequence
MAEIAGMRRQ FIRTAMEAPF LARDEERGLA VAWKEARDER ALHRLISAHM RLVIALAGRF 
RHYGLPMADL VQEGHVGLME AAARFEPERE VRFSTYATWW IRASIQDYIL RNWSIVRGGT
SSAQKALFFN LRRLRARLMQ STEEQVGSEI HGRIATAIGV SREDVALMDA RLSGPDMSLN
APVGEESEAS SERMDFLVDN AALPDETVSA LVDGERRLIW LRQALTVLSE RELRILRERR
LAEDQATLEA LGHRLGISKE RVRQIENRAL EKLRRALAEK FPQAPSSVYA