Gene M446_4806 details

Gene Information

Locus tag: M446_4806
Symbol: rpoH2
ID: 6135725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5282744
End bp: 5283625
Gene Length: 882 bp
Protein Length: 293 aa
Translation table: 11
GC content: 71%
IMG OID: 641644943
Product: RNA polymerase factor sigma-32
Protein accession: YP_001771570
Protein GI: 170742915
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
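
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length_bp(5282744, 5283625)
print(length_bp)                     # 882
print(protein_length_aa(length_bp))  # 293
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.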


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0484047
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000291044
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGGAAA TCGCAGGAAT TCGTCGGCAG TTCGTCCGCA TGGCGATGGA CGCGCCCTTC 
CTCGAACGCG AGGAGGAGCG CGGGCTCGCG GTGCGGTGGA AGGACAAGCG CGACGAGCAG
GCCCTGCACC GCCTGATCTC CGCCCATATG CGCCTCGTCA TCGCGCTGGC GGGCCGCTTC
CGCCATTACG GCCTGCCCAT GGCCGACCTC GTCCAGGAGG GCCATGTCGG GCTGATGGAG
GCGGCCGCCC GCTTCGAGCC GGAGCGGGAC GTCCGCTTCT CCACCTACGC GACGTGGTGG
ATCCGGGCGT CGATCCAGGA CTACATCCTG CGCAACTGGT CGATCGTGCG GGGCGGCACC
TCCTCGGCGC AGAAGGCCCT GTTCTTCAAC CTGCGGCGCC TGCGCGCCCG CCTGATGCAA
TCGACCGACG AGCGGGTCGG CGACGAGATC CACCGGCGCA TCGCCACGGC GATCGGCGTC
TCGCGGGAGG ACGTGGCGCT GATGGATGCG CGCCTGTCGG GCCCCGACAT GTCGCTCAAC
GCCCCGATCA CCGACGAGGG CGACGCGTCG GCGGAGCGGG TGGATTTCCT GGTCGACACC
TCGCCCCTGC CCGACGAGGC GGTCTCGGAC GCGGTGGACG GCGAGCGGCG CCTCACCTGG
CTCAGGCAGG CCCTCACCGT CCTGTCGGAG CGCGAGCTGC GCATCCTGCA CGAGCGGCGG
CTGGCCGAGG ATCAGGCGAC CCTGGAGGCC CTGGGCCACC GGCTCGGCAT CTCGAAGGAG
CGCGTGCGCC AGATCGAGAA CCGCGCCCTG GAGAAGCTCC GGCGCGCGCT GGCCGAGCGC
TTCCCGCAGC AGCCGGGCGG CGGCATGAGC GCGATCATCT GA
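
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be stripped before any calculation. A minimal sketch of that cleanup, together with a GC fraction helper, is shown here; the function names are hypothetical, and how the record's GC content value was rounded is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10 bases).
first_line = "ATGGCGGAAA TCGCAGGAAT TCGTCGGCAG TTCGTCCGCA TGGCGATGGA CGCGCCCTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.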
 
Protein sequence
MAEIAGIRRQ FVRMAMDAPF LEREEERGLA VRWKDKRDEQ ALHRLISAHM RLVIALAGRF 
RHYGLPMADL VQEGHVGLME AAARFEPERD VRFSTYATWW IRASIQDYIL RNWSIVRGGT
SSAQKALFFN LRRLRARLMQ STDERVGDEI HRRIATAIGV SREDVALMDA RLSGPDMSLN
APITDEGDAS AERVDFLVDT SPLPDEAVSD AVDGERRLTW LRQALTVLSE RELRILHERR
LAEDQATLEA LGHRLGISKE RVRQIENRAL EKLRRALAER FPQQPGGGMS AII
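
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translator, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ only in which codons may act as starts). Alternative start codons are not handled, which does not matter here because this gene begins with ATG. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence, as printed above.
print(translate("ATGGCGGAAA TCGCAGGAAT TCGTCGGCAG"))  # MAEIAGIRRQ
print(translate("GCGATCATCT GA"))                     # AII
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.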