Gene Msil_2307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2307 
Symbol 
ID7090291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2500914 
End bp2501825 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content62% 
IMG OID643465630 
ProductRNA polymerase, sigma 32 subunit, RpoH 
Protein accessionYP_002362600 
Protein GI217978453 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00328793 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCTGCTG CGCTGCCAAT GATCTCGGGT GAGAGTGGTC TCGCCCGTTA TTTGAACGAA 
ATCAGACGGT TCCCCATGCT GGAGCCGCAA CAGGAATATA TGCTGGCCAA GCGCTGGCGC
GAGCACGCCG ATTCCGACGC CGCGCATAAG CTTGTCACGT CCCACCTTCG CCTCGTCGCC
AAGATCGCGA TGGGCTATCG CGGCTATGGC CTGCCGATCA GCGAAGTCGT CTCGGAAGGC
AATGTCGGTC TTATGCAGGC CGTCAAGCGC TTTGAACCCG AGAAAGGGTT CCGCCTAGCC
ACCTATGCCA TGTGGTGGAT CCGCGCGTCG ATTCAAGAGT ATATCCTGCG CTCGTGGTCG
CTTGTGAAGA TGGGCACCAC CGCCAGCCAG AAGAAACTCT TCTTCAATCT TCGCAAGGTG
AAGAGCCAGA TCTCGGCGCT GGAAGAGGGC GATCTGCGTC CCGAGCACGT CGACAAGATC
GCGCACCGGC TTGGCGTGTC CAAGCAGGAC GTGATCGACA TGAACCGCCG CATGTCCGGC
GACGCCTCGC TGAACGCTCC TTTGCGCGAG GAAGGCGAAG GCGAATGGCA GGATTGGCTT
GTCGATGACA GCGCCAGTCA GGAAAAACTG CTGGTCGACC GCGAAGAGAC GGACAATCGG
CTCGGCGCCC TGCATACGGC TCTGAACGTG CTGAACGACC GCGAGCGGCG CATTTTCGAG
GCGCGCCGCC TTGCCGACGA TCCGATGACG CTGGAGGCTC TCTCCGACGA ATTCGACATC
TCGCGCGAGC GCGTCCGTCA GATCGAAGTT CGCGCCTTTG AAAAGGTGCA GTCGGCCGTC
AAGGCGGGCG TCGCCCGCGT CGAGGCGGGC GCGCGCAGAG CTCAGATCGC CGGTCCGGCC
GCGCAAGCCT GA
 
Protein sequence
MAAALPMISG ESGLARYLNE IRRFPMLEPQ QEYMLAKRWR EHADSDAAHK LVTSHLRLVA 
KIAMGYRGYG LPISEVVSEG NVGLMQAVKR FEPEKGFRLA TYAMWWIRAS IQEYILRSWS
LVKMGTTASQ KKLFFNLRKV KSQISALEEG DLRPEHVDKI AHRLGVSKQD VIDMNRRMSG
DASLNAPLRE EGEGEWQDWL VDDSASQEKL LVDREETDNR LGALHTALNV LNDRERRIFE
ARRLADDPMT LEALSDEFDI SRERVRQIEV RAFEKVQSAV KAGVARVEAG ARRAQIAGPA
AQA