Gene Msil_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0159 
Symbol 
ID7090476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp154670 
End bp156220 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content62% 
IMG OID643463493 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_002360502 
Protein GI217976355 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.69641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTCA GCACGAAGCT CATGATGCGC CAGGGACAGT CCCTGGTGAT GACCCCGCAG 
CTGTTGCAGG CGATCAAGCT TCTGCAATTC TCCAACATGG AGCTCAACGC CTTCGTCGAG
GAGGAGCTTG AACGCAATCC CCTGCTCGAG CGCACCGATG ACGCGCCGGA TCCGCATGCC
TTGCTCGCTG ACGTGGAGCC CCTGTCGGAC GCCGCCGAAA GCGGCGGAAG CGTCGATTTC
AACGATCCGG GCGAGACGGA CTGGAGTTCG GAGTCGCTCG CCGTCGATCG CGGCGCGCTG
GAGGCCAGTC TCGGGACCGA GCTGAGCAAC GCTTTTGACG ATGACCGCAC CGCGCCAGCG
GCGGATTTTG CCGAAGGGGC CGGCCTGTCG GCGACATCCT GGACCGGCTC CTCGGCCGGG
CAGGGCGACG GCGAAGGCGC CAATCTCGAG GCCTATGCGG CGAACCCGAC CAATCTCAAG
GATCATCTCG AAGCTCAGCT GATGCTCGCC ACCTCCAACC CCGCCGAGCG AATGATCGGC
CTGATGCTGA TCGATTGCAT CGACGACGCC GGCTACTACG TCGACAATAT GGCCGAGACG
GCGGCCCGGC TGAAGACGCC GATCGCGCGC GTCGAGCGCG TGCTGTCGAT CATTCAGGGT
TTTGATCCCT CGGGCGTCGG CGCCCGCGAT TTGGCTGAAT GCCTCGCCAT TCAGCTGCGC
GAAAAAGATC GCTTCGACCC CGCGATGCAG GCGCTGGTCG CCAATCTTGG TCTGTTGGCC
AAGCGCGATT TCGCGGCCCT GCGCAAGATC TGCAATGTCG ACGAGGAGGA CGTCGCCGAC
ATGCTGGGGG AGATTCGCCA GCTCAATCCG AAGCCTGGCC GCGCTTTTGG CGGCGGCTCG
ATTCAGCCGC TCGTGCCCGA TGTCATTGTC CGCGCGGCGC CCGGCGGGGC ATGGCATGTC
GAACTCAACA CCGAGGTGCT GCCGCGCATT CTGGTCAACA ACAGCTATGT CGCGCGCGTC
ACGAAATCCC AGTCGAACGA TGTCGACAAG ACCTTCATGT CGACCTGCCT GCAAACCGCG
AACTGGCTGA CCAAGAGCCT CGAGCAGAGG GCGCGGACGA TTTTGAAAGT GTCGAGTGAA
ATCGTCCGCC AGCAGGACGC CTTCTTCCGC CAAGGCGTCG AACATCTGCG CCCGCTGAAT
CTCAAGACCA TCGCCGAAGC GATCGGAATG CATGAATCGA CCGTGTCGCG CGTCACCTCC
AACAAATATA TGGCGACCCC GCGGGGCCTG TTCGAGCTGA AATATTTCTT CACCGCCTCG
ATCGCCTCGA ACAATGGCGG CGACGCTCAT TCGGCGGAAT CGGTGCGCTT CCGCATCCGC
CACATGATTG AGCAGGAGAG CCCGACCGAC ATTCTGTCGG ACGATGCGAT CGTCGCCAAA
CTCAAGGACG TCAACATCGA CATTGCGAGG CGCACCGTCG CCAAATATCG CGAGAGCCTC
AAAATCCGCT CCTCGGTGGA GCGGCGGCGC GAAAAATCTC ACATGTATTA A
 
Protein sequence
MALSTKLMMR QGQSLVMTPQ LLQAIKLLQF SNMELNAFVE EELERNPLLE RTDDAPDPHA 
LLADVEPLSD AAESGGSVDF NDPGETDWSS ESLAVDRGAL EASLGTELSN AFDDDRTAPA
ADFAEGAGLS ATSWTGSSAG QGDGEGANLE AYAANPTNLK DHLEAQLMLA TSNPAERMIG
LMLIDCIDDA GYYVDNMAET AARLKTPIAR VERVLSIIQG FDPSGVGARD LAECLAIQLR
EKDRFDPAMQ ALVANLGLLA KRDFAALRKI CNVDEEDVAD MLGEIRQLNP KPGRAFGGGS
IQPLVPDVIV RAAPGGAWHV ELNTEVLPRI LVNNSYVARV TKSQSNDVDK TFMSTCLQTA
NWLTKSLEQR ARTILKVSSE IVRQQDAFFR QGVEHLRPLN LKTIAEAIGM HESTVSRVTS
NKYMATPRGL FELKYFFTAS IASNNGGDAH SAESVRFRIR HMIEQESPTD ILSDDAIVAK
LKDVNIDIAR RTVAKYRESL KIRSSVERRR EKSHMY