Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_0159 |
Symbol | |
ID | 7090476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 154670 |
End bp | 156220 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643463493 |
Product | RNA polymerase, sigma 54 subunit, RpoN |
Protein accession | YP_002360502 |
Protein GI | 217976355 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.69641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTCA GCACGAAGCT CATGATGCGC CAGGGACAGT CCCTGGTGAT GACCCCGCAG CTGTTGCAGG CGATCAAGCT TCTGCAATTC TCCAACATGG AGCTCAACGC CTTCGTCGAG GAGGAGCTTG AACGCAATCC CCTGCTCGAG CGCACCGATG ACGCGCCGGA TCCGCATGCC TTGCTCGCTG ACGTGGAGCC CCTGTCGGAC GCCGCCGAAA GCGGCGGAAG CGTCGATTTC AACGATCCGG GCGAGACGGA CTGGAGTTCG GAGTCGCTCG CCGTCGATCG CGGCGCGCTG GAGGCCAGTC TCGGGACCGA GCTGAGCAAC GCTTTTGACG ATGACCGCAC CGCGCCAGCG GCGGATTTTG CCGAAGGGGC CGGCCTGTCG GCGACATCCT GGACCGGCTC CTCGGCCGGG CAGGGCGACG GCGAAGGCGC CAATCTCGAG GCCTATGCGG CGAACCCGAC CAATCTCAAG GATCATCTCG AAGCTCAGCT GATGCTCGCC ACCTCCAACC CCGCCGAGCG AATGATCGGC CTGATGCTGA TCGATTGCAT CGACGACGCC GGCTACTACG TCGACAATAT GGCCGAGACG GCGGCCCGGC TGAAGACGCC GATCGCGCGC GTCGAGCGCG TGCTGTCGAT CATTCAGGGT TTTGATCCCT CGGGCGTCGG CGCCCGCGAT TTGGCTGAAT GCCTCGCCAT TCAGCTGCGC GAAAAAGATC GCTTCGACCC CGCGATGCAG GCGCTGGTCG CCAATCTTGG TCTGTTGGCC AAGCGCGATT TCGCGGCCCT GCGCAAGATC TGCAATGTCG ACGAGGAGGA CGTCGCCGAC ATGCTGGGGG AGATTCGCCA GCTCAATCCG AAGCCTGGCC GCGCTTTTGG CGGCGGCTCG ATTCAGCCGC TCGTGCCCGA TGTCATTGTC CGCGCGGCGC CCGGCGGGGC ATGGCATGTC GAACTCAACA CCGAGGTGCT GCCGCGCATT CTGGTCAACA ACAGCTATGT CGCGCGCGTC ACGAAATCCC AGTCGAACGA TGTCGACAAG ACCTTCATGT CGACCTGCCT GCAAACCGCG AACTGGCTGA CCAAGAGCCT CGAGCAGAGG GCGCGGACGA TTTTGAAAGT GTCGAGTGAA ATCGTCCGCC AGCAGGACGC CTTCTTCCGC CAAGGCGTCG AACATCTGCG CCCGCTGAAT CTCAAGACCA TCGCCGAAGC GATCGGAATG CATGAATCGA CCGTGTCGCG CGTCACCTCC AACAAATATA TGGCGACCCC GCGGGGCCTG TTCGAGCTGA AATATTTCTT CACCGCCTCG ATCGCCTCGA ACAATGGCGG CGACGCTCAT TCGGCGGAAT CGGTGCGCTT CCGCATCCGC CACATGATTG AGCAGGAGAG CCCGACCGAC ATTCTGTCGG ACGATGCGAT CGTCGCCAAA CTCAAGGACG TCAACATCGA CATTGCGAGG CGCACCGTCG CCAAATATCG CGAGAGCCTC AAAATCCGCT CCTCGGTGGA GCGGCGGCGC GAAAAATCTC ACATGTATTA A
|
Protein sequence | MALSTKLMMR QGQSLVMTPQ LLQAIKLLQF SNMELNAFVE EELERNPLLE RTDDAPDPHA LLADVEPLSD AAESGGSVDF NDPGETDWSS ESLAVDRGAL EASLGTELSN AFDDDRTAPA ADFAEGAGLS ATSWTGSSAG QGDGEGANLE AYAANPTNLK DHLEAQLMLA TSNPAERMIG LMLIDCIDDA GYYVDNMAET AARLKTPIAR VERVLSIIQG FDPSGVGARD LAECLAIQLR EKDRFDPAMQ ALVANLGLLA KRDFAALRKI CNVDEEDVAD MLGEIRQLNP KPGRAFGGGS IQPLVPDVIV RAAPGGAWHV ELNTEVLPRI LVNNSYVARV TKSQSNDVDK TFMSTCLQTA NWLTKSLEQR ARTILKVSSE IVRQQDAFFR QGVEHLRPLN LKTIAEAIGM HESTVSRVTS NKYMATPRGL FELKYFFTAS IASNNGGDAH SAESVRFRIR HMIEQESPTD ILSDDAIVAK LKDVNIDIAR RTVAKYRESL KIRSSVERRR EKSHMY
|
| |