Gene Information |
Locus tag | SNSL254_A3581 |
Symbol | rpoN |
ID | 6484826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3466469 |
End bp | 3467902 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642738861 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_002042578 |
Protein GI | 194442910 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
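The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3466469
END_BP = 3467902
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1434
    print(coding_codons(GENE_LENGTH_BP))   # 477
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length rows of the record.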
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG GTTTGCAACT CAGGCTTAGC CAACAACTGG CAATGACGCC TCAGCTACAA CAGGCCATCC GTCTGTTGCA GTTGTCTACG CTGGAACTTC AGCAGGAACT CCAGCAAGCG CTGGAAAATA ACCCGCTGCT TGAGCAAACC GATCTTCATG ACGAAATCGA CACCCAGCAA CCTCAGGATA ACGATCCTCT CGATACCGCC GACGCGCTCG AACAAAAAGA GATGCCGGAA GAGCTGCCGC TTGACGCCAG TTGGGATGAA ATTTACACCG CCGGGACGCC GTCCGGCCCC AGCGGCGATT ATATCGACGA TGAGCTGCCC GTCTATCAGG GCGAAACGAC GCAGTCGTTG CAGGATTATC TGATGTGGCA GGTTGAGCTA ACGCCCTTCT CGGACACCGA TCGCGCTATT GCGACATCCA TTGTCGACGC GGTAGATGAT ACCGGCTATC TCACCGTATC CCTGGACGAA ATTCGCGAAA GCATGGGCGA TGTAGAGGTG GATCTCGATG AGGTTGAAGC CGTTCTGAAG CGCATTCAGC GTTTTGATCC GGTAGGCGTC GCGGCAAAAG ATCTTCGCGA CTGTCTGCTG ATCCAGCTTT CACAATTCGA CAAATCCACG CCGTGGCTGG AAGAGGCGCG GCTCATTATC TGCGATCATC TTGATCTGCT GGCCAACCAC GATTTCCGCA CGTTGATGCG CGTTACCCGA CTAAAAGAAG AGGTGCTGAA AGAGGCGGTA AACCTGATTC AGTCGCTCGA TCCCAGACCC GGTCAATCGA TCCAGACCGG CGAGCCGGAG TACGTTATCC CGGATGTGCT GGTACGCAAG CATAACGGTC GCTGGACGGT GGAGCTTAAT AGCGACAGTA TTCCCCGTTT ACAGATTAAC CAGCACTATG CCGCCATGTG CAATAGCGCG CGCAACGATG CCGACAGCCA GTTCATCCGC AGTAATTTAC AGGATGCGAA ATGGCTGATA AAAAGCCTTG AAAGCCGTAA CGACACGCTG CTGCGCGTCA GTCGCTGCAT CGTCGAGCAA CAGCAAGCCT TTTTTGAACA GGGCGAAGAA TATATGAAAC CGATGGTACT GGCGGATATC GCCCAGGCCG TCGAGATGCA CGAATCCACC ATATCCCGCG TGACCACGCA GAAGTATCTG CACAGCCCGC GCGGTATTTT TGAGCTCAAA TATTTCTTTT CCAGCCACGT CAATACCGAA GGCGGAGGCG AAGCCTCTTC CACCGCGATT CGCGCGCTGG TGAAGAAGTT AATTGCGGCG GAAAACCCCG CGAAACCACT GAGCGACAGC AAGTTAACCT CTCTGCTGTC AGAACAAGGT ATCATGGTGG CACGCCGCAC TGTTGCGAAG TACCGAGAGT CTTTATCCAT TCCGCCGTCA AACCAACGCA AACAGCTGGT TTGA
Protein sequence | MKQGLQLRLS QQLAMTPQLQ QAIRLLQLST LELQQELQQA LENNPLLEQT DLHDEIDTQQ PQDNDPLDTA DALEQKEMPE ELPLDASWDE IYTAGTPSGP SGDYIDDELP VYQGETTQSL QDYLMWQVEL TPFSDTDRAI ATSIVDAVDD TGYLTVSLDE IRESMGDVEV DLDEVEAVLK RIQRFDPVGV AAKDLRDCLL IQLSQFDKST PWLEEARLII CDHLDLLANH DFRTLMRVTR LKEEVLKEAV NLIQSLDPRP GQSIQTGEPE YVIPDVLVRK HNGRWTVELN SDSIPRLQIN QHYAAMCNSA RNDADSQFIR SNLQDAKWLI KSLESRNDTL LRVSRCIVEQ QQAFFEQGEE YMKPMVLADI AQAVEMHEST ISRVTTQKYL HSPRGIFELK YFFSSHVNTE GGGEASSTAI RALVKKLIAA ENPAKPLSDS KLTSLLSEQG IMVARRTVAK YRESLSIPPS NQRKQLV
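The relationship between the gene and protein sequences can be illustrated with a small translation sketch. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation and stop codons (the tables differ in permitted start codons). Only the first 30 and last 12 nucleotides of the gene sequence are used here, to keep the example short. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Excerpts copied from the Gene sequence row (whitespace is ignored).
GENE_PREFIX = "ATGAAGCAAG GTTTGCAACT CAGGCTTAGC"
GENE_SUFFIX = "CAGCTGGTTTGA"  # last 12 nt, in frame


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(GENE_PREFIX))  # MKQGLQLRLS
    print(translate(GENE_SUFFIX))  # QLV
```

The two outputs match the first ten and last three residues of the protein sequence in the record. Applying the same function to the full gene sequence should, under the same assumptions, reproduce the full protein.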