Gene SNSL254_A3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3581 
SymbolrpoN 
ID6484826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3466469 
End bp3467902 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content54% 
IMG OID642738861 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_002042578 
Protein GI194442910 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG GTTTGCAACT CAGGCTTAGC CAACAACTGG CAATGACGCC TCAGCTACAA 
CAGGCCATCC GTCTGTTGCA GTTGTCTACG CTGGAACTTC AGCAGGAACT CCAGCAAGCG
CTGGAAAATA ACCCGCTGCT TGAGCAAACC GATCTTCATG ACGAAATCGA CACCCAGCAA
CCTCAGGATA ACGATCCTCT CGATACCGCC GACGCGCTCG AACAAAAAGA GATGCCGGAA
GAGCTGCCGC TTGACGCCAG TTGGGATGAA ATTTACACCG CCGGGACGCC GTCCGGCCCC
AGCGGCGATT ATATCGACGA TGAGCTGCCC GTCTATCAGG GCGAAACGAC GCAGTCGTTG
CAGGATTATC TGATGTGGCA GGTTGAGCTA ACGCCCTTCT CGGACACCGA TCGCGCTATT
GCGACATCCA TTGTCGACGC GGTAGATGAT ACCGGCTATC TCACCGTATC CCTGGACGAA
ATTCGCGAAA GCATGGGCGA TGTAGAGGTG GATCTCGATG AGGTTGAAGC CGTTCTGAAG
CGCATTCAGC GTTTTGATCC GGTAGGCGTC GCGGCAAAAG ATCTTCGCGA CTGTCTGCTG
ATCCAGCTTT CACAATTCGA CAAATCCACG CCGTGGCTGG AAGAGGCGCG GCTCATTATC
TGCGATCATC TTGATCTGCT GGCCAACCAC GATTTCCGCA CGTTGATGCG CGTTACCCGA
CTAAAAGAAG AGGTGCTGAA AGAGGCGGTA AACCTGATTC AGTCGCTCGA TCCCAGACCC
GGTCAATCGA TCCAGACCGG CGAGCCGGAG TACGTTATCC CGGATGTGCT GGTACGCAAG
CATAACGGTC GCTGGACGGT GGAGCTTAAT AGCGACAGTA TTCCCCGTTT ACAGATTAAC
CAGCACTATG CCGCCATGTG CAATAGCGCG CGCAACGATG CCGACAGCCA GTTCATCCGC
AGTAATTTAC AGGATGCGAA ATGGCTGATA AAAAGCCTTG AAAGCCGTAA CGACACGCTG
CTGCGCGTCA GTCGCTGCAT CGTCGAGCAA CAGCAAGCCT TTTTTGAACA GGGCGAAGAA
TATATGAAAC CGATGGTACT GGCGGATATC GCCCAGGCCG TCGAGATGCA CGAATCCACC
ATATCCCGCG TGACCACGCA GAAGTATCTG CACAGCCCGC GCGGTATTTT TGAGCTCAAA
TATTTCTTTT CCAGCCACGT CAATACCGAA GGCGGAGGCG AAGCCTCTTC CACCGCGATT
CGCGCGCTGG TGAAGAAGTT AATTGCGGCG GAAAACCCCG CGAAACCACT GAGCGACAGC
AAGTTAACCT CTCTGCTGTC AGAACAAGGT ATCATGGTGG CACGCCGCAC TGTTGCGAAG
TACCGAGAGT CTTTATCCAT TCCGCCGTCA AACCAACGCA AACAGCTGGT TTGA
 
Protein sequence
MKQGLQLRLS QQLAMTPQLQ QAIRLLQLST LELQQELQQA LENNPLLEQT DLHDEIDTQQ 
PQDNDPLDTA DALEQKEMPE ELPLDASWDE IYTAGTPSGP SGDYIDDELP VYQGETTQSL
QDYLMWQVEL TPFSDTDRAI ATSIVDAVDD TGYLTVSLDE IRESMGDVEV DLDEVEAVLK
RIQRFDPVGV AAKDLRDCLL IQLSQFDKST PWLEEARLII CDHLDLLANH DFRTLMRVTR
LKEEVLKEAV NLIQSLDPRP GQSIQTGEPE YVIPDVLVRK HNGRWTVELN SDSIPRLQIN
QHYAAMCNSA RNDADSQFIR SNLQDAKWLI KSLESRNDTL LRVSRCIVEQ QQAFFEQGEE
YMKPMVLADI AQAVEMHEST ISRVTTQKYL HSPRGIFELK YFFSSHVNTE GGGEASSTAI
RALVKKLIAA ENPAKPLSDS KLTSLLSEQG IMVARRTVAK YRESLSIPPS NQRKQLV