Gene SeHA_C3617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3617 
SymbolrpoN 
ID6489181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3499725 
End bp3501158 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content54% 
IMG OID642743737 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_002047349 
Protein GI194447923 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.167296 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG GTTTGCAACT CAGGCTTAGC CAACAACTGG CGATGACACC TCAGCTACAA 
CAGGCCATCC GTCTGTTGCA GTTGTCTACG CTGGAACTTC AGCAGGAACT CCAGCAAGCG
CTGGAAAATA ACCCGCTGCT TGAGCAAACC GATCTTCATG ACGAAATCGA CACTCAGCAA
CCTCAGGATA ACGATCCTCT CGATACCGCC GACGCGCTCG AACAAAAAGA GATGCCGGAA
GAGCTGCCGC TTGACGCCAG TTGGGATGAA ATTTACACCG CCGGGACGCC GTCCGGCCCC
AGCGGCGATT ATATCGACGA TGAGCTGCCC GTCTATCAGG GCGAAACGAC GCAGTCGTTG
CAGGATTATC TGATGTGGCA GGTTGAGCTA ACGCCCTTCT CGGATACCGA TCGCGCTATT
GCGACATCCA TTGTCGACGC GGTAGATGAT ACCGGCTATC TCACCGTATC CCTGGACGAA
ATTCGCGAAA GCATGGGCGA TGTAGAGGTG GATCTCGATG AGGTTGAAGC CGTCCTGAAG
CGCATTCAGC GTTTTGATCC GGTAGGCGTC GCGGCAAAAG ATCTTCGCGA CTGTCTGCTG
ATCCAGCTTT CACAATTCGA CAAATCCACG CCGTGGCTGG AAGAGGCGCG GCTTATTATC
TGCGATCACC TTGATCTGCT GGCCAACCAC GATTTCCGCA CGTTGATGCG CGTTACCCGA
CTGAAAGAAG AGGTGCTGAA AGAGGCGGTA AACCTGATTC AGTCGCTCGA TCCCAGACCC
GGTCAATCGA TCCAGACCGG CGAGCCGGAG TACGTTATCC CGGATGTGCT GGTACGCAAG
CATAACGGTC GCTGGACGGT GGAGCTTAAT GGCGACAGTA TTCCCCGTTT ACAGATTAAC
CAGCACTATG CCGCCATGTG CAATAGCGCG CGCAACGATG CCGACAGCCA GTTCATCCGC
AGTAATTTAC AGGATGCGAA ATGGCTGATA AAAAGCCTTG AAAGCCGTAA CGACACGCTG
CTGCGCGTCA GTCGCTGCAT CGTCGAGCAA CAGCAAGCCT TTTTTGAACA GGGCGAAGAA
TATATGAAAC CGATGGTACT GGCGGATATC GCCCAGGCCG TCGAGATGCA CGAATCCACT
ATATCCCGCG TGACCACGCA GAAGTATCTG CACAGCCCGC GCGGTATTTT TGAACTCAAA
TATTTCTTTT CCAGCCACGT CAATACCGAA GGCGGAGGCG AAGCCTCTTC CACCGCGATT
CGCGCGCTGG TGAAGAAGTT AATTGCGGCG GAAAACCCCG CGAAACCACT GAGTGACAGC
AAGTTAACCT CTCTGCTGTC AGAACAGGGT ATCATGGTGG CGCGCCGCAC TGTTGCGAAG
TACCGAGAGT CTTTATCCAT TCCGCCGTCA AACCAACGCA AACAGCTGGT TTGA
 
Protein sequence
MKQGLQLRLS QQLAMTPQLQ QAIRLLQLST LELQQELQQA LENNPLLEQT DLHDEIDTQQ 
PQDNDPLDTA DALEQKEMPE ELPLDASWDE IYTAGTPSGP SGDYIDDELP VYQGETTQSL
QDYLMWQVEL TPFSDTDRAI ATSIVDAVDD TGYLTVSLDE IRESMGDVEV DLDEVEAVLK
RIQRFDPVGV AAKDLRDCLL IQLSQFDKST PWLEEARLII CDHLDLLANH DFRTLMRVTR
LKEEVLKEAV NLIQSLDPRP GQSIQTGEPE YVIPDVLVRK HNGRWTVELN GDSIPRLQIN
QHYAAMCNSA RNDADSQFIR SNLQDAKWLI KSLESRNDTL LRVSRCIVEQ QQAFFEQGEE
YMKPMVLADI AQAVEMHEST ISRVTTQKYL HSPRGIFELK YFFSSHVNTE GGGEASSTAI
RALVKKLIAA ENPAKPLSDS KLTSLLSEQG IMVARRTVAK YRESLSIPPS NQRKQLV