Gene SeD_A3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3679 
SymbolrpoN 
ID6871484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3525244 
End bp3526677 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content54% 
IMG OID642786657 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_002217291 
Protein GI198242246 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.246915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG GTTTGCAACT CAGGCTTAGC CAACAACTGG CAATGACGCC TCAGCTACAA 
CAGGCCATCC GTCTGTTGCA GTTGTCTACG CTGGAACTTC AGCAGGAACT CCAGCAAGCG
CTGGAAAATA ACCCGCTGCT TGAGCAAACC GATCTTCATG ACGAAATCGA CACTCAGCAA
CCTCAGGATA ACGATCCTCT CGATACCGCC GACGCGCTCG AACAAAAAGA GATGCCGGAA
GAGCTGCCGC TTGACGCCAG TTGGGATGAA ATTTACACCG CCGGGACGCC GTCCGGCCCC
AGCGGCGATT ATATCGACGA TGAGCTGCCC GTCTATCAGG GTGAAACGAC GCAGTCGTTG
CAGGATTATC TGATGTGGCA GGTTGAGCTA ACGCCCTTCT CGGACACCGA TCGCGCTATT
GCGACATCCA TTGTCGATGC GGTAGATGAT ACCGGCTATC TCACCGTATC CCTGGACGAA
ATTCGCGAAA GCATGGGCGA TGTAGAGGTG GATCTCGATG AGGTCGAAGC CGTCCTGAAG
CGTATTCAGC GTTTTGATCC GGTAGGCGTC GCGGCAAAAG ATCTTCGCGA CTGTCTGCTG
ATCCAGCTTT CACAATTCGA CAAATCCACG CCGTGGCTGG AAGAGGCGCG GCTCATTATC
TGCGATCATC TTGATCTGCT GGCCAACCAC GATTTCCGCA CGTTGATGCG CGTTACCCGA
CTAAAAGAAG AGGTGCTGAA AGAGGCGGTA AACCTGATTC AGTCGCTCGA TCCCAGACCC
GGTCAATCGA TCCAGACCGG CGAGCCGGAG TACGTTATCC CGGATGTGCT GGTACGCAAG
CATAACGGTC GCTGGACGGT GGAGCTTAAT AGCGACAGTA TTCCCCGTTT ACAGATTAAC
CAGCACTATG CCGCCATGTG CAATAGCGCG CGCAACGATG CCGACAGCCA GTTCATCCGC
AGTAATTTAC AGGATGCGAA ATGGCTGATA AAAAGCCTTG AAAGCCGCAA CGACACGCTG
CTGCGCGTCA GTCGCTGCAT CGTCGAGCAA CAGCAAGCCT TTTTTGAACA GGGCGAAGAA
TATATGAAAC CGATGGTACT GGCGGATATC GCCCAGGCCG TCGAGATGCA CGAATCCACT
ATATCCCGCG TGACCACGCA GAAGTATTTG CACAGCCCGC GCGGTATTTT TGAACTCAAA
TATTTCTTTT CCAGCCACGT CAATACCGAA GGCGGAGGCG AAGCCTCTTC CACCGCGATT
CGCGCGCTGG TGAAGAAGTT AATTGCGGCG GAAAACCCCG CGAAACCGCT GAGCGACAGC
AAGTTAACCT CTCTGCTGTC AGAACAAGGT ATCATGGTGG CGCGCCGCAC TGTTGCGAAG
TACCGAGAGT CTTTATCCAT TCCGCCGTCA AACCAACGCA AACAGCTGGT TTGA
 
Protein sequence
MKQGLQLRLS QQLAMTPQLQ QAIRLLQLST LELQQELQQA LENNPLLEQT DLHDEIDTQQ 
PQDNDPLDTA DALEQKEMPE ELPLDASWDE IYTAGTPSGP SGDYIDDELP VYQGETTQSL
QDYLMWQVEL TPFSDTDRAI ATSIVDAVDD TGYLTVSLDE IRESMGDVEV DLDEVEAVLK
RIQRFDPVGV AAKDLRDCLL IQLSQFDKST PWLEEARLII CDHLDLLANH DFRTLMRVTR
LKEEVLKEAV NLIQSLDPRP GQSIQTGEPE YVIPDVLVRK HNGRWTVELN SDSIPRLQIN
QHYAAMCNSA RNDADSQFIR SNLQDAKWLI KSLESRNDTL LRVSRCIVEQ QQAFFEQGEE
YMKPMVLADI AQAVEMHEST ISRVTTQKYL HSPRGIFELK YFFSSHVNTE GGGEASSTAI
RALVKKLIAA ENPAKPLSDS KLTSLLSEQG IMVARRTVAK YRESLSIPPS NQRKQLV