Gene SeD_A3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3233 
SymbolrpoS 
ID6872205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3109984 
End bp3110976 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content52% 
IMG OID642786249 
ProductRNA polymerase sigma factor RpoS 
Protein accessionYP_002216890 
Protein GI198243199 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02394] RNA polymerase sigma factor RpoS
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.718161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA ATACGCTGAA AGTTCATGAT TTAAATGAAG ACGCGGAATT TGATGAGAAC 
GGAGTAGAGG CTTTTGACGA AAAAGCCTTG AGTGAAGAGG AACCCAGTGA TAACGACCTG
GCTGAAGAAG AGCTGTTATC GCAAGGGGCC ACACAGCGTG TGTTGGACGC GACTCAGCTT
TACCTTGGTG AGATTGGGTA TTCACCACTG TTAACAGCCG AAGAAGAAGT CTATTTTGCG
CGTCGCGCAC TGCGTGGAGA TGTCGCTTCT CGCCGTCGCA TGATTGAGAG TAACCTGCGT
CTGGTGGTAA AAATTGCCCG CCGTTATGGC AATCGTGGAC TGGCGTTGCT GGACCTGATT
GAAGAGGGCA ATCTGGGGCT TATCCGTGCA GTCGAGAAGT TTGACCCGGA ACGCGGGTTC
CGCTTCTCAA CATACGCAAC CTGGTGGATT CGCCAGACAA TCGAACGGGC GATTATGAAC
CAAACCCGTA CGATTCGCTT GCCGATTCAC ATTGTTAAAG AGCTGAACGT ATACCTGCGC
ACCGCACGTG AGTTGTCGCA TAAACTGGAC CACGAACCGA GTGCGGAAGA AATTGCAGAG
CAACTGGATA AACCGGTTGA TGACGTCAGC CGTATGCTTC GTCTCAACGA GCGCATTACC
TCGGTAGACA CCCCGCTGGG CGGTGATTCC GAAAAAGCGT TGCTGGACAT CCTGGCCGAT
GAAAAAGAGA ACGGTCCGGA AGACACCACG CAAGATGACG ATATGAAACA GAGCATCGTC
AAATGGTTGT TCGAACTGAA CGCCAAACAG CGTGAAGTGC TGGCGCGCCG TTTCGGTCTG
CTGGGATATG AAGCTGCGAC ACTGGAAGAT GTAGGCCGTG AAATCGGTCT TACGCGTGAG
CGTGTTCGTC AGATTCAGGT TGAAGGCCTG CGCCGTCTGC GCGAAATTCT GCAGACGCAG
GGGCTGAATA TCGAAGCGCT GTTCCGCGAG TAA
 
Protein sequence
MSQNTLKVHD LNEDAEFDEN GVEAFDEKAL SEEEPSDNDL AEEELLSQGA TQRVLDATQL 
YLGEIGYSPL LTAEEEVYFA RRALRGDVAS RRRMIESNLR LVVKIARRYG NRGLALLDLI
EEGNLGLIRA VEKFDPERGF RFSTYATWWI RQTIERAIMN QTRTIRLPIH IVKELNVYLR
TARELSHKLD HEPSAEEIAE QLDKPVDDVS RMLRLNERIT SVDTPLGGDS EKALLDILAD
EKENGPEDTT QDDDMKQSIV KWLFELNAKQ REVLARRFGL LGYEAATLED VGREIGLTRE
RVRQIQVEGL RRLREILQTQ GLNIEALFRE