Gene ECH74115_3992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3992 
SymbolrpoS 
ID6967777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3691527 
End bp3692519 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content52% 
IMG OID643387761 
ProductRNA polymerase sigma factor RpoS 
Protein accessionYP_002272204 
Protein GI209395891 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02394] RNA polymerase sigma factor RpoS
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA ATACGCTGAA AGTTCATGAT TTAAATGAAG ATGCGGAATT TGATGAGAAC 
GGAGTTGAGG TTTTTGACGA AAAGGCCTTA GTAGAAGAGG AACCCAGTGA TAACGATTTG
GCCGAAGAGG AACTGTTATC GCAGGGAGCC ACACAGCGTG TGTTGGACGC GACTCAGCTT
TACCTTGGTG AGATTGGTTA TTCACCACTG TTAACGGCCG AAGAAGAAGT TTATTTTGCG
CGTCGCGCAC TGCGTGGAGA TGTCGCCTCT CGCCGCCGGA TGATCGAGAG TAACTTGCGT
CTGGTGGTAA AAATTGCCCG CCGTTATGGC AATCGTGGTC TGGCGTTGCT GGACCTTATC
GAAGAGGGCA ACCTGGGGCT GATCCGTGCG GTAGAGAAGT TTGACCCGGA ACGTGGTTTC
CGCTTCTCAA CATACGCAAC CTGGTGGATT CGCCAGACGA TTGAACGGGC GATTATGAAC
CAAACCCGTA CTATTCGTTT GCCGATTCAC ATCGTAAAGG AGCTGAACGT TTACCTGCGA
ACCGCACGTG AGTTGTCCCA TAAGCTGGAC CATGAACCAA GTGCGGAAGA GATCGCAGAG
CAACTGGATA AGCCAGTTGA TGACGTCAGC CGTATGCTTC GTCTTAACGA GCGCATTACC
TCGGTAGACA CCCCGCTGGG TGGTGATTCC GAAAAAGCGT TGCTGGACAT CCTGGCCGAT
GAAAAAGAGA ACGGTCCGGA AGATACCACG CAAGATGACG ATATGAAGCA GAGCATCGTC
AAATGGCTGT TCGAGCTGAA CGCCAAACAG CGTGAAGTAC TGGCACGTCG ATTCGGTTTG
CTGGGGTACG AAGCGGCAAC ACTGGAAGAT GTAGGTCGTG AAATTGGCCT CACCCGTGAA
CGTGTTCGCC AGATTCAGGT TGAAGGCCTG CGCCGTTTGC GCGAAATCCT GCAAACGCAG
GGGCTGAATA TCGAAGCGCT GTTCCGCGAG TAA
 
Protein sequence
MSQNTLKVHD LNEDAEFDEN GVEVFDEKAL VEEEPSDNDL AEEELLSQGA TQRVLDATQL 
YLGEIGYSPL LTAEEEVYFA RRALRGDVAS RRRMIESNLR LVVKIARRYG NRGLALLDLI
EEGNLGLIRA VEKFDPERGF RFSTYATWWI RQTIERAIMN QTRTIRLPIH IVKELNVYLR
TARELSHKLD HEPSAEEIAE QLDKPVDDVS RMLRLNERIT SVDTPLGGDS EKALLDILAD
EKENGPEDTT QDDDMKQSIV KWLFELNAKQ REVLARRFGL LGYEAATLED VGREIGLTRE
RVRQIQVEGL RRLREILQTQ GLNIEALFRE