Gene ECH74115_4380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4380 
SymbolrpoD 
ID6967952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4055661 
End bp4057502 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content53% 
IMG OID643388102 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_002272540 
Protein GI209399452 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00205022 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAA ACCCGCAGTC ACAGCTGAAA CTTCTTGTCA CCCGTGGTAA GGAGCAAGGC 
TATCTGACCT ATGCCGAGGT CAATGACCAT CTGCCGGAAG ATATCGTCGA TTCAGATCAG
ATCGAAGACA TCATCCAAAT GATCAACGAC ATGGGCATTC AGGTGATGGA AGAAGCACCG
GATGCCGATG ATCTGATGCT GGCTGAAAAC ACCGCGGACG AAGATGCTGC CGAAGCCGCC
GCGCAGGTGC TTTCCAGCGT GGAATCTGAA ATCGGGCGCA CGACTGACCC GGTACGCATG
TACATGCGTG AAATGGGCAC CGTTGAACTG TTGACCCGCG AAGGCGAAAT TGACATCGCT
AAGCGTATTG AAGACGGGAT CAACCAGGTT CAATGCTCCG TTGCTGAATA TCCGGAAGCG
ATCACCTATC TGCTGGAACA GTACGATCGT GTTGAAGCAG AAGAAGCGCG TCTGTCCGAT
CTGATCACCG GCTTTGTTGA CCCGAACGCA GAAGAAGATC TGGCACCTAC CGCCACTCAC
GTCGGTTCTG AGCTTTCCCA GGAAGATCTG GACGATGACG AAGATGAAGA CGAAGAAGAT
GGCGATGACG ACAGCGCCGA TGATGACAAC AGCATCGACC CGGAACTGGC TCGCGAAAAA
TTTGCGGAAC TGCGCGCTCA GTACGTTGTA ACGCGTGACA CCATCAAAGC GAAAGGTCGC
AGTCACGCTG CCGCTCAGGA AGAGATCCTG AAACTGTCTG AAGTATTCAA ACAGTTCCGC
CTGGTGCCGA AGCAGTTTGA CTACCTGGTC AACAGTATGC GCGTCATGAT GGACCGCGTT
CGTACGCAAG AACGTCTGAT CATGAAGCTC TGCGTTGAGC AGTGCAAAAT GCCGAAGAAA
AACTTCATTA CTCTGTTTAC CGGCAACGAA ACCAGCGATA CCTGGTTCAA CGCGGCAATT
GCGATGAACA AGCCGTGGTC GGAAAAACTG CACGATGTCT CTGAAGAAGT GCATCGCGCC
CTGCAGAAAC TGCAGCAGAT TGAAGAAGAA ACCGGCCTGA CCATCGAGCA GGTTAAAGAT
ATCAACCGTC GTATGTCCAT CGGTGAAGCG AAAGCCCGCC GTGCGAAGAA AGAGATGGTT
GAAGCGAACT TACGTCTGGT TATTTCTATC GCCAAGAAAT ACACCAACCG TGGCTTGCAG
TTCCTTGACC TGATTCAGGA AGGCAACATC GGTCTGATGA AAGCGGTTGA TAAATTCGAA
TACCGCCGTG GTTACAAGTT CTCCACCTAC GCAACCTGGT GGATCCGTCA GGCGATCACC
CGCTCTATCG CGGATCAGGC GCGCACCATC CGTATTCCGG TGCATATGAT TGAGACTATC
AACAAACTCA ACCGTATTTC TCGCCAGATG CTGCAAGAGA TGGGCCGTGA ACCGACGCCG
GAAGAACTGG CTGAACGTAT GCTGATGCCG GAAGACAAGA TCCGCAAAGT GCTGAAGATC
GCCAAAGAGC CAATCTCCAT GGAAACGCCG ATCGGTGATG ATGAAGATTC GCATCTGGGG
GATTTCATCG AGGATACCAC CCTCGAGCTG CCGCTGGATT CTGCGACCAC CGAAAGCCTG
CGTGCGGCAA CACACGACGT GCTGGCTGGC CTGACCGCGC GTGAAGCGAA AGTTCTGCGT
ATGCGTTTCG GTATCGATAT GAACACCGAT CACACGCTGG AAGAAGTGGG TAAACAGTTC
GACGTTACCC GTGAACGTAT CCGTCAGATC GAAGCGAAGG CGCTGCGCAA ACTGCGTCAC
CCGAGCCGTT CTGAAGTGCT GCGTAGCTTC CTGGACGATT AA
 
Protein sequence
MEQNPQSQLK LLVTRGKEQG YLTYAEVNDH LPEDIVDSDQ IEDIIQMIND MGIQVMEEAP 
DADDLMLAEN TADEDAAEAA AQVLSSVESE IGRTTDPVRM YMREMGTVEL LTREGEIDIA
KRIEDGINQV QCSVAEYPEA ITYLLEQYDR VEAEEARLSD LITGFVDPNA EEDLAPTATH
VGSELSQEDL DDDEDEDEED GDDDSADDDN SIDPELAREK FAELRAQYVV TRDTIKAKGR
SHAAAQEEIL KLSEVFKQFR LVPKQFDYLV NSMRVMMDRV RTQERLIMKL CVEQCKMPKK
NFITLFTGNE TSDTWFNAAI AMNKPWSEKL HDVSEEVHRA LQKLQQIEEE TGLTIEQVKD
INRRMSIGEA KARRAKKEMV EANLRLVISI AKKYTNRGLQ FLDLIQEGNI GLMKAVDKFE
YRRGYKFSTY ATWWIRQAIT RSIADQARTI RIPVHMIETI NKLNRISRQM LQEMGREPTP
EELAERMLMP EDKIRKVLKI AKEPISMETP IGDDEDSHLG DFIEDTTLEL PLDSATTESL
RAATHDVLAG LTAREAKVLR MRFGIDMNTD HTLEEVGKQF DVTRERIRQI EAKALRKLRH
PSRSEVLRSF LDD