Gene ECH74115_5453 details


Gene Information

Locus tag: ECH74115_5453
Symbol: rpoB
ID: 6972258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5093254
End bp: 5097282
Gene Length: 4029 bp
Protein Length: 1342 aa
Translation table: 11
GC content: 53%
IMG OID: 643389102
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_002273503
Protein GI: 209395910
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR02013] DNA-directed RNA polymerase, beta subunit
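
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 5093254
END_BP = 5097282
GENE_LENGTH_BP = 4029
PROTEIN_LENGTH_AA = 1342


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 4029
print(expected_protein_length(GENE_LENGTH_BP))  # 1342
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.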


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.182219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.00256328
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA 
GTTCTGGATG TACCTTATCT CCTTTCTATC CAGCTTGACT CGTTTCAGAA ATTTATCGAG
CAAGATCCTG AAGGGCAGTA TGGTCTGGAA GCTGCTTTCC GTTCCGTATT CCCGATTCAG
AGCTACAGCG GTAATTCCGA GCTGCAATAC GTCAGCTACC GCCTTGGCGA ACCGGTGTTT
GACGTCCAGG AATGTCAAAT CCGTGGCGTG ACCTATTCCG CACCGCTGCG CGTTAAACTG
CGTCTGGTGA TCTATGAGCG CGAAGCGCCG GAAGGCACCG TAAAAGACAT TAAAGAACAA
GAAGTCTACA TGGGCGAAAT TCCGCTCATG ACAGACAACG GTACCTTTGT TATCAACGGT
ACTGAGCGTG TTATCGTTTC CCAGCTGCAC CGTAGTCCGG GCGTCTTCTT TGACTCCGAC
AAAGGTAAAA CCCACTCTTC GGGTAAAGTG CTGTATAACG CGCGCATCAT CCCTTACCGT
GGTTCCTGGC TGGACTTCGA ATTCGATCCG AAGGACAACC TGTTCGTACG TATCGACCGT
CGCCGTAAAC TGCCTGCGAC CATCATTCTG CGTGCCCTGA ACTACACCAC AGAGCAGATC
CTCGACCTGT TCTTTGAAAA AGTTATCTTT GAAATCCGTG ATAACAAGCT GCAGATGGAA
CTGGTGCCGG AACGCCTGCG TGGTGAAACC GCATCCTTTG ACATCGAAGC TAACGGTAAA
GTGTACGTAG AAAAAGGCCG CCGTATCACT GCGCGCCACA TTCGCCAGCT GGAAAAAGAC
GACGTCAAAC TGATCGAAGT CCCGGTTGAG TACATCGCAG GTAAAGTGGT TGCTAAAGAC
TATATTGATG AGTCTACCGG CGAGCTGATC TGCGCAGCGA ACATGGAGCT GAGCCTGGAT
CTGCTGGCTA AGCTGAGCCA GTCTGGTCAC AAGCGTATCG AAACGCTGTT CACCAATGAT
CTGGATCACG GCCCGTATAT CTCTGAAACC TTACGTGTCG ACCCAACTAA CGACCGTCTG
AGCGCACTGG TAGAAATCTA CCGCATGATG CGCCCTGGCG AGCCGCCGAC TCGTGAAGCA
GCGGAAAGCC TGTTCGAGAA CCTGTTCTTC TCCGAAGACC GTTATGACCT GTCTGCGGTT
GGTCGTATGA AGTTCAACCG TTCTCTGCTG CGCGAAGAAA TCGAAGGTTC TGGTATCCTG
AGCAAAGACG ACATCATTGA TGTTATGAAA AAGCTCATCG ATATCCGTAA CGGTAAAGGC
GAAGTCGATG ATATCGACCA CCTCGGCAAC CGTCGTATCC GTTCCGTTGG CGAAATGGCG
GAAAACCAGT TCCGCGTTGG CCTGGTACGT GTAGAGCGTG CGGTGAAAGA GCGTCTGTCT
CTGGGCGATC TGGATACCCT GATGCCTCAG GATATGATCA ACGCCAAGCC GATTTCCGCA
GCAGTGAAAG AGTTCTTCGG TTCCAGCCAG CTGTCTCAGT TTATGGACCA GAACAACCCG
CTGTCTGAGA TTACGCACAA ACGTCGTATC TCCGCACTCG GCCCAGGCGG TCTGACCCGT
GAACGTGCAG GCTTCGAAGT TCGAGACGTA CACCCGACTC ACTACGGTCG CGTATGTCCA
ATCGAAACCC CTGAAGGTCC GAACATCGGT CTGATCAACT CTCTGTCCGT GTACGCACAG
ACTAACGAAT ACGGCTTCCT TGAGACTCCG TATCGTAAAG TGACTGACGG TGTTGTAACT
GACGAAATTC ACTACCTGTC TGCTATCGAA GAAGGCAACT ACGTTATCGC CCAGGCGAAC
TCCAACCTGG ATGAAGAAGG CCACTTCGTA GAAGACCTGG TAACCTGCCG TAGCAAAGGC
GAATCCAGCT TGTTCAGCCG TGACCAGGTT GACTACATGG ACGTATCCAC CCAGCAGGTG
GTATCCGTCG GTGCGTCCCT GATCCCGTTC CTGGAACACG ATGACGCCAA CCGTGCATTG
ATGGGTGCGA ACATGCAACG TCAGGCCGTT CCGACTCTGC GTGCTGATAA GCCGCTGGTT
GGTACTGGTA TGGAACGTGC TGTTGCCGTT GACTCCGGTG TAACTGCGGT TGCTAAACGT
GGTGGTGTCG TTCAGTACGT GGATGCTTCC CGTATCGTTA TCAAAGTTAA CGAAGACGAG
ATGTATCCGG GTGAAGCAGG TATCGACATC TACAACCTGA CCAAATACAC CCGTTCTAAC
CAGAACACCT GTATTAACCA GATGCCGTGT GTGTCTCTGG GTGAACCGGT TGAACGTGGC
GACGTGCTGG CAGACGGTCC GTCCACCGAC CTTGGTGAAC TGGCGCTTGG TCAGAACATG
CGCGTAGCGT TCATGCCGTG GAATGGTTAC AACTTCGAAG ACTCCATCCT CGTATCCGAG
CGTGTTGTTC AGGAAGACCG TTTCACCACC ATCCACATTC AGGAACTGGC GTGTGTGTCC
CGTGACACCA AGCTGGGGCC AGAAGAGATC ACCGCTGACA TCCCGAACGT GGGTGAAGCT
GCGCTCTCCA AACTGGATGA ATCCGGTATC GTTTATATTG GTGCGGAAGT GACCGGTGGC
GACATTCTGG TTGGTAAGGT TACGCCGAAA GGTGAAACTC AGCTGACCCC AGAAGAAAAA
CTGCTGCGTG CGATCTTCGG TGAGAAAGCG TCTGACGTTA AAGACTCTTC TCTGCGCGTA
CCAAACGGTG TATCCGGTAC GGTTATCGAC GTTCAGGTCT TTACTCGCGA TGGCGTAGAA
AAAGACAAAC GTGCGCTGGA AATCGAAGAA ATGCAGCTCA AACAGGCGAA GAAAGACCTG
TCTGAAGAAC TGCAGATCCT CGAAGCGGGT CTGTTCAGCC GTATCCGTGC TGTGCTGGTA
GCCGGTGGCG TTGAAGCTGA GAAGCTCGAC AAATTGCCGC GCGATCGCTG GCTGGAGCTG
GGCCTGACCG ACGAAGAGAA ACAAAATCAG CTGGAACAGC TGGCTGAGCA GTATGACGAA
CTGAAACACG AGTTCGAGAA GAAACTCGAA GCGAAACGCC GCAAAATCAC CCAGGGCGAC
GATCTGGCAC CGGGCGTGCT GAAGATTGTT AAGGTATATC TGGCGGTTAA ACGCCGTATC
CAGCCTGGTG ACAAGATGGC AGGTCGTCAC GGTAACAAGG GTGTAATTTC TAAGATCAAC
CCGATCGAAG ATATGCCTTA CGATGAAAAC GGTACGCCGG TAGACATCGT ACTGAACCCG
CTGGGCGTAC CGTCTCGTAT GAACATCGGT CAGATCCTCG AAACCCACTT GGGTATGGCT
GCGAAAGGTA TCGGCGACAA GATCAACGCC ATGCTGAAAC AGCAGCAGGA AGTCGCGAAA
CTGCGTGAAT TCATCCAGCG TGCGTACGAT CTGGGCGCTG ACGTTCGTCA GAAAGTTGAC
CTGAGTACCT TCAGCGATGA AGAAGTTATG CGTCTGGCTG AAAACCTGCG CAAAGGTATG
CCAATCGCAA CGCCGGTGTT CGACGGTGCG AAAGAAGCAG AAATTAAAGA GCTGCTGAAA
CTTGGCGACC TGCCGACTTC TGGTCAGATC CGCCTGTACG ACGGCCGCAC TGGTGAACAG
TTCGAACGTC CGGTAACCGT TGGTTACATG TACATGCTGA AACTGAACCA CCTGGTCGAC
GACAAGATGC ACGCGCGTTC CACCGGTTCT TACAGCCTGG TTACTCAGCA GCCGCTGGGT
GGTAAGGCAC AGTTCGGTGG TCAGCGTTTC GGGGAGATGG AAGTGTGGGC GCTGGAAGCA
TACGGCGCAG CATACACCCT GCAGGAAATG CTCACCGTTA AGTCTGATGA CGTGAACGGT
CGTACTAAGA TGTATAAAAA CATCGTGGAC GGCAACCATC AGATGGAGCC GGGCATGCCA
GAATCCTTCA ACGTATTGTT GAAAGAGATT CGTTCGCTGG GTATCAACAT CGAACTGGAA
GACGAGTAA
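
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before the GC content field can be recomputed. A minimal sketch, assuming the sequence text is passed in as a plain string; `gc_fraction` is a hypothetical helper, not part of the source database.

```python
def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    dna = "".join(seq_text.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two 10-base blocks of the gene sequence: 8 of 20 bases are G or C.
print(gc_fraction("ATGGTTTACT CCTATACCGA"))  # 0.4
```

Applied to the full 4029 bp sequence and rounded to a whole percentage, the result can be compared with the reported GC content.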
 
Protein sequence
MVYSYTEKKR IRKDFGKRPQ VLDVPYLLSI QLDSFQKFIE QDPEGQYGLE AAFRSVFPIQ 
SYSGNSELQY VSYRLGEPVF DVQECQIRGV TYSAPLRVKL RLVIYEREAP EGTVKDIKEQ
EVYMGEIPLM TDNGTFVING TERVIVSQLH RSPGVFFDSD KGKTHSSGKV LYNARIIPYR
GSWLDFEFDP KDNLFVRIDR RRKLPATIIL RALNYTTEQI LDLFFEKVIF EIRDNKLQME
LVPERLRGET ASFDIEANGK VYVEKGRRIT ARHIRQLEKD DVKLIEVPVE YIAGKVVAKD
YIDESTGELI CAANMELSLD LLAKLSQSGH KRIETLFTND LDHGPYISET LRVDPTNDRL
SALVEIYRMM RPGEPPTREA AESLFENLFF SEDRYDLSAV GRMKFNRSLL REEIEGSGIL
SKDDIIDVMK KLIDIRNGKG EVDDIDHLGN RRIRSVGEMA ENQFRVGLVR VERAVKERLS
LGDLDTLMPQ DMINAKPISA AVKEFFGSSQ LSQFMDQNNP LSEITHKRRI SALGPGGLTR
ERAGFEVRDV HPTHYGRVCP IETPEGPNIG LINSLSVYAQ TNEYGFLETP YRKVTDGVVT
DEIHYLSAIE EGNYVIAQAN SNLDEEGHFV EDLVTCRSKG ESSLFSRDQV DYMDVSTQQV
VSVGASLIPF LEHDDANRAL MGANMQRQAV PTLRADKPLV GTGMERAVAV DSGVTAVAKR
GGVVQYVDAS RIVIKVNEDE MYPGEAGIDI YNLTKYTRSN QNTCINQMPC VSLGEPVERG
DVLADGPSTD LGELALGQNM RVAFMPWNGY NFEDSILVSE RVVQEDRFTT IHIQELACVS
RDTKLGPEEI TADIPNVGEA ALSKLDESGI VYIGAEVTGG DILVGKVTPK GETQLTPEEK
LLRAIFGEKA SDVKDSSLRV PNGVSGTVID VQVFTRDGVE KDKRALEIEE MQLKQAKKDL
SEELQILEAG LFSRIRAVLV AGGVEAEKLD KLPRDRWLEL GLTDEEKQNQ LEQLAEQYDE
LKHEFEKKLE AKRRKITQGD DLAPGVLKIV KVYLAVKRRI QPGDKMAGRH GNKGVISKIN
PIEDMPYDEN GTPVDIVLNP LGVPSRMNIG QILETHLGMA AKGIGDKINA MLKQQQEVAK
LREFIQRAYD LGADVRQKVD LSTFSDEEVM RLAENLRKGM PIATPVFDGA KEAEIKELLK
LGDLPTSGQI RLYDGRTGEQ FERPVTVGYM YMLKLNHLVD DKMHARSTGS YSLVTQQPLG
GKAQFGGQRF GEMEVWALEA YGAAYTLQEM LTVKSDDVNG RTKMYKNIVD GNHQMEPGMP
ESFNVLLKEI RSLGINIELE DE
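
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the database's own method. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons; table 11 also allows alternative start codons, which are not handled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (standard code, also used
# by translation table 11 for all non-initiator positions).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds_text: str) -> str:
    """Translate a CDS, ignoring whitespace; stop at the first stop codon."""
    dna = "".join(cds_text.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the final three codons.
first_line = "ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA"
print(translate(first_line))   # MVYSYTEKKRIRKDFGKRPQ
print(translate("GACGAGTAA"))  # DE
```

The two outputs match the first 20 residues and the last two residues of the protein sequence listed above.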