Gene EcSMS35_4435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4435 
SymbolrpoB 
ID6145860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4526696 
End bp4530724 
Gene Length4029 bp 
Protein Length1342 aa 
Translation table11 
GC content53% 
IMG OID641619255 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001746371 
Protein GI170680149 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.399722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value2.77463e-06 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA 
GTTCTGGATG TACCTTATCT CCTTTCTATC CAGCTTGACT CGTTTCAGAA ATTTATCGAG
CAAGATCCTG AAGGGCAGTA TGGTCTGGAA GCTGCTTTCC GTTCCGTATT CCCGATTCAG
AGCTACAGCG GTAATTCCGA GCTGCAATAC GTCAGCTACC GCCTTGGCGA ACCGGTGTTT
GACGTCCAGG AATGTCAAAT CCGTGGCGTG ACCTATTCCG CACCGCTGCG CGTTAAACTG
CGTCTGGTGA TCTATGAGCG CGAAGCGCCG GAAGGCACCG TAAAAGACAT TAAAGAACAA
GAAGTCTACA TGGGCGAAAT TCCGCTCATG ACAGACAACG GTACCTTTGT TATCAACGGT
ACTGAGCGTG TTATCGTTTC CCAGCTGCAC CGTAGTCCGG GCGTCTTCTT TGACTCCGAC
AAAGGTAAAA CCCACTCTTC GGGTAAAGTG CTGTATAACG CGCGCATCAT CCCTTACCGT
GGTTCCTGGC TGGACTTCGA ATTCGATCCG AAGGACAACC TGTTCGTACG TATCGACCGT
CGCCGTAAAC TGCCTGCGAC CATCATTCTG CGCGCCCTGA ACTATACCAC AGAGCAGATC
CTCGACCTGT TCTTTGAAAA AGTTATCTTT GAAATCCGTG ATAACAAGCT GCAGATGGAA
CTGGTGCCGG AACGCCTGCG TGGTGAAACC GCATCTTTTG ACATCGAAGC TAACGGTAAA
GTGTACGTAG AAAAAGGCCG CCGTATTACT GCGCGCCATA TTCGCCAGCT GGAAAAAGAC
GACGTCAAAC TGATCGAAGT CCCGGTTGAG TACATCGCAG GTAAAGTGGT TGCTAAAGAC
TATATTGATG AGTCTACCGG CGAGCTGATC TGCGCAGCGA ACATGGAGCT GAGCCTGGAT
CTGCTGGCTA AGCTGAGCCA GTCTGGTCAC AAGCGTATCG AAACGCTGTT CACCAACGAC
CTGGATCACG GCCCATATAT CTCTGAAACC TTACGTGTCG ACCCAACTAA CGACCGTCTG
AGCGCACTGG TAGAAATCTA CCGTATGATG CGCCCTGGCG AGCCGCCGAC TCGTGAAGCT
GCTGAAAGCC TGTTCGAGAA CCTGTTCTTC TCCGAAGACC GCTATGACTT GTCTGCGGTT
GGTCGTATGA AGTTCAACCG TTCTCTGCTG CGCGAAGAAA TCGAAGGTTC CGGTATCCTG
AGCAAAGACG ACATCATTGA TGTTATGAAA AAGCTCATCG ATATCCGTAA CGGTAAAGGC
GAAGTCGATG ATATCGACCA CCTCGGCAAC CGTCGTATCC GTTCTGTTGG CGAAATGGCG
GAAAACCAGT TCCGCGTTGG CCTGGTGCGT GTAGAGCGTG CGGTGAAAGA GCGTCTGTCT
CTGGGCGATC TGGATACCCT GATGCCTCAG GATATGATCA ACGCCAAGCC GATTTCTGCA
GCAGTGAAAG AGTTCTTCGG TTCCAGCCAG CTGTCTCAGT TTATGGACCA GAACAACCCG
CTGTCTGAGA TTACGCACAA ACGTCGTATC TCCGCACTCG GCCCGGGTGG TCTGACCCGT
GAACGTGCAG GCTTCGAAGT TCGAGACGTA CACCCGACTC ACTACGGTCG CGTATGTCCA
ATCGAAACCC CTGAAGGTCC GAACATCGGT CTGATCAACT CTCTGTCCGT GTACGCACAG
ACTAACGAAT ACGGCTTCCT TGAGACTCCG TATCGTAAAG TGACCGACGG TGTTGTAACT
GACGAAATTC ACTACCTGTC TGCTATCGAA GAAGGCAACT ACGTTATCGC CCAGGCGAAC
TCCAACCTGG ATGAAGAAGG CCACTTCGTA GAAGACCTGG TAACTTGCCG TAGCAAAGGC
GAATCCAGCT TGTTCAGCCG TGACCAGGTT GACTACATGG ACGTATCCAC CCAGCAGGTG
GTATCCGTCG GTGCGTCCCT GATCCCGTTC CTGGAACACG ATGACGCCAA CCGTGCATTG
ATGGGTGCGA ACATGCAACG TCAGGCCGTT CCGACTCTGC GCGCTGATAA GCCGCTGGTT
GGTACGGGTA TGGAACGTGC TGTTGCCGTT GACTCCGGTG TAACTGCGGT TGCTAAACGT
GGTGGTGTCG TTCAGTACGT GGATGCTTCC CGTATCGTTA TCAAAGTTAA CGAAGACGAG
ATGTATCCGG GTGAAGCAGG TATCGACATC TACAACCTGA CCAAATACAC CCGTTCTAAC
CAGAACACTT GTATCAACCA GATGCCGTGT GTGTCTCTGG GGGAACCGGT AGAGCGTGGC
GACGTGCTGG CAGATGGTCC GTCCACCGAC CTCGGTGAAC TGGCGCTTGG TCAGAACATG
CGCGTAGCGT TCATGCCGTG GAATGGTTAC AACTTCGAAG ACTCCATCCT CGTATCCGAG
CGTGTTGTTC AGGAAGACCG TTTCACCACC ATCCACATTC AGGAACTGGC GTGTGTGTCC
CGTGACACCA AGCTGGGGCC GGAAGAGATC ACCGCTGACA TCCCGAACGT GGGTGAAGCT
GCGCTCTCCA AACTGGATGA ATCCGGTATC GTTTACATTG GTGCGGAAGT GACCGGTGGC
GACATTCTGG TTGGTAAGGT TACGCCGAAA GGCGAAACTC AGCTGACCCC AGAAGAAAAA
CTGCTGCGTG CGATCTTCGG TGAGAAAGCG TCTGACGTTA AAGACTCTTC TCTGCGCGTA
CCAAACGGTG TATCCGGTAC GGTTATCGAC GTTCAGGTCT TTACTCGCGA TGGCGTAGAA
AAAGACAAAC GTGCGCTGGA AATCGAAGAA ATGCAGCTCA AACAGGCGAA GAAAGACCTG
TCTGAAGAAC TGCAGATCCT CGAAGCGGGT CTGTTCAGCC GTATCCGTGC TGTGCTGGTA
GCCGGTGGCG TTGAAGCTGA GAAGCTCGAC AAACTGCCGC GCGATCGCTG GCTGGAGCTG
GGCCTGACCG ACGAAGAGAA ACAAAATCAG CTGGAACAGC TGGCTGAGCA GTATGACGAA
CTGAAACACG AGTTCGAGAA GAAACTCGAA GCGAAACGCC GCAAAATCAC CCAGGGCGAC
GATCTGGCAC CGGGCGTGCT GAAGATTGTT AAAGTATATC TGGCGGTTAA ACGCCGTATC
CAGCCTGGTG ACAAGATGGC AGGTCGTCAC GGTAACAAGG GTGTAATTTC TAAGATCAAC
CCGATCGAAG ATATGCCTTA CGATGAAAAC GGTACGCCGG TAGACATCGT ACTGAACCCG
CTGGGCGTAC CGTCTCGTAT GAACATCGGT CAGATCCTCG AAACCCACCT GGGTATGGCT
GCGAAAGGTA TCGGCGACAA GATCAACGCC ATGCTGAAAC AGCAGCAAGA AGTCGCGAAA
CTGCGCGAAT TCATCCAGCG TGCGTACGAT CTGGGCGCTG ACGTTCGTCA GAAAGTTGAC
CTGAGTACCT TCAGCGATGA AGAAGTTATG CGTCTGGCTG AAAACCTGCG CAAAGGTATG
CCAATCGCAA CGCCGGTCTT CGACGGTGCG AAAGAAGCAG AAATTAAAGA GCTGCTGAAA
CTTGGCGACC TGCCGACTTC TGGTCAGATC CGCCTGTACG ACGGCCGCAC TGGTGAACAG
TTCGAGCGTC CGGTAACCGT TGGTTACATG TACATGCTGA AACTGAACCA CCTGGTCGAC
GACAAGATGC ACGCGCGTTC CACCGGTTCT TACAGCCTGG TTACTCAGCA GCCGCTGGGT
GGTAAGGCAC AGTTCGGTGG TCAGCGTTTC GGGGAGATGG AAGTGTGGGC GCTGGAAGCA
TACGGCGCAG CATACACCCT GCAGGAAATG CTCACCGTTA AGTCTGATGA CGTGAACGGT
CGTACTAAGA TGTATAAAAA CATCGTGGAC GGCAACCATC AGATGGAGCC GGGCATGCCA
GAATCCTTCA ACGTATTGTT GAAAGAGATT CGTTCGCTGG GTATCAACAT CGAACTGGAA
GACGAGTAA
 
Protein sequence
MVYSYTEKKR IRKDFGKRPQ VLDVPYLLSI QLDSFQKFIE QDPEGQYGLE AAFRSVFPIQ 
SYSGNSELQY VSYRLGEPVF DVQECQIRGV TYSAPLRVKL RLVIYEREAP EGTVKDIKEQ
EVYMGEIPLM TDNGTFVING TERVIVSQLH RSPGVFFDSD KGKTHSSGKV LYNARIIPYR
GSWLDFEFDP KDNLFVRIDR RRKLPATIIL RALNYTTEQI LDLFFEKVIF EIRDNKLQME
LVPERLRGET ASFDIEANGK VYVEKGRRIT ARHIRQLEKD DVKLIEVPVE YIAGKVVAKD
YIDESTGELI CAANMELSLD LLAKLSQSGH KRIETLFTND LDHGPYISET LRVDPTNDRL
SALVEIYRMM RPGEPPTREA AESLFENLFF SEDRYDLSAV GRMKFNRSLL REEIEGSGIL
SKDDIIDVMK KLIDIRNGKG EVDDIDHLGN RRIRSVGEMA ENQFRVGLVR VERAVKERLS
LGDLDTLMPQ DMINAKPISA AVKEFFGSSQ LSQFMDQNNP LSEITHKRRI SALGPGGLTR
ERAGFEVRDV HPTHYGRVCP IETPEGPNIG LINSLSVYAQ TNEYGFLETP YRKVTDGVVT
DEIHYLSAIE EGNYVIAQAN SNLDEEGHFV EDLVTCRSKG ESSLFSRDQV DYMDVSTQQV
VSVGASLIPF LEHDDANRAL MGANMQRQAV PTLRADKPLV GTGMERAVAV DSGVTAVAKR
GGVVQYVDAS RIVIKVNEDE MYPGEAGIDI YNLTKYTRSN QNTCINQMPC VSLGEPVERG
DVLADGPSTD LGELALGQNM RVAFMPWNGY NFEDSILVSE RVVQEDRFTT IHIQELACVS
RDTKLGPEEI TADIPNVGEA ALSKLDESGI VYIGAEVTGG DILVGKVTPK GETQLTPEEK
LLRAIFGEKA SDVKDSSLRV PNGVSGTVID VQVFTRDGVE KDKRALEIEE MQLKQAKKDL
SEELQILEAG LFSRIRAVLV AGGVEAEKLD KLPRDRWLEL GLTDEEKQNQ LEQLAEQYDE
LKHEFEKKLE AKRRKITQGD DLAPGVLKIV KVYLAVKRRI QPGDKMAGRH GNKGVISKIN
PIEDMPYDEN GTPVDIVLNP LGVPSRMNIG QILETHLGMA AKGIGDKINA MLKQQQEVAK
LREFIQRAYD LGADVRQKVD LSTFSDEEVM RLAENLRKGM PIATPVFDGA KEAEIKELLK
LGDLPTSGQI RLYDGRTGEQ FERPVTVGYM YMLKLNHLVD DKMHARSTGS YSLVTQQPLG
GKAQFGGQRF GEMEVWALEA YGAAYTLQEM LTVKSDDVNG RTKMYKNIVD GNHQMEPGMP
ESFNVLLKEI RSLGINIELE DE