Gene EcolC_4038 details


Gene Information

Locus tag: EcolC_4038
Symbol: rpoB
ID: 6064720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4447056
End bp: 4451084
Gene Length: 4029 bp
Protein Length: 1342 aa
Translation table: 11
GC content: 53%
IMG OID: 641603453
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_001726964
Protein GI: 170022010
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR02013] DNA-directed RNA polymerase, beta subunit
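
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 4447056
end_bp = 4451084
gene_length_bp = 4029
protein_length_aa = 1342

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)           # coordinates agree with Gene Length
print(remainder == 0)                   # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # assumption: final codon is a stop
```

Under these assumptions the three fields are mutually consistent: 4029 bases make 1343 codons, one of which is the stop codon.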


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0403787
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00374316
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA 
GTTCTGGATG TACCTTATCT CCTTTCTATC CAGCTTGACT CGTTTCAGAA ATTTATCGAG
CAAGATCCTG AAGGGCAGTA TGGTCTGGAA GCTGCTTTCC GTTCCGTATT CCCGATTCAG
AGCTACAGCG GTAATTCCGA GCTGCAATAC GTCAGCTACC GCCTTGGCGA ACCGGTGTTT
GACGTCCAGG AATGTCAAAT CCGTGGCGTG ACCTATTCCG CACCGCTGCG CGTTAAACTG
CGTCTGGTGA TCTATGAGCG CGAAGCGCCG GAAGGCACCG TAAAAGACAT TAAAGAACAA
GAAGTCTACA TGGGCGAAAT TCCGCTCATG ACAGACAACG GTACCTTTGT TATCAACGGT
ACTGAGCGTG TTATCGTTTC CCAGCTGCAC CGTAGTCCGG GCGTCTTCTT TGACTCCGAC
AAAGGTAAAA CCCACTCTTC GGGTAAAGTG CTGTATAACG CGCGCATCAT CCCTTACCGT
GGTTCCTGGC TGGACTTCGA ATTCGATCCG AAGGACAACC TGTTCGTACG TATCGACCGT
CGCCGTAAAC TGCCTGCGAC CATCATTCTG CGCGCCCTGA ACTACACCAC AGAGCAGATC
CTCGACCTGT TCTTTGAAAA AGTTATCTTT GAAATCCGTG ATAACAAGCT GCAGATGGAA
CTGGTGCCGG AACGCCTGCG TGGTGAAACC GCATCTTTTG ACATCGAAGC TAACGGTAAA
GTGTACGTAG AAAAAGGCCG CCGTATCACT GCGCGCCACA TTCGCCAGCT GGAAAAAGAC
GACGTCAAAC TGATCGAAGT CCCGGTTGAG TACATCGCAG GTAAAGTGGT TGCTAAAGAC
TATATTGATG AGTCTACCGG CGAGCTGATC TGCGCAGCGA ACATGGAGCT GAGCCTGGAT
CTGCTGGCTA AGCTGAGCCA GTCTGGTCAC AAGCGTATCG AAACGCTGTT CACCAACGAT
CTGGATCACG GCCCATATAT CTCTGAAACC TTACGTGTCG ACCCAACTAA CGACCGTCTG
AGCGCACTGG TAGAAATCTA CCGCATGATG CGCCCTGGCG AGCCGCCGAC TCGTGAAGCA
GCGGAAAGCC TGTTCGAGAA CCTGTTCTTC TCCGAAGACC GTTATGACCT GTCTGCGGTT
GGTCGTATGA AGTTCAACCG TTCTCTGCTG CGCGAAGAAA TCGAAGGTTC CGGTATCCTG
AGCAAAGACG ACATCATTGA TGTTATGAAA AAGCTCATCG ATATCCGTAA CGGTAAAGGC
GAAGTCGATG ATATCGACCA CCTCGGCAAC CGTCGTATCC GTTCCGTTGG CGAAATGGCG
GAAAACCAGT TCCGCGTTGG CCTGGTACGT GTAGAGCGTG CGGTGAAAGA GCGTCTGTCT
CTGGGCGATC TGGATACCCT GATGCCTCAG GATATGATCA ACGCCAAGCC GATTTCCGCA
GCAGTGAAAG AGTTCTTCGG TTCCAGCCAG CTGTCTCAGT TTATGGACCA GAACAACCCG
CTGTCTGAGA TTACGCACAA ACGTCGTATC TCCGCACTCG GCCCAGGCGG TCTGACCCGT
GAACGTGCAG GCTTCGAAGT TCGAGACGTA CACCCGACTC ACTACGGTCG CGTATGTCCA
ATCGAAACCC CTGAAGGTCC GAACATCGGT CTGATCAACT CTCTGTCCGT GTACGCACAG
ACTAACGAAT ACGGCTTCCT TGAGACTCCG TATCGTAAAG TGACCGACGG TGTTGTAACT
GACGAAATTC ACTACCTGTC TGCTATCGAA GAAGGCAACT ACGTTATCGC CCAGGCGAAC
TCCAACCTGG ATGAAGAAGG CCACTTCGTA GAAGACCTGG TAACTTGCCG TAGCAAAGGC
GAATCCAGCT TGTTCAGCCG TGACCAGGTT GACTACATGG ACGTATCCAC CCAGCAGGTG
GTATCCGTCG GTGCGTCCCT GATCCCGTTC CTGGAACACG ATGACGCCAA CCGTGCATTG
ATGGGTGCGA ACATGCAACG TCAGGCCGTT CCGACTCTGC GTGCTGATAA GCCGCTGGTT
GGTACTGGTA TGGAACGTGC TGTTGCCGTT GACTCCGGTG TAACTGCGGT AGCTAAACGT
GGTGGTGTCG TTCAGTACGT GGATGCTTCC CGTATCGTTA TCAAAGTTAA CGAAGACGAG
ATGTATCCGG GTGAAGCAGG TATCGACATC TACAACCTGA CCAAATACAC CCGTTCTAAC
CAGAACACCT GTATCAACCA GATGCCGTGT GTGTCTCTGG GTGAACCGGT TGAACGTGGC
GACGTGCTGG CAGACGGTCC GTCCACCGAC CTCGGTGAAC TGGCGCTTGG TCAGAACATG
CGCGTAGCGT TCATGCCGTG GAATGGTTAC AACTTCGAAG ACTCCATCCT CGTATCCGAG
CGTGTTGTTC AGGAAGACCG TTTCACCACC ATCCACATTC AGGAACTGGC GTGTGTGTCC
CGTGACACCA AGCTGGGGCC AGAAGAGATC ACCGCTGACA TCCCGAACGT GGGTGAAGCT
GCGCTCTCCA AACTGGATGA ATCCGGTATC GTTTATATTG GTGCGGAAGT GACCGGTGGC
GACATTCTGG TTGGTAAGGT TACGCCGAAA GGTGAAACTC AGCTGACCCC AGAAGAAAAA
CTGCTGCGTG CGATCTTCGG TGAGAAAGCC TCTGACGTTA AAGACTCTTC TCTGCGCGTA
CCAAACGGTG TATCCGGTAC GGTTATCGAC GTTCAGGTCT TTACTCGCGA TGGCGTAGAA
AAAGACAAAC GTGCGCTGGA AATCGAAGAA ATGCAGCTCA AACAGGCGAA GAAAGACCTG
TCTGAAGAAC TGCAGATCCT CGAAGCGGGT CTGTTCAGCC GTATCCGTGC TGTGCTGGTA
GCCGGTGGCG TTGAAGCTGA GAAGCTCGAC AAACTGCCGC GCGATCGCTG GCTGGAGCTG
GGCCTGACCG ACGAAGAGAA ACAAAATCAG CTGGAACAGC TGGCTGAGCA GTATGACGAA
CTGAAACACG AGTTCGAGAA GAAACTCGAA GCGAAACGCC GCAAAATCAC CCAGGGCGAC
GATCTGGCAC CGGGCGTGCT GAAGATTGTT AAGGTATATC TGGCGGTTAA ACGCCGTATC
CAGCCTGGTG ACAAGATGGC AGGTCGTCAC GGTAACAAGG GTGTAATTTC TAAGATCAAC
CCGATCGAAG ATATGCCTTA CGATGAAAAC GGTACGCCGG TAGACATCGT ACTGAACCCG
CTGGGCGTAC CGTCTCGTAT GAACATCGGT CAGATCCTCG AAACCCACCT GGGTATGGCT
GCGAAAGGTA TCGGCGACAA GATCAACGCC ATGCTGAAAC AGCAGCAAGA AGTCGCGAAA
CTGCGCGAAT TCATCCAGCG TGCGTACGAT CTGGGCGCTG ACGTTCGTCA GAAAGTTGAC
CTGAGTACCT TCAGCGATGA AGAAGTTATG CGTCTGGCTG AAAACCTGCG CAAAGGTATG
CCAATCGCAA CGCCGGTGTT CGACGGTGCG AAAGAAGCAG AAATTAAAGA GCTGCTGAAA
CTTGGCGACC TGCCGACTTC CGGTCAGATC CGCCTGTACG ATGGTCGCAC TGGTGAACAG
TTCGAGCGTC CGGTAACCGT TGGTTACATG TACATGCTGA AACTGAACCA CCTGGTCGAC
GACAAGATGC ACGCGCGTTC CACCGGTTCT TACAGCCTGG TTACTCAGCA GCCGCTGGGT
GGTAAGGCAC AGTTCGGTGG TCAGCGTTTC GGGGAGATGG AAGTGTGGGC GCTGGAAGCA
TACGGCGCAG CATACACCCT GCAGGAAATG CTCACCGTTA AGTCTGATGA CGTGAACGGT
CGTACCAAGA TGTATAAAAA CATCGTGGAC GGCAACCATC AGATGGAGCC GGGCATGCCA
GAATCCTTCA ACGTATTGTT GAAAGAGATT CGTTCGCTGG GTATCAACAT CGAACTGGAA
GACGAGTAA
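
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. A minimal sketch, assuming the sequence contains only the letters A, C, G and T; `gc_fraction` is a hypothetical helper name, not part of any database tool.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases in a whitespace-blocked sequence."""
    # Remove the spaces and line breaks used for display.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First display line of the gene sequence, copied from the record above.
first_line = "ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA"
print(f"{gc_fraction(first_line):.2f}")
```

Passing the full sequence text to the same function gives a value that can be compared with the GC content field of the record.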
 
Protein sequence
MVYSYTEKKR IRKDFGKRPQ VLDVPYLLSI QLDSFQKFIE QDPEGQYGLE AAFRSVFPIQ 
SYSGNSELQY VSYRLGEPVF DVQECQIRGV TYSAPLRVKL RLVIYEREAP EGTVKDIKEQ
EVYMGEIPLM TDNGTFVING TERVIVSQLH RSPGVFFDSD KGKTHSSGKV LYNARIIPYR
GSWLDFEFDP KDNLFVRIDR RRKLPATIIL RALNYTTEQI LDLFFEKVIF EIRDNKLQME
LVPERLRGET ASFDIEANGK VYVEKGRRIT ARHIRQLEKD DVKLIEVPVE YIAGKVVAKD
YIDESTGELI CAANMELSLD LLAKLSQSGH KRIETLFTND LDHGPYISET LRVDPTNDRL
SALVEIYRMM RPGEPPTREA AESLFENLFF SEDRYDLSAV GRMKFNRSLL REEIEGSGIL
SKDDIIDVMK KLIDIRNGKG EVDDIDHLGN RRIRSVGEMA ENQFRVGLVR VERAVKERLS
LGDLDTLMPQ DMINAKPISA AVKEFFGSSQ LSQFMDQNNP LSEITHKRRI SALGPGGLTR
ERAGFEVRDV HPTHYGRVCP IETPEGPNIG LINSLSVYAQ TNEYGFLETP YRKVTDGVVT
DEIHYLSAIE EGNYVIAQAN SNLDEEGHFV EDLVTCRSKG ESSLFSRDQV DYMDVSTQQV
VSVGASLIPF LEHDDANRAL MGANMQRQAV PTLRADKPLV GTGMERAVAV DSGVTAVAKR
GGVVQYVDAS RIVIKVNEDE MYPGEAGIDI YNLTKYTRSN QNTCINQMPC VSLGEPVERG
DVLADGPSTD LGELALGQNM RVAFMPWNGY NFEDSILVSE RVVQEDRFTT IHIQELACVS
RDTKLGPEEI TADIPNVGEA ALSKLDESGI VYIGAEVTGG DILVGKVTPK GETQLTPEEK
LLRAIFGEKA SDVKDSSLRV PNGVSGTVID VQVFTRDGVE KDKRALEIEE MQLKQAKKDL
SEELQILEAG LFSRIRAVLV AGGVEAEKLD KLPRDRWLEL GLTDEEKQNQ LEQLAEQYDE
LKHEFEKKLE AKRRKITQGD DLAPGVLKIV KVYLAVKRRI QPGDKMAGRH GNKGVISKIN
PIEDMPYDEN GTPVDIVLNP LGVPSRMNIG QILETHLGMA AKGIGDKINA MLKQQQEVAK
LREFIQRAYD LGADVRQKVD LSTFSDEEVM RLAENLRKGM PIATPVFDGA KEAEIKELLK
LGDLPTSGQI RLYDGRTGEQ FERPVTVGYM YMLKLNHLVD DKMHARSTGS YSLVTQQPLG
GKAQFGGQRF GEMEVWALEA YGAAYTLQEM LTVKSDDVNG RTKMYKNIVD GNHQMEPGMP
ESFNVLLKEI RSLGINIELE DE
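
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It builds the codon table from the standard base ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. `translate` and `CODON_TABLE` are names chosen for this example.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a whitespace-blocked DNA sequence up to the first stop codon."""
    dna = "".join(blocked_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last display lines of the gene sequence, copied from the record.
print(translate("ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA"))
# MVYSYTEKKRIRKDFGKRPQ
print(translate("GACGAGTAA"))
# DE
```

These outputs match the first twenty residues and the final two residues of the protein sequence listed above.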