Gene EcHS_A4220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4220 
SymbolrpoB 
ID5594854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4210827 
End bp4214855 
Gene Length4029 bp 
Protein Length1342 aa 
Translation table11 
GC content53% 
IMG OID640923323 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001460773 
Protein GI157163455 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000188468 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA 
GTTCTGGATG TACCTTATCT CCTTTCTATC CAGCTTGACT CGTTTCAGAA ATTTATCGAG
CAAGATCCTG AAGGGCAGTA TGGTCTGGAA GCTGCTTTCC GTTCCGTATT CCCGATTCAG
AGCTACAGCG GTAATTCCGA GCTGCAATAC GTCAGCTACC GCCTTGGCGA ACCGGTGTTT
GACGTCCAGG AATGTCAAAT CCGTGGCGTG ACCTATTCCG CACCGCTGCG CGTTAAACTG
CGTCTGGTGA TCTATGAGCG CGAAGCGCCG GAAGGCACCG TAAAAGACAT TAAAGAACAA
GAAGTCTACA TGGGCGAAAT TCCGCTCATG ACAGACAACG GTACCTTTGT TATCAACGGT
ACTGAGCGTG TTATCGTTTC CCAGCTGCAC CGTAGTCCGG GCGTCTTCTT TGACTCCGAC
AAAGGTAAAA CCCACTCTTC GGGTAAAGTG CTGTATAACG CGCGTATCAT CCCTTACCGT
GGTTCCTGGC TGGACTTCGA ATTCGATCCG AAGGACAACC TGTTCGTACG TATCGACCGT
CGCCGTAAAC TGCCTGCGAC CATCATTCTG CGCGCCCTGA ACTACACCAC AGAGCAGATC
CTCGACCTGT TCTTTGAAAA AGTTATCTTT GAAATCCGTG ATAACAAGCT GCAGATGGAA
CTGGTGCCGG AACGCCTGCG TGGTGAAACC GCATCTTTTG ACATCGAAGC TAACGGTAAA
GTGTACGTAG AAAAAGGCCG CCGTATCACT GCGCGCCACA TTCGCCAGCT GGAAAAAGAC
GACGTCAAAC TGATCGAAGT CCCGGTTGAG TACATCGCAG GTAAAGTGGT TGCTAAAGAC
TATATTGATG AGTCTACCGG CGAGCTGATC TGCGCAGCGA ACATGGAGCT GAGCCTGGAT
CTGCTGGCTA AGCTGAGCCA GTCTGGTCAC AAGCGTATCG AAACGCTGTT CACCAACGAT
CTGGATCACG GCCCATATAT CTCTGAAACC TTACGTGTCG ACCCAACTAA CGACCGTCTG
AGCGCACTGG TAGAAATCTA CCGCATGATG CGCCCTGGCG AGCCGCCGAC TCGTGAAGCA
GCTGAAAGCC TGTTCGAGAA CCTGTTCTTC TCCGAAGACC GTTATGACTT GTCTGCGGTT
GGTCGTATGA AGTTCAACCG TTCTCTGCTG CGCGAAGAAA TCGAAGGTTC CGGTATCCTG
AGCAAAGACG ACATCATTGA TGTTATGAAA AAGCTCATCG ATATCCGTAA CGGTAAAGGC
GAAGTCGATG ATATCGACCA CCTCGGCAAC CGTCGTATCC GTTCCGTTGG CGAAATGGCG
GAAAACCAGT TCCGCGTTGG CCTGGTACGT GTAGAGCGTG CGGTGAAAGA GCGTCTGTCT
CTGGGCGATC TGGATACCCT GATGCCTCAG GATATGATCA ACGCCAAGCC GATTTCCGCA
GCAGTGAAAG AGTTCTTCGG TTCCAGCCAG CTGTCTCAGT TTATGGACCA GAACAACCCG
CTGTCTGAGA TTACGCACAA ACGTCGTATC TCCGCACTCG GCCCAGGCGG TCTGACCCGT
GAACGTGCAG GCTTCGAAGT TCGAGACGTA CACCCGACTC ACTACGGTCG CGTATGTCCA
ATCGAAACCC CTGAAGGTCC GAACATCGGT CTGATCAACT CTCTGTCCGT GTACGCACAG
ACTAACGAAT ACGGCTTCCT TGAGACTCCG TATCGTAAAG TGACCGACGG TGTTGTAACT
GACGAAATTC ACTACCTGTC TGCTATCGAA GAAGGCAACT ACGTTATCGC CCAGGCGAAC
TCCAACCTGG ATGAAGAAGG CCACTTCGTA GAAGACCTGG TAACTTGCCG TAGCAAAGGC
GAATCCAGCT TGTTCAGCCG CGACCAGGTT GACTACATGG ACGTATCCAC CCAGCAGGTG
GTATCCGTCG GTGCGTCCCT GATCCCGTTC CTGGAACACG ATGACGCCAA CCGTGCATTG
ATGGGTGCGA ACATGCAACG TCAGGCCGTT CCGACTCTGC GCGCTGATAA GCCGCTGGTT
GGTACTGGTA TGGAACGTGC TGTTGCCGTT GACTCCGGTG TAACTGCGGT AGCTAAACGT
GGTGGTGTCG TTCAGTACGT GGATGCTTCC CGTATCGTTA TCAAAGTTAA CGAAGACGAG
ATGTATCCGG GTGAAGCAGG TATCGACATC TACAACCTGA CCAAATACAC CCGTTCTAAC
CAGAACACCT GTATCAACCA GATGCCGTGT GTGTCTCTGG GTGAACCGGT TGAACGTGGC
GACGTGCTGG CAGACGGTCC GTCCACCGAC CTCGGTGAAC TGGCGCTTGG TCAGAACATG
CGCGTAGCGT TCATGCCGTG GAATGGTTAC AACTTCGAAG ACTCCATCCT CGTATCCGAG
CGTGTTGTTC AGGAAGACCG TTTCACCACC ATCCACATTC AGGAACTGGC GTGTGTGTCC
CGTGACACCA AGCTGGGGCC GGAAGAGATC ACCGCTGACA TCCCGAACGT GGGTGAAGCT
GCGCTCTCCA AACTGGATGA ATCCGGTATC GTTTACATTG GTGCGGAAGT GACCGGTGGC
GACATTCTGG TTGGTAAGGT AACGCCGAAA GGTGAAACTC AGCTGACCCC AGAAGAAAAA
CTGCTGCGTG CGATCTTCGG TGAGAAAGCC TCTGACGTTA AAGACTCTTC TCTGCGCGTA
CCAAACGGTG TATCCGGTAC GGTTATCGAC GTTCAGGTCT TTACTCGCGA TGGCGTAGAA
AAAGACAAAC GTGCGCTGGA AATCGAAGAA ATGCAGCTCA AACAGGCGAA GAAAGACCTG
TCTGAAGAAC TGCAGATCCT CGAAGCGGGT CTGTTCAGCC GTATCCGTGC TGTGCTGGTA
GCCGGTGGCG TTGAAGCTGA GAAGCTCGAC AAACTGCCGC GCGATCGCTG GCTGGAGCTG
GGCCTGACAG ACGAAGAGAA ACAAAATCAG CTGGAACAGC TGGCTGAGCA GTATGACGAA
CTGAAACACG AGTTCGAGAA GAAACTCGAA GCGAAACGCC GCAAAATCAC CCAGGGCGAC
GATCTGGCAC CGGGCGTGCT GAAGATTGTT AAGGTATATC TGGCGGTTAA ACGCCGTATC
CAGCCTGGTG ACAAGATGGC AGGTCGTCAC GGTAACAAGG GTGTAATTTC TAAGATCAAC
CCGATCGAAG ATATGCCTTA CGATGAAAAC GGTACGCCGG TAGACATCGT ACTGAACCCG
CTGGGCGTAC CGTCTCGTAT GAACATCGGT CAGATCCTCG AAACCCACCT GGGTATGGCT
GCGAAAGGTA TCGGCGACAA GATCAACGCC ATGCTGAAAC AGCAGCAAGA AGTCGCGAAA
CTGCGCGAAT TCATCCAGCG TGCGTACGAT CTGGGCGCTG ACGTTCGTCA GAAAGTTGAC
CTGAGTACCT TCAGCGATGA AGAAGTTATG CGTCTGGCTG AAAACCTGCG CAAAGGTATG
CCAATCGCAA CGCCGGTGTT CGACGGTGCG AAAGAAGCAG AAATTAAAGA GCTGCTGAAA
CTTGGCGACC TGCCGACTTC CGGTCAGATC CGCCTGTACG ATGGTCGCAC TGGTGAACAG
TTCGAGCGTC CGGTAACCGT TGGTTACATG TACATGCTGA AACTGAACCA CCTGGTCGAC
GACAAGATGC ACGCGCGTTC CACCGGTTCT TACAGCCTGG TTACTCAGCA GCCGCTGGGT
GGTAAGGCAC AGTTCGGTGG TCAGCGTTTC GGGGAGATGG AAGTGTGGGC GCTGGAAGCA
TACGGCGCAG CATACACCCT GCAGGAAATG CTCACCGTTA AGTCTGATGA CGTGAACGGT
CGTACCAAGA TGTATAAAAA CATCGTGGAC GGCAACCATC AGATGGAGCC GGGCATGCCA
GAATCCTTCA ACGTATTGTT GAAAGAGATT CGTTCGCTGG GTATCAACAT CGAACTGGAA
GACGAGTAA
 
Protein sequence
MVYSYTEKKR IRKDFGKRPQ VLDVPYLLSI QLDSFQKFIE QDPEGQYGLE AAFRSVFPIQ 
SYSGNSELQY VSYRLGEPVF DVQECQIRGV TYSAPLRVKL RLVIYEREAP EGTVKDIKEQ
EVYMGEIPLM TDNGTFVING TERVIVSQLH RSPGVFFDSD KGKTHSSGKV LYNARIIPYR
GSWLDFEFDP KDNLFVRIDR RRKLPATIIL RALNYTTEQI LDLFFEKVIF EIRDNKLQME
LVPERLRGET ASFDIEANGK VYVEKGRRIT ARHIRQLEKD DVKLIEVPVE YIAGKVVAKD
YIDESTGELI CAANMELSLD LLAKLSQSGH KRIETLFTND LDHGPYISET LRVDPTNDRL
SALVEIYRMM RPGEPPTREA AESLFENLFF SEDRYDLSAV GRMKFNRSLL REEIEGSGIL
SKDDIIDVMK KLIDIRNGKG EVDDIDHLGN RRIRSVGEMA ENQFRVGLVR VERAVKERLS
LGDLDTLMPQ DMINAKPISA AVKEFFGSSQ LSQFMDQNNP LSEITHKRRI SALGPGGLTR
ERAGFEVRDV HPTHYGRVCP IETPEGPNIG LINSLSVYAQ TNEYGFLETP YRKVTDGVVT
DEIHYLSAIE EGNYVIAQAN SNLDEEGHFV EDLVTCRSKG ESSLFSRDQV DYMDVSTQQV
VSVGASLIPF LEHDDANRAL MGANMQRQAV PTLRADKPLV GTGMERAVAV DSGVTAVAKR
GGVVQYVDAS RIVIKVNEDE MYPGEAGIDI YNLTKYTRSN QNTCINQMPC VSLGEPVERG
DVLADGPSTD LGELALGQNM RVAFMPWNGY NFEDSILVSE RVVQEDRFTT IHIQELACVS
RDTKLGPEEI TADIPNVGEA ALSKLDESGI VYIGAEVTGG DILVGKVTPK GETQLTPEEK
LLRAIFGEKA SDVKDSSLRV PNGVSGTVID VQVFTRDGVE KDKRALEIEE MQLKQAKKDL
SEELQILEAG LFSRIRAVLV AGGVEAEKLD KLPRDRWLEL GLTDEEKQNQ LEQLAEQYDE
LKHEFEKKLE AKRRKITQGD DLAPGVLKIV KVYLAVKRRI QPGDKMAGRH GNKGVISKIN
PIEDMPYDEN GTPVDIVLNP LGVPSRMNIG QILETHLGMA AKGIGDKINA MLKQQQEVAK
LREFIQRAYD LGADVRQKVD LSTFSDEEVM RLAENLRKGM PIATPVFDGA KEAEIKELLK
LGDLPTSGQI RLYDGRTGEQ FERPVTVGYM YMLKLNHLVD DKMHARSTGS YSLVTQQPLG
GKAQFGGQRF GEMEVWALEA YGAAYTLQEM LTVKSDDVNG RTKMYKNIVD GNHQMEPGMP
ESFNVLLKEI RSLGINIELE DE