Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4220 |
Symbol | rpoB |
ID | 5594854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4210827 |
End bp | 4214855 |
Gene Length | 4029 bp |
Protein Length | 1342 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640923323 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_001460773 |
Protein GI | 157163455 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000188468 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTACT CCTATACCGA GAAAAAACGT ATTCGTAAGG ATTTTGGTAA ACGTCCACAA GTTCTGGATG TACCTTATCT CCTTTCTATC CAGCTTGACT CGTTTCAGAA ATTTATCGAG CAAGATCCTG AAGGGCAGTA TGGTCTGGAA GCTGCTTTCC GTTCCGTATT CCCGATTCAG AGCTACAGCG GTAATTCCGA GCTGCAATAC GTCAGCTACC GCCTTGGCGA ACCGGTGTTT GACGTCCAGG AATGTCAAAT CCGTGGCGTG ACCTATTCCG CACCGCTGCG CGTTAAACTG CGTCTGGTGA TCTATGAGCG CGAAGCGCCG GAAGGCACCG TAAAAGACAT TAAAGAACAA GAAGTCTACA TGGGCGAAAT TCCGCTCATG ACAGACAACG GTACCTTTGT TATCAACGGT ACTGAGCGTG TTATCGTTTC CCAGCTGCAC CGTAGTCCGG GCGTCTTCTT TGACTCCGAC AAAGGTAAAA CCCACTCTTC GGGTAAAGTG CTGTATAACG CGCGTATCAT CCCTTACCGT GGTTCCTGGC TGGACTTCGA ATTCGATCCG AAGGACAACC TGTTCGTACG TATCGACCGT CGCCGTAAAC TGCCTGCGAC CATCATTCTG CGCGCCCTGA ACTACACCAC AGAGCAGATC CTCGACCTGT TCTTTGAAAA AGTTATCTTT GAAATCCGTG ATAACAAGCT GCAGATGGAA CTGGTGCCGG AACGCCTGCG TGGTGAAACC GCATCTTTTG ACATCGAAGC TAACGGTAAA GTGTACGTAG AAAAAGGCCG CCGTATCACT GCGCGCCACA TTCGCCAGCT GGAAAAAGAC GACGTCAAAC TGATCGAAGT CCCGGTTGAG TACATCGCAG GTAAAGTGGT TGCTAAAGAC TATATTGATG AGTCTACCGG CGAGCTGATC TGCGCAGCGA ACATGGAGCT GAGCCTGGAT CTGCTGGCTA AGCTGAGCCA GTCTGGTCAC AAGCGTATCG AAACGCTGTT CACCAACGAT CTGGATCACG GCCCATATAT CTCTGAAACC TTACGTGTCG ACCCAACTAA CGACCGTCTG AGCGCACTGG TAGAAATCTA CCGCATGATG CGCCCTGGCG AGCCGCCGAC TCGTGAAGCA GCTGAAAGCC TGTTCGAGAA CCTGTTCTTC TCCGAAGACC GTTATGACTT GTCTGCGGTT GGTCGTATGA AGTTCAACCG TTCTCTGCTG CGCGAAGAAA TCGAAGGTTC CGGTATCCTG AGCAAAGACG ACATCATTGA TGTTATGAAA AAGCTCATCG ATATCCGTAA CGGTAAAGGC GAAGTCGATG ATATCGACCA CCTCGGCAAC CGTCGTATCC GTTCCGTTGG CGAAATGGCG GAAAACCAGT TCCGCGTTGG CCTGGTACGT GTAGAGCGTG CGGTGAAAGA GCGTCTGTCT CTGGGCGATC TGGATACCCT GATGCCTCAG GATATGATCA ACGCCAAGCC GATTTCCGCA GCAGTGAAAG AGTTCTTCGG TTCCAGCCAG CTGTCTCAGT TTATGGACCA GAACAACCCG CTGTCTGAGA TTACGCACAA ACGTCGTATC TCCGCACTCG GCCCAGGCGG TCTGACCCGT GAACGTGCAG GCTTCGAAGT TCGAGACGTA CACCCGACTC ACTACGGTCG CGTATGTCCA ATCGAAACCC CTGAAGGTCC GAACATCGGT CTGATCAACT CTCTGTCCGT GTACGCACAG ACTAACGAAT ACGGCTTCCT TGAGACTCCG TATCGTAAAG TGACCGACGG TGTTGTAACT GACGAAATTC ACTACCTGTC TGCTATCGAA GAAGGCAACT ACGTTATCGC CCAGGCGAAC TCCAACCTGG ATGAAGAAGG CCACTTCGTA GAAGACCTGG TAACTTGCCG TAGCAAAGGC GAATCCAGCT TGTTCAGCCG CGACCAGGTT GACTACATGG ACGTATCCAC CCAGCAGGTG GTATCCGTCG GTGCGTCCCT GATCCCGTTC CTGGAACACG ATGACGCCAA CCGTGCATTG ATGGGTGCGA ACATGCAACG TCAGGCCGTT CCGACTCTGC GCGCTGATAA GCCGCTGGTT GGTACTGGTA TGGAACGTGC TGTTGCCGTT GACTCCGGTG TAACTGCGGT AGCTAAACGT GGTGGTGTCG TTCAGTACGT GGATGCTTCC CGTATCGTTA TCAAAGTTAA CGAAGACGAG ATGTATCCGG GTGAAGCAGG TATCGACATC TACAACCTGA CCAAATACAC CCGTTCTAAC CAGAACACCT GTATCAACCA GATGCCGTGT GTGTCTCTGG GTGAACCGGT TGAACGTGGC GACGTGCTGG CAGACGGTCC GTCCACCGAC CTCGGTGAAC TGGCGCTTGG TCAGAACATG CGCGTAGCGT TCATGCCGTG GAATGGTTAC AACTTCGAAG ACTCCATCCT CGTATCCGAG CGTGTTGTTC AGGAAGACCG TTTCACCACC ATCCACATTC AGGAACTGGC GTGTGTGTCC CGTGACACCA AGCTGGGGCC GGAAGAGATC ACCGCTGACA TCCCGAACGT GGGTGAAGCT GCGCTCTCCA AACTGGATGA ATCCGGTATC GTTTACATTG GTGCGGAAGT GACCGGTGGC GACATTCTGG TTGGTAAGGT AACGCCGAAA GGTGAAACTC AGCTGACCCC AGAAGAAAAA CTGCTGCGTG CGATCTTCGG TGAGAAAGCC TCTGACGTTA AAGACTCTTC TCTGCGCGTA CCAAACGGTG TATCCGGTAC GGTTATCGAC GTTCAGGTCT TTACTCGCGA TGGCGTAGAA AAAGACAAAC GTGCGCTGGA AATCGAAGAA ATGCAGCTCA AACAGGCGAA GAAAGACCTG TCTGAAGAAC TGCAGATCCT CGAAGCGGGT CTGTTCAGCC GTATCCGTGC TGTGCTGGTA GCCGGTGGCG TTGAAGCTGA GAAGCTCGAC AAACTGCCGC GCGATCGCTG GCTGGAGCTG GGCCTGACAG ACGAAGAGAA ACAAAATCAG CTGGAACAGC TGGCTGAGCA GTATGACGAA CTGAAACACG AGTTCGAGAA GAAACTCGAA GCGAAACGCC GCAAAATCAC CCAGGGCGAC GATCTGGCAC CGGGCGTGCT GAAGATTGTT AAGGTATATC TGGCGGTTAA ACGCCGTATC CAGCCTGGTG ACAAGATGGC AGGTCGTCAC GGTAACAAGG GTGTAATTTC TAAGATCAAC CCGATCGAAG ATATGCCTTA CGATGAAAAC GGTACGCCGG TAGACATCGT ACTGAACCCG CTGGGCGTAC CGTCTCGTAT GAACATCGGT CAGATCCTCG AAACCCACCT GGGTATGGCT GCGAAAGGTA TCGGCGACAA GATCAACGCC ATGCTGAAAC AGCAGCAAGA AGTCGCGAAA CTGCGCGAAT TCATCCAGCG TGCGTACGAT CTGGGCGCTG ACGTTCGTCA GAAAGTTGAC CTGAGTACCT TCAGCGATGA AGAAGTTATG CGTCTGGCTG AAAACCTGCG CAAAGGTATG CCAATCGCAA CGCCGGTGTT CGACGGTGCG AAAGAAGCAG AAATTAAAGA GCTGCTGAAA CTTGGCGACC TGCCGACTTC CGGTCAGATC CGCCTGTACG ATGGTCGCAC TGGTGAACAG TTCGAGCGTC CGGTAACCGT TGGTTACATG TACATGCTGA AACTGAACCA CCTGGTCGAC GACAAGATGC ACGCGCGTTC CACCGGTTCT TACAGCCTGG TTACTCAGCA GCCGCTGGGT GGTAAGGCAC AGTTCGGTGG TCAGCGTTTC GGGGAGATGG AAGTGTGGGC GCTGGAAGCA TACGGCGCAG CATACACCCT GCAGGAAATG CTCACCGTTA AGTCTGATGA CGTGAACGGT CGTACCAAGA TGTATAAAAA CATCGTGGAC GGCAACCATC AGATGGAGCC GGGCATGCCA GAATCCTTCA ACGTATTGTT GAAAGAGATT CGTTCGCTGG GTATCAACAT CGAACTGGAA GACGAGTAA
|
Protein sequence | MVYSYTEKKR IRKDFGKRPQ VLDVPYLLSI QLDSFQKFIE QDPEGQYGLE AAFRSVFPIQ SYSGNSELQY VSYRLGEPVF DVQECQIRGV TYSAPLRVKL RLVIYEREAP EGTVKDIKEQ EVYMGEIPLM TDNGTFVING TERVIVSQLH RSPGVFFDSD KGKTHSSGKV LYNARIIPYR GSWLDFEFDP KDNLFVRIDR RRKLPATIIL RALNYTTEQI LDLFFEKVIF EIRDNKLQME LVPERLRGET ASFDIEANGK VYVEKGRRIT ARHIRQLEKD DVKLIEVPVE YIAGKVVAKD YIDESTGELI CAANMELSLD LLAKLSQSGH KRIETLFTND LDHGPYISET LRVDPTNDRL SALVEIYRMM RPGEPPTREA AESLFENLFF SEDRYDLSAV GRMKFNRSLL REEIEGSGIL SKDDIIDVMK KLIDIRNGKG EVDDIDHLGN RRIRSVGEMA ENQFRVGLVR VERAVKERLS LGDLDTLMPQ DMINAKPISA AVKEFFGSSQ LSQFMDQNNP LSEITHKRRI SALGPGGLTR ERAGFEVRDV HPTHYGRVCP IETPEGPNIG LINSLSVYAQ TNEYGFLETP YRKVTDGVVT DEIHYLSAIE EGNYVIAQAN SNLDEEGHFV EDLVTCRSKG ESSLFSRDQV DYMDVSTQQV VSVGASLIPF LEHDDANRAL MGANMQRQAV PTLRADKPLV GTGMERAVAV DSGVTAVAKR GGVVQYVDAS RIVIKVNEDE MYPGEAGIDI YNLTKYTRSN QNTCINQMPC VSLGEPVERG DVLADGPSTD LGELALGQNM RVAFMPWNGY NFEDSILVSE RVVQEDRFTT IHIQELACVS RDTKLGPEEI TADIPNVGEA ALSKLDESGI VYIGAEVTGG DILVGKVTPK GETQLTPEEK LLRAIFGEKA SDVKDSSLRV PNGVSGTVID VQVFTRDGVE KDKRALEIEE MQLKQAKKDL SEELQILEAG LFSRIRAVLV AGGVEAEKLD KLPRDRWLEL GLTDEEKQNQ LEQLAEQYDE LKHEFEKKLE AKRRKITQGD DLAPGVLKIV KVYLAVKRRI QPGDKMAGRH GNKGVISKIN PIEDMPYDEN GTPVDIVLNP LGVPSRMNIG QILETHLGMA AKGIGDKINA MLKQQQEVAK LREFIQRAYD LGADVRQKVD LSTFSDEEVM RLAENLRKGM PIATPVFDGA KEAEIKELLK LGDLPTSGQI RLYDGRTGEQ FERPVTVGYM YMLKLNHLVD DKMHARSTGS YSLVTQQPLG GKAQFGGQRF GEMEVWALEA YGAAYTLQEM LTVKSDDVNG RTKMYKNIVD GNHQMEPGMP ESFNVLLKEI RSLGINIELE DE
|
| |