Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2724 |
Symbol | rpoB |
ID | 4810718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3211435 |
End bp | 3215187 |
Gene Length | 3753 bp |
Protein Length | 1250 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108143 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_001039116 |
Protein GI | 125975206 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.280863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACATC CCGTGAAATT GGGAAGAAAC GTTAGAATGA GTTATTCAAA AATTGATGAA GTTATCGACA TGCCAAACCT TATCGAAATT CAGAAGAATT CGTACGAGCA ATTCCTCAAA GAGGGTTTTA AGGAGGTTTT CAAAGACGTC AATCCCATAA CTGATTACAC CGGAAATCTG ATTCTGGAAT TTGTAGACTA TTCATTGGAT GAACCCCCCA AATACAGTGT GGATGAATGT AAAGAAAGAG ACGCGACTTA TGCTGCACCC TTAAAAGTAA AGGTGCGTCT TATCAACAAG GAAACGGGCG AAGTTAAAGA ACAGGAAATT TTTATGGGTG ATTTCCCACT GATGACTGAA ACGGGAACTT TTATTATTAA TGGAGCTGAG AGGGTTATAG TCAGTCAGCT TGTAAGATCA CCCGGAATTT ACTATGCAAT GAAGATTGAT AAAGCAGGAA AACAGTTGTT TTCAAATACA GTTATTCCAA ACAGAGGAGC ATGGCTTGAA TATGAAACCG ATTCAAACGA CGTTTTGTCG GTGCGTATAG ACAGAACCAG GAAGCTGCCT CTGACGGTAT TGGTAAGAGC ACTGGGATAT GGAACGGATC TTGAGATTAC CGAGCTTTTT GGCGAGGATG AAAGAATACT TGCGACAATA CAGAAAGACA GTACCAAGAC AGAAGAGGAA GGACTTCTTG AGATATACAA AAGGCTCAGA CCGGGAGAGC CGCCGACGGT TGAAAGTGCA AAAGCACTTC TCCACGGACT GTTTTTCGAC CCTAAAAGGT ATGATTTGGC AAAGCCGGGA AGATTTAAAT TTAACAAAAA GCTTTCCATT GCAGCAAGAA TACACGGCTT TATAGCAGGT GAAAATATTA AAGACCCTGA TACCGGTGAA ATAATTGTTG CTGAAGGAGA AACCATTTCA AGGGAAAAGG CCGAAACGAT ACAAAATGCC GGTGTCAATA CGGTAATTCT CAGAGTTGAC GGCAAAAATG TCAAAGTCAT CGGCAATGAC ATGGTGGATA TCAAAAGATA TGTGGATTTT GACCCGAAGG AAATCGGCAT CAATGAAAAA GTCAAAAGAG ATGTTTTAAT GGAGATTCTT GAAGAGTACA AGGGAAAGGG CGACGATGCG ATCAAAAAGG CTTTACAGGA AAGAATTGAC GATCTGATTC CAAAGCATAT CACGAAAGAA GATATAATAT CGTCCATCAG CTATATAATA GGATTGAGCT ACGGAATCGG AAGCACGGAT GACATTGACC ATTTGGGTAA CAGAAGACTC CGTTCTGTAG GAGAGCTTCT GCAAAATCAG TTCAGGATTG GTCTTTCCAG AATGGAAAGA GTGGTAAGGG AAAGAATGAC AATCCAGGAC CTGGATGTTG TCACTCCACA GGCGCTTATT AATATAAGGC CGGTGGCGGC GGCAATAAAA GAGTTCTTTG GAAGCAGCCA GCTGTCCCAG TTCATGGACC AGACCAATCC TTTGGCGGAG CTTACGCATA AAAGAAGGCT CAGTGCGCTG GGCCCCGGAG GTTTGAGCAG GGAAAGAGCG GGATTTGAAG TTCGTGACGT TCACCATTCC CACTATGGGC GTATGTGCCC TATAGAGACA CCGGAAGGAC CGAACATTGG TCTTATCGGT TCGCTCAGTA CCTATGCAAG AGTAAATGAA TACGGTTTTA TTGAAACGCC GTACAGAAAA GTAAGCAAGG AAGAACCGGG TAAAGTTACC AACGAGATAG TTTACCTGAC TGCGGATGAG GAAGACGAAT ATATTATTGC GCAGGCAAAT GAGCCGCTGG ATGAAGAAGG AAGGTTTATT TCCAATAAAG TTGTATGTAG ATACAAGGAA GAGTTTATTG AGGTTGATCC GTCCAAAATT GACTTTATGG ATGTATCACC AAAGCAGATA GTTTCTGTTG CGACATCAAT GATACCGTTC CTTGAAAATG ATGACGCGAA CCGTGCACTG ATGGGAGCAA ACATGCAACG TCAGGCGGTT CCGCTTATAA AAACTGAGTC GCCGATTGTC GGAACGGGAA TTGAGTACAG GGCTGCAAGG GATTCAGGAG TTGTTATACT GGCGAAAAAT CCGGGAGTTG TTGAAAAGGT AACCGCCAAT GAGATTATTA TTCGTACAAA AGACGGTAAA AGAGATACTT ACAAGCTCCT CAAATACATG CGCTCCAACC AGGGAACATG TATCAACCAA AGACCGATAG TGAAAAAAGG TGAAGAGGTT GAAGCGGGAG ATGTAATAGC GGACGGTCCT TCCACAGATA ACGGTGAAAT TGCATTAGGA AAGAATGTTC TGGTAGGATT TATGACCTGG GAAGGATACA ACTACGAGGA CGCCATCCTG ATAAGTGAAA GACTGGTTAA AGATGATGTG TTTACATCCA TCCATATAGA AGAGTATGAG GCTGAGGCAA GGGACACGAA GCTCGGTCCC GAAGATATAA CCAGAGAAAT ACCCAATGTC AGCGAAGATG CATTGAAAGA CCTGAACAGC GAAGGTATTA TCAGAATAGG AGCCGAAGTC AGAGCCGGTG ATATCCTCGT AGGAAAGGTT ACGCCAAAAG GAGAGACGGA GCTTACCGCT GAAGAAAGAC TTCTTCGTGC AATTTTCGGA GAAAAAGCAA GGGAAGTCAG GGACACCTCT TTGCGTGTGC CTCACGGGGA ATCGGGTATT GTTGTTGACG TAAAAATATT TACAAGAGAA AACGGTGATG AACTGGCACC TGGAGTAAAC AAGCTGGTAA GGGTTTATGT TGCACAGAAG AGAAAAATAT CTGTTGGAGA CAAAATGGCA GGAAGACACG GAAACAAGGG TGTTATATCG AGGATTCTGC CGGTGGAAGA TATGCCTTTC CTTCCTGACG GTACTCCATT GGATATAGTT TTAAATCCGC TGGGCGTTCC GTCCCGTATG AATATCGGAC AGGTACTTGA GGTACATCTT GGTTATGCGG CTAAAGCCCT TGGCTGGAAA GTTGCAACTC CGGTTTTTGA CGGTGCTACC GAGGAAGATA TAGTTCAGAC GCTGAGAAAA GCAGGTCTTG CTGAAGACGG CAAGTCAATA TTGTATGACG GAAGAACGGG AGAACCGTTT GAAAACAGGG TGACTGTAGG TTATATGTAT ATGCTCAAGC TTGCCCACCT TGTAGATGAC AAGATACATG CCCGTTCAAC CGGTCCATAT TCTCTGGTAA CGCAGCAGCC TTTGGGAGGT AAGGCACAAT TCGGCGGACA GAGATTCGGA GAGATGGAAG TTTGGGCTTT GGAAGCTTAC GGTGCTGCCT ATACACTGCA GGAGATACTG ACGGTCAAGT CGGACGATGT AGTGGGCAGG GTTAAGACTT ACGAGGCAAT AGTAAAGGGC GAAAACGTTC CGGAGCCGGG TATCCCCGAA TGCTTCAAGG TTCTTATCAA GGAACTTCAA AGTTTGTGCC TTGATGTGAA AGTATACTCG GAGGAACAGG AAGAAATAGC TATTAAAGAA TCGGTTGAAG ATGATCTTGA AGAACTTAAC GTCAACATTG AGGGCAGGGA AGATGAAGTT AACTTCAATG AGTTTAACGA CATCGGAGAG GAAATAACGG ATGAAGACCT TGAAGTGGAA GACTTCGACC TGCAGGATTT AAACGACGAT GATATCAATC CTGATGACAC TATTGATGCC GAACTGGATG ACAATCTTTT TGACGATGAT TTTGATGATA CTTTTGATGA CGATGATTTA TAA
|
Protein sequence | MVHPVKLGRN VRMSYSKIDE VIDMPNLIEI QKNSYEQFLK EGFKEVFKDV NPITDYTGNL ILEFVDYSLD EPPKYSVDEC KERDATYAAP LKVKVRLINK ETGEVKEQEI FMGDFPLMTE TGTFIINGAE RVIVSQLVRS PGIYYAMKID KAGKQLFSNT VIPNRGAWLE YETDSNDVLS VRIDRTRKLP LTVLVRALGY GTDLEITELF GEDERILATI QKDSTKTEEE GLLEIYKRLR PGEPPTVESA KALLHGLFFD PKRYDLAKPG RFKFNKKLSI AARIHGFIAG ENIKDPDTGE IIVAEGETIS REKAETIQNA GVNTVILRVD GKNVKVIGND MVDIKRYVDF DPKEIGINEK VKRDVLMEIL EEYKGKGDDA IKKALQERID DLIPKHITKE DIISSISYII GLSYGIGSTD DIDHLGNRRL RSVGELLQNQ FRIGLSRMER VVRERMTIQD LDVVTPQALI NIRPVAAAIK EFFGSSQLSQ FMDQTNPLAE LTHKRRLSAL GPGGLSRERA GFEVRDVHHS HYGRMCPIET PEGPNIGLIG SLSTYARVNE YGFIETPYRK VSKEEPGKVT NEIVYLTADE EDEYIIAQAN EPLDEEGRFI SNKVVCRYKE EFIEVDPSKI DFMDVSPKQI VSVATSMIPF LENDDANRAL MGANMQRQAV PLIKTESPIV GTGIEYRAAR DSGVVILAKN PGVVEKVTAN EIIIRTKDGK RDTYKLLKYM RSNQGTCINQ RPIVKKGEEV EAGDVIADGP STDNGEIALG KNVLVGFMTW EGYNYEDAIL ISERLVKDDV FTSIHIEEYE AEARDTKLGP EDITREIPNV SEDALKDLNS EGIIRIGAEV RAGDILVGKV TPKGETELTA EERLLRAIFG EKAREVRDTS LRVPHGESGI VVDVKIFTRE NGDELAPGVN KLVRVYVAQK RKISVGDKMA GRHGNKGVIS RILPVEDMPF LPDGTPLDIV LNPLGVPSRM NIGQVLEVHL GYAAKALGWK VATPVFDGAT EEDIVQTLRK AGLAEDGKSI LYDGRTGEPF ENRVTVGYMY MLKLAHLVDD KIHARSTGPY SLVTQQPLGG KAQFGGQRFG EMEVWALEAY GAAYTLQEIL TVKSDDVVGR VKTYEAIVKG ENVPEPGIPE CFKVLIKELQ SLCLDVKVYS EEQEEIAIKE SVEDDLEELN VNIEGREDEV NFNEFNDIGE EITDEDLEVE DFDLQDLNDD DINPDDTIDA ELDDNLFDDD FDDTFDDDDL
|
| |