Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1265 |
Symbol | rpoB |
ID | 5208217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1556187 |
End bp | 1559876 |
Gene Length | 3690 bp |
Protein Length | 1229 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640594881 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_001275620 |
Protein GI | 148655415 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00716616 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCCGT TGACTCAAAG TGTTGTGCTC CAGCCATTGA TCGTGCCGAA CGATGTCGGC ATTGATGCGC TGCATCGCGG CAAGATCGAG CGCTATTCAT TCGCGCGCAT TTCAAACGCT ATCGAATTGC CCAAACTGAT TGAAACGCAA CTGAACAGTT TCGAGTGGTT CCGCAAAGAA GGGTTGCGTG AACTGTTTGA AGAGATTTCG CCGATTACCG ATTTCACCGG CAAAAATATG GAATTGCGCT TCCTCGACTA TCATTTCGGG GAACCGCGCT ACAACGAGCA TGAGTGCCGC GAACGGGGGA TTACCTATTC TGCACCGATC CGGGTCAATG TGCAGTTGCG CATCCTGAGC ACCGGAGAGC TGAAAGAGAG TGAAATCTTC CTTGGCGATT TCCCGCTGAT GACAGAGAAC GGCACCTTTG TCATCAACGG CGCCGAGCGC GTGGTGGTTT CCCAGTTGAT CCGATCGCCG GGAGTCTATT TCAAAGAAGA GAAAGATCCG ACCTCCGGGC GCAGTCTCCA TTCGGCAAAA TTGATCCCGA GTCGTGGCGC GTGGCTCGAG TTCGAGACCA ACAAGCGTGA TGTCATTTCG GTCAAAGTCG ATCGCAAGCG CAAGATTCCG GTCACGATCC TCCTGCGCGC CGTTCTTGGA TGGCGGGCGA ACCCCGACGG CAGCGGTCAG TGGGCGCCGG ATAACGAACT CGACCAGCGC GGGCGTGATG AGGAGATCCT GGAACTGTTC GCGCACCTGG AGACCGGCGA CCATCAGTAC ATCAAGGCGA CCCTCGATAA AGACCCGGCG CGCCATGCGA AGGAGGCGTT GCTGGAGTTG TACAAGCGCC TGCGACCGGG CGATCCGCCG ACTCTCGATA ACGCGCGCAA TCTGATTGAG GCGTTGCTCT TCAATCCGCG TCGTTACGAT TTGTCGCGCG TCGGGCGCTA TAAACTGAAT AAGAACCTCT GGGAAAAAGA TACCCGCCCT GAAGTGCGTC GTCAGGCGCC CGACGTGAAG GTGCGTGTGT TGCTCCCCGA CGATATTTTC AAGATCGTCG AGCGCATGAT TCAACTGAAC AACGGTACGC CAGGACTGCG CGCCGATGAT ATCGATCACC TCGGCAACCG CCGTGTGCGC ACCGTCGGCG AATTGATCCA GCAGCAGTTC CGCGTCGGTC TGCTGCGCAT GGAGCGCGTG ATCAAAGAAC GCATGTCGCT GCAGGACCCG GAGACGGCCA CGCCAAATGC GCTGGTGAAT ATTCGCCCGG TCGTCGCAGC CATGCGCGAG TTCTTCGGCG GGGCGCAACT CTCGCAGTTC ATGGATCAGA CCAACCCGCT GGCGGAATTG ACCCACAAGC GCCGCCTCTC CGCTCTCGGT CCGGGTGGTC TCAGTCGCGA TCGCGCCGGG TTCGAGGTGC GCGACGTGCA TCATTCGCAC TATGGGCGCA TCTGCCCGGT GGAGACGCCG GAAGGACCAA ACATCGGTCT GATCGGCACC ATGTCCACCT ATGCGCGGGT TAATGAAATG GGCTTCCTGG AGACGCCGTA TCGTAAGGTG TACCGCGAGG TGCCCAACGC CAGCGAGTGG GAGCGCCAGG GATTGTTACT GCGCGATGTG CGCGACCTGC GCACCGGCGA GTTGATCGCC GTGCGCGGCA CGCGTGTCGA TGCTGCGGTT GCGCGCCGGA TAGCCGTTGC GCTGCTGCGC GGGCAGATTC TGCGTGAGGA TGTGGTCGAT CCCACTACCG GCGAGGTGAT CGCCCACGCC GGGCAGGAAG TCAACCGCGC CCTGGCGGAG CGGATCGTCG AAACGCCATT GAAGATCATC AAGATCCGTC CGGTTGTCTC GCAGGAGGTG GATTATCTCT CCGCCGACGA GGAAGACAGG TTTGTCATTG TTCAGGCGAA CGCACCGCTA GACGAGCATA ATCGCTTCCT TGAAGGAACC GTGTCGGTGC GCTACGCCGG CGATTTTGAT GATGTGCCGA TTGAGCGCGT CGATTATATG GATGTGTCAC CCAAACAGGT GGTCAGCGTG TCAACGGCGC TGATCCCCTT CCTGGAGCAC GACGACGCCA ACCGCGCGCT GATGGGGTCG AATATGCAGC GACAGGCGGT GCCGCTCCTG CGTCCTGATG CGCCGATCGT CGGCACCGGC ATGGAGTATG TGGCGGCGCG CGATAGCGGG CAGGTGGTGG TTGCCAAAGC TGATGGCGTC GTTCTTTCGG CGACCGCTGA TGAGATCGTG ATCCTGGAGG ACGACGGCAA TGAGCGGTCG TACCGCCTGC GTAAGTTTAT GCGCTCGAAC CAGGACACCT GTATCAATCA GCGCCCGATT GTGTCACGTG GTCAGCGAGT GCGCAAGGGC GATATCATTG CCGACAGTTC CAGCACTGAT AACGGCGAAC TGGCGCTCGG GCAGAATGTG CTGGTTGCGT TCATGCCGTG GGAAGGCGGC AATTTTGAGG ATGCCATCCT GGTTTCAGAA CGCCTGGTGC GCGAAGATAT TTTCACCTCG ATCCATATCG AAAAGTATGA GGTCGAAGCG CGCGACACCA AACTGGGACC GGAAGAGATT ACCCGCGATA TTCCGAATGT CGGGCAGGAT AGCCTGCGTA ATCTGGATGA TCGCGGCATC ATCTATATCG GCGCCGAGGT GCAACCGAAC GATATTCTCG TCGGCAAGAT CACACCGAAA GGCGAAACCG ATCTCACGGC TGAGGAGCGT CTGCTGCGGG CAATCTTCGG CGAAAAAGCG CGCGAAGTGA AGGACTCGTC GCTGCGGGTG CCGAACGGCG TGCGCGGCAA GGTCATCGAT GTCAAGGTCT TTACCCGTGA TGACGATGTC GAACTGCCGG TCGGCGTCAA TCAGCGCGTC GATGTGTTGC TGTGTCAGAA GCGCAAGATC TCGGCTGGCG ATAAGATGGC CGGTCGCCAT GGGAACAAGG GCGTTGTCAG CCGCATTTTG CCGATCGAGG ATATGCCGTT TCTGCCCGAT GGGACGCCGG TCGATATCAT TTTGAACCCG ATCGGCGTGC CAAGCCGTAT GAACATCGGT CAGATTCTGG AGACGCATCT GGGATGGGCG GCTGCTCGCC TTGGCTATCG CGTCGCCACA CCGGTGTTCG ATGGCGCGAC TGAGACGGAG ATCAAGGAAT GGCTCAAACG CGCCGATCTG CCGCCTGATG GCAAGATCAC CCTCTACGAC GGGCGCACCG GCGAAGCGTT TGATCGCCCG GTGACCGTCG GGTATATCTA TATGATGAAA CTGGCGCACC TGGTGGAGGA TAAGATCCAC GCGCGCTCGA CCGGTCCATA CAGCCTGGTG ACGCAACAAC CGCTGGGCGG CAAGGCGCAG TTCGGCGGTC AGCGCTTCGG CGAAATGGAA GTCTGGGCGC TGGAAGCGTA TGGCGCTGCC TATACCCTCC AGGAAATGCT CACCGTGAAG TCGGACGATG TGGTGGGTCG CGTGAAGACG TATGAGGCGA TCGTGAAGGG TGAGCCGATC CAGGAGGCTG GCGTTCCGGA AAGCTTCAAG GTGCTGATCA AGGAATTGCA GTCGCTCGGT CTGTCGGTCG AGGTGCTGAG CGCCGATGAG ACGCCGGTGG AACTGACCGA TGACGCGGAT AGCGATCTGG CGGCGCTCGA TGGAATCAAC CTGTCGGGCA TGGAGCGTGG GGAATTCTAA
|
Protein sequence | MPPLTQSVVL QPLIVPNDVG IDALHRGKIE RYSFARISNA IELPKLIETQ LNSFEWFRKE GLRELFEEIS PITDFTGKNM ELRFLDYHFG EPRYNEHECR ERGITYSAPI RVNVQLRILS TGELKESEIF LGDFPLMTEN GTFVINGAER VVVSQLIRSP GVYFKEEKDP TSGRSLHSAK LIPSRGAWLE FETNKRDVIS VKVDRKRKIP VTILLRAVLG WRANPDGSGQ WAPDNELDQR GRDEEILELF AHLETGDHQY IKATLDKDPA RHAKEALLEL YKRLRPGDPP TLDNARNLIE ALLFNPRRYD LSRVGRYKLN KNLWEKDTRP EVRRQAPDVK VRVLLPDDIF KIVERMIQLN NGTPGLRADD IDHLGNRRVR TVGELIQQQF RVGLLRMERV IKERMSLQDP ETATPNALVN IRPVVAAMRE FFGGAQLSQF MDQTNPLAEL THKRRLSALG PGGLSRDRAG FEVRDVHHSH YGRICPVETP EGPNIGLIGT MSTYARVNEM GFLETPYRKV YREVPNASEW ERQGLLLRDV RDLRTGELIA VRGTRVDAAV ARRIAVALLR GQILREDVVD PTTGEVIAHA GQEVNRALAE RIVETPLKII KIRPVVSQEV DYLSADEEDR FVIVQANAPL DEHNRFLEGT VSVRYAGDFD DVPIERVDYM DVSPKQVVSV STALIPFLEH DDANRALMGS NMQRQAVPLL RPDAPIVGTG MEYVAARDSG QVVVAKADGV VLSATADEIV ILEDDGNERS YRLRKFMRSN QDTCINQRPI VSRGQRVRKG DIIADSSSTD NGELALGQNV LVAFMPWEGG NFEDAILVSE RLVREDIFTS IHIEKYEVEA RDTKLGPEEI TRDIPNVGQD SLRNLDDRGI IYIGAEVQPN DILVGKITPK GETDLTAEER LLRAIFGEKA REVKDSSLRV PNGVRGKVID VKVFTRDDDV ELPVGVNQRV DVLLCQKRKI SAGDKMAGRH GNKGVVSRIL PIEDMPFLPD GTPVDIILNP IGVPSRMNIG QILETHLGWA AARLGYRVAT PVFDGATETE IKEWLKRADL PPDGKITLYD GRTGEAFDRP VTVGYIYMMK LAHLVEDKIH ARSTGPYSLV TQQPLGGKAQ FGGQRFGEME VWALEAYGAA YTLQEMLTVK SDDVVGRVKT YEAIVKGEPI QEAGVPESFK VLIKELQSLG LSVEVLSADE TPVELTDDAD SDLAALDGIN LSGMERGEF
|
| |