Gene Information |
Locus tag | P9211_16071 |
Symbol | rpoB |
ID | 5730824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1436468 |
End bp | 1439758 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641285985 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_001551492 |
Protein GI | 159904148 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
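The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1436468
END_BP = 1439758
GENE_LENGTH_BP = 3291
PROTEIN_LENGTH_AA = 1096


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    computed_bp = span_length(START_BP, END_BP)
    computed_aa = expected_protein_length(computed_bp)
    print("gene length matches record:", computed_bp == GENE_LENGTH_BP)
    print("protein length matches record:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the interval spans 3291 bp, which is 1097 codons, or 1096 amino acids plus the stop codon.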
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.614693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGAA GCGCGATTCA GGTCGCTAAG ACCGCTACAC ACCTGCCTGA CTTGGTTGAG GTGCAGCGTG CAAGCTTTAA GTGGTTTTTG GAAAAAGGTT TAATTGAGGA GCTTGAGAAC TTCTCACCAA TTACTGATTA CACGGGCAAG CTAGAGCTAC ATTTTATTGG CAGTGAATAT CGGCTTAAGC GGCCTCGCCA TGATGTGGAA GAAGCTAAAA AGCGCGATGC AACTTTTGCT TCCCAAATGT ATGTAACTTG CCGTCTTATT AATAAAGAAA CTGGAGAAAT AAAAGAACAA GAAGTTTTTA TTGGTGAATT GCCATTAATG ACTGAAAGAG GAACATTTAT CATTAATGGA GCAGAAAGAG TAATTGTTAA TCAGATAGTA AGAAGCCCTG GCGTATATTT TAAAGATGAG CAAGATAAAA ATGGCCGAAG AACTTACAAT GCAAGTGTTA TTCCTAATCG CGGGGCATGG TTAAAGTTTG AAACAGATAA GAATGATTTA CTTCATGTTC GTGTAGATAA AACTAGAAAA ATTAATGCTC ATGTTCTAAT GAGAGCAATG GGTCTATCAG ATAATGATGT AATAGATAAA TTAAGACATC CTGAATACTA TAAGAAATCT ATTGAAGCCG CTGATGAAGA AGGTATTAGC TCCGAGGATC AGGCTTTGCT TGAGCTATAT AAAAAGTTGC GACCTGGCGA ACCACCCTCA GTCAGTGGTG GTCAGCAATT GCTTCAGAGC AGGTTCTTTG ACCCAAAACG TTATGATTTA GGACGTGTCG GTAGATATAA AATAAACAAG AAATTACGTT TAACTATACC TGACTCTGTA AGAACTCTGA CTCATGAGGA TGTTCTTTCT ACTATTGATT ATTTAATTAA TCTTGAGCTA GATGTTGGTG GGGCGACTTT AGATGATATT GACCATCTTG GTAATAGAAG AGTTAGATCT GTTGGAGAGT TATTGCAAAA TCAAGTTCGT GTTGGTTTAA ATCGTCTTGA AAGAATAATC AAGGAAAGAA TGACAGTAGG TGAAACAGAT TCATTGACTC CAGCACAACT AGTAAACCCC AAGCCTCTAG TGGCTGCTGT TAAAGAGTTC TTTGGATCAA GTCAACTTAG TCAGTTTATG GATCAGACCA ATCCTTTGGC TGAATTAACT CATAAAAGGC GGATTTCGGC ATTAGGACCA GGTGGTTTAA CTAGAGAAAG AGCAGGTTTT GCGGTAAGAG ATATTCATCC TTCTCATTAT GGACGCTTGT GTCCTATTGA AACTCCAGAG GGGCCAAATG CTGGTCTAAT TAATTCCTTA GCTACTCATG CCAGGGTTAA TGATTATGGA TTCATTGAAA CGCCTTTCTG GAAAGTAGAT AAAGGAAGAG TTATTAAGGA GGGAAAGCCA ATTTATCTAT CTGCTGATCT AGAAGATGAA TGCAGAGTTG CGCCTGGTGA TGTTGCTACG GATAAAGAAG GCATGATTGT TGCTGATTTG ATTCCAGTTA GATATAGGCA GGATTTTGAG AAGGTTCCTC CTGAGCAAGT GGATTATGTG CAACTTTCAC CAGTGCAAGT GATTTCTGTA GCGACATCTT TAATTCCATT TTTAGAACAT GACGATGCAA ACAGGGCTTT AATGGGTTCT AATATGCAGC GTCAAGCTGT CCCTCTTCTG CGTCCAGAAC GACCTTTAGT TGGGACAGGT TTGGAAACAC AGGTTGCTAG AGACTCCGGG ATGGTCCCTA TCTCTCAAGT CAATGGAACA GTAACTTATG TAGACGCTAA TATCATTGTT 
GTTACTGATG AAGAAGGTTC CGAGCATCAT CATTCTTTGC AGAAATATCA ACGTTCTAAT CAGGACACAT GCTTGAATCA AAGACCCATT GTTCATAATG GCGATCCGGT AATTATTGGT CAGGTTCTTG CAGACGGATC TGCATGTGAA GGAGGAGAAA TAGCCTTAGG TCAAAATGTC TTAATCGCTT ATATGCCTTG GGAGGGTTAT AACTATGAGG ATGCAATTTT AGTCAGTGAG AGATTAGTAA AAGATGATCT ATATACTTCA GTGCATATTG AAAAATATGA GATTGAAGCT CGACAAACAA AACTAGGCCC AGAGGAAATA ACGAGGGAAA TTCCCAACAT AGCCGAAGAA AGTTTGGGTA ATCTTGATGA AATGGGCATT ATTAGGATTG GTGCTTTTGT TGAGAGCGGA GATATTCTTG TAGGAAAAGT TACCCCTAAG GGGGAGTCTG ATCAACCTCC TGAAGAGAAG CTTTTAAGGG CTATATTTGG TGAAAAAGCA AGAGATGTAA GAGATAATTC ATTGAGAGTT CCTAGTACTG AAAGAGGTCG AGTCGTTGAT GTGCGTATTT ATACGAGAGA ACAGGGTGAT GAACTTCCGC CTGGAGCAAA TATGGTAGTT CGAGTTTATG TGGCACAAAG AAGAAAAATA CAAGTAGGGG ATAAGATGGC AGGACGCCAT GGGAATAAGG GAATTATTAG TCGGATACTT CCTAGAGAAG ACATGCCTTA TTTGCCTGAT GGAACCCCAG TAGATATTGT CCTTAACCCT CTTGGTGTGC CAAGTAGAAT GAATGTTGGA CAAGTTTTTG AGTTGTTGAT GGGTTGGGCA GCCTCGAATT TGGATTGCAG GGTTAAAGTT GTTCCATTCG ACGAAATGTA TGGTGCTGAA AAGTCCTATC AGACAGTAAC TGCTTACCTT AAAGAGGCCG CTAGTTTGCC AGGCAAAGAA TGGGTCTACA ACCCAGAAGA CCCAGGAAAG CTACTTCTAA GAGATGGAAG GACTGGAGAA CCTTTTGATC AGCCTGTTGC AGTTGGCTAC TCACATTTCC TTAAATTAGT TCACTTAGTA GACGATAAAA TTCACGCTCG CTCTACTGGT CCGTACTCTC TAGTTACTCA GCAACCTTTA GGAGGTAAAG CTCAACAAGG AGGACAACGT TTAGGAGAAA TGGAGGTTTG GGCTTTAGAG GCTTATGGAG CAGCATATAC TTTGCAAGAA TTGTTGACTG TCAAGTCTGA TGATATGCAA GGTCGAAATG AGGCGCTTAA TGCAATTGTC AAAGGCAAGC CAATTCCAAG GCCAGGAACC CCAGAATCGT TCAAAGTTTT AATGCGAGAA CTTCAGTCTC TTGGACTAGA CATTGGAGTG TATACAGATG AAGGGAAAGA AGTTGATTTG ATGCAGGATG TTAATCCACG TCGAAGTACT CCTAGTAGAC CAACTTATGA ATCATTGGGA TCAGATTATC AAGAGGATTA A
|
Protein sequence | MSRSAIQVAK TATHLPDLVE VQRASFKWFL EKGLIEELEN FSPITDYTGK LELHFIGSEY RLKRPRHDVE EAKKRDATFA SQMYVTCRLI NKETGEIKEQ EVFIGELPLM TERGTFIING AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK INAHVLMRAM GLSDNDVIDK LRHPEYYKKS IEAADEEGIS SEDQALLELY KKLRPGEPPS VSGGQQLLQS RFFDPKRYDL GRVGRYKINK KLRLTIPDSV RTLTHEDVLS TIDYLINLEL DVGGATLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP KPLVAAVKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY GRLCPIETPE GPNAGLINSL ATHARVNDYG FIETPFWKVD KGRVIKEGKP IYLSADLEDE CRVAPGDVAT DKEGMIVADL IPVRYRQDFE KVPPEQVDYV QLSPVQVISV ATSLIPFLEH DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISQVNGT VTYVDANIIV VTDEEGSEHH HSLQKYQRSN QDTCLNQRPI VHNGDPVIIG QVLADGSACE GGEIALGQNV LIAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNIAEE SLGNLDEMGI IRIGAFVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV PSTERGRVVD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL PREDMPYLPD GTPVDIVLNP LGVPSRMNVG QVFELLMGWA ASNLDCRVKV VPFDEMYGAE KSYQTVTAYL KEAASLPGKE WVYNPEDPGK LLLRDGRTGE PFDQPVAVGY SHFLKLVHLV DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ GRNEALNAIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDEGKEVDL MQDVNPRRST PSRPTYESLG SDYQED
|
| |
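As a quick consistency check between the gene and protein sequences, the first codons of the gene sequence can be translated by hand. The sketch below is a minimal illustration, not the database's method: it builds the codon table from the conventional TCAG ordering, relies on translation table 11 using the same elongation codon assignments as the standard code, and translates only the first 30 bases copied from the record. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace, until a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence, copied from the record.
    first_30_bases = "ATGAGTAGAA GCGCGATTCA GGTCGCTAAG"
    print(translate(first_30_bases))
```

The result can be compared against the first ten residues of the protein sequence field (`MSRSAIQVAK`). The listed gene sequence is already given in coding orientation (it begins with `ATG`), so no reverse complement is needed despite the minus strand annotation.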