Gene P9211_16071 details


Gene Information

Locus tag: P9211_16071
Symbol: rpoB
ID: 5730824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1436468
End bp: 1439758
Gene Length: 3291 bp
Protein Length: 1096 aa
Translation table: 11
GC content: 41%
IMG OID: 641285985
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_001551492
Protein GI: 159904148
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR02013] DNA-directed RNA polymerase, beta subunit
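
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1436468, 1439758)
print(length, protein_length(length))  # 3291 1096
```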


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.614693
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGAA GCGCGATTCA GGTCGCTAAG ACCGCTACAC ACCTGCCTGA CTTGGTTGAG 
GTGCAGCGTG CAAGCTTTAA GTGGTTTTTG GAAAAAGGTT TAATTGAGGA GCTTGAGAAC
TTCTCACCAA TTACTGATTA CACGGGCAAG CTAGAGCTAC ATTTTATTGG CAGTGAATAT
CGGCTTAAGC GGCCTCGCCA TGATGTGGAA GAAGCTAAAA AGCGCGATGC AACTTTTGCT
TCCCAAATGT ATGTAACTTG CCGTCTTATT AATAAAGAAA CTGGAGAAAT AAAAGAACAA
GAAGTTTTTA TTGGTGAATT GCCATTAATG ACTGAAAGAG GAACATTTAT CATTAATGGA
GCAGAAAGAG TAATTGTTAA TCAGATAGTA AGAAGCCCTG GCGTATATTT TAAAGATGAG
CAAGATAAAA ATGGCCGAAG AACTTACAAT GCAAGTGTTA TTCCTAATCG CGGGGCATGG
TTAAAGTTTG AAACAGATAA GAATGATTTA CTTCATGTTC GTGTAGATAA AACTAGAAAA
ATTAATGCTC ATGTTCTAAT GAGAGCAATG GGTCTATCAG ATAATGATGT AATAGATAAA
TTAAGACATC CTGAATACTA TAAGAAATCT ATTGAAGCCG CTGATGAAGA AGGTATTAGC
TCCGAGGATC AGGCTTTGCT TGAGCTATAT AAAAAGTTGC GACCTGGCGA ACCACCCTCA
GTCAGTGGTG GTCAGCAATT GCTTCAGAGC AGGTTCTTTG ACCCAAAACG TTATGATTTA
GGACGTGTCG GTAGATATAA AATAAACAAG AAATTACGTT TAACTATACC TGACTCTGTA
AGAACTCTGA CTCATGAGGA TGTTCTTTCT ACTATTGATT ATTTAATTAA TCTTGAGCTA
GATGTTGGTG GGGCGACTTT AGATGATATT GACCATCTTG GTAATAGAAG AGTTAGATCT
GTTGGAGAGT TATTGCAAAA TCAAGTTCGT GTTGGTTTAA ATCGTCTTGA AAGAATAATC
AAGGAAAGAA TGACAGTAGG TGAAACAGAT TCATTGACTC CAGCACAACT AGTAAACCCC
AAGCCTCTAG TGGCTGCTGT TAAAGAGTTC TTTGGATCAA GTCAACTTAG TCAGTTTATG
GATCAGACCA ATCCTTTGGC TGAATTAACT CATAAAAGGC GGATTTCGGC ATTAGGACCA
GGTGGTTTAA CTAGAGAAAG AGCAGGTTTT GCGGTAAGAG ATATTCATCC TTCTCATTAT
GGACGCTTGT GTCCTATTGA AACTCCAGAG GGGCCAAATG CTGGTCTAAT TAATTCCTTA
GCTACTCATG CCAGGGTTAA TGATTATGGA TTCATTGAAA CGCCTTTCTG GAAAGTAGAT
AAAGGAAGAG TTATTAAGGA GGGAAAGCCA ATTTATCTAT CTGCTGATCT AGAAGATGAA
TGCAGAGTTG CGCCTGGTGA TGTTGCTACG GATAAAGAAG GCATGATTGT TGCTGATTTG
ATTCCAGTTA GATATAGGCA GGATTTTGAG AAGGTTCCTC CTGAGCAAGT GGATTATGTG
CAACTTTCAC CAGTGCAAGT GATTTCTGTA GCGACATCTT TAATTCCATT TTTAGAACAT
GACGATGCAA ACAGGGCTTT AATGGGTTCT AATATGCAGC GTCAAGCTGT CCCTCTTCTG
CGTCCAGAAC GACCTTTAGT TGGGACAGGT TTGGAAACAC AGGTTGCTAG AGACTCCGGG
ATGGTCCCTA TCTCTCAAGT CAATGGAACA GTAACTTATG TAGACGCTAA TATCATTGTT
GTTACTGATG AAGAAGGTTC CGAGCATCAT CATTCTTTGC AGAAATATCA ACGTTCTAAT
CAGGACACAT GCTTGAATCA AAGACCCATT GTTCATAATG GCGATCCGGT AATTATTGGT
CAGGTTCTTG CAGACGGATC TGCATGTGAA GGAGGAGAAA TAGCCTTAGG TCAAAATGTC
TTAATCGCTT ATATGCCTTG GGAGGGTTAT AACTATGAGG ATGCAATTTT AGTCAGTGAG
AGATTAGTAA AAGATGATCT ATATACTTCA GTGCATATTG AAAAATATGA GATTGAAGCT
CGACAAACAA AACTAGGCCC AGAGGAAATA ACGAGGGAAA TTCCCAACAT AGCCGAAGAA
AGTTTGGGTA ATCTTGATGA AATGGGCATT ATTAGGATTG GTGCTTTTGT TGAGAGCGGA
GATATTCTTG TAGGAAAAGT TACCCCTAAG GGGGAGTCTG ATCAACCTCC TGAAGAGAAG
CTTTTAAGGG CTATATTTGG TGAAAAAGCA AGAGATGTAA GAGATAATTC ATTGAGAGTT
CCTAGTACTG AAAGAGGTCG AGTCGTTGAT GTGCGTATTT ATACGAGAGA ACAGGGTGAT
GAACTTCCGC CTGGAGCAAA TATGGTAGTT CGAGTTTATG TGGCACAAAG AAGAAAAATA
CAAGTAGGGG ATAAGATGGC AGGACGCCAT GGGAATAAGG GAATTATTAG TCGGATACTT
CCTAGAGAAG ACATGCCTTA TTTGCCTGAT GGAACCCCAG TAGATATTGT CCTTAACCCT
CTTGGTGTGC CAAGTAGAAT GAATGTTGGA CAAGTTTTTG AGTTGTTGAT GGGTTGGGCA
GCCTCGAATT TGGATTGCAG GGTTAAAGTT GTTCCATTCG ACGAAATGTA TGGTGCTGAA
AAGTCCTATC AGACAGTAAC TGCTTACCTT AAAGAGGCCG CTAGTTTGCC AGGCAAAGAA
TGGGTCTACA ACCCAGAAGA CCCAGGAAAG CTACTTCTAA GAGATGGAAG GACTGGAGAA
CCTTTTGATC AGCCTGTTGC AGTTGGCTAC TCACATTTCC TTAAATTAGT TCACTTAGTA
GACGATAAAA TTCACGCTCG CTCTACTGGT CCGTACTCTC TAGTTACTCA GCAACCTTTA
GGAGGTAAAG CTCAACAAGG AGGACAACGT TTAGGAGAAA TGGAGGTTTG GGCTTTAGAG
GCTTATGGAG CAGCATATAC TTTGCAAGAA TTGTTGACTG TCAAGTCTGA TGATATGCAA
GGTCGAAATG AGGCGCTTAA TGCAATTGTC AAAGGCAAGC CAATTCCAAG GCCAGGAACC
CCAGAATCGT TCAAAGTTTT AATGCGAGAA CTTCAGTCTC TTGGACTAGA CATTGGAGTG
TATACAGATG AAGGGAAAGA AGTTGATTTG ATGCAGGATG TTAATCCACG TCGAAGTACT
CCTAGTAGAC CAACTTATGA ATCATTGGGA TCAGATTATC AAGAGGATTA A
 
Protein sequence
MSRSAIQVAK TATHLPDLVE VQRASFKWFL EKGLIEELEN FSPITDYTGK LELHFIGSEY 
RLKRPRHDVE EAKKRDATFA SQMYVTCRLI NKETGEIKEQ EVFIGELPLM TERGTFIING
AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK
INAHVLMRAM GLSDNDVIDK LRHPEYYKKS IEAADEEGIS SEDQALLELY KKLRPGEPPS
VSGGQQLLQS RFFDPKRYDL GRVGRYKINK KLRLTIPDSV RTLTHEDVLS TIDYLINLEL
DVGGATLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP
KPLVAAVKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY
GRLCPIETPE GPNAGLINSL ATHARVNDYG FIETPFWKVD KGRVIKEGKP IYLSADLEDE
CRVAPGDVAT DKEGMIVADL IPVRYRQDFE KVPPEQVDYV QLSPVQVISV ATSLIPFLEH
DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISQVNGT VTYVDANIIV
VTDEEGSEHH HSLQKYQRSN QDTCLNQRPI VHNGDPVIIG QVLADGSACE GGEIALGQNV
LIAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNIAEE
SLGNLDEMGI IRIGAFVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV
PSTERGRVVD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL
PREDMPYLPD GTPVDIVLNP LGVPSRMNVG QVFELLMGWA ASNLDCRVKV VPFDEMYGAE
KSYQTVTAYL KEAASLPGKE WVYNPEDPGK LLLRDGRTGE PFDQPVAVGY SHFLKLVHLV
DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ
GRNEALNAIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDEGKEVDL MQDVNPRRST
PSRPTYESLG SDYQED
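
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below, in Python, strips the 10-base blocking, computes a GC fraction, and translates codons until the first stop. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G); table 11 uses
# the same amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGAGTAGAA GCGCGATTCA GGTCGCTAAG ACCGCTACAC ACCTGCCTGA CTTGGTTGAG"
print(translate(first_line))  # MSRSAIQVAKTATHLPDLVE
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` can be compared against the GC content reported in the gene information.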