Gene P9303_04361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_04361 
SymbolrpoB 
ID4776363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp436973 
End bp440266 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content51% 
IMG OID640085940 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001016453 
Protein GI124022146 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA GCGCGATTCA GGTCGCAAAG ACCGCCACCT ACCTGCCTGA TCTCGTTGAA 
GTTCAGCGGG CGAGCTTCAA GTGGTTTCTT GATGAGGGCT TGATCGAGGA GTTGGATAGC
TTTTCTCCGA TCACTGACTA CACAGGCAAG CTTGAACTCC ACTTCGTTGG CAATGAATAC
CGATTGAAGC GGCCTCGCCA CGATGTTGAA GAGGCCAAGC GCCGTGATGC GACTTTTGCT
TCACAGATGT ATGTGACCTG TCGCTTAGTG AATAAGGAAA CAGGTGAGAT TAAAGAGCAG
GAGGTATTTA TTGGCGAGCT TCCTCTGATG ACTGAGCGAG GTACGTTCAT TATCAATGGT
GCAGAGAGGG TTATTGTTAA TCAGATTGTG CGTAGCCCAG GGGTATATTT TAAGGATGAA
CAGGATAAGA ATGGACGACG AACATACAAT GCCAGTGTTA TTCCTAATCG TGGTGCCTGG
CTAAAGTTCG AAACAGATAA AAACGACCTT CTGCATGTGC GTGTTGATAA AACACGCAAA
ATTAATGCCC ATGTTTTAAT GCGGGCCATG GGGCTATCAG ATAACGATGT AATCGATAAG
CTTCGTCACC CTGAGTATTA TAAGAAGTCT ATTGAGGCTG CAAATGAGGA AGGAATTAGT
TCAGAGGATC AGGCTTTGCT TGAGCTTTAT AAAAAGCTTC GTCCTGGTGA GCCTCCTTCA
GTAAGCGGTG GACAACAGCT TCTGCAGACC CGATTTTTTG ATCCCAAGCG TTATGACCTT
GGCCGAGTTG GCCGTTACAA GATTAATAAA AAGCTGCGTC TAACAATCCC GGATACGGTG
CGTACCCTCA CCCATGAGGA TGTGCTCTCA ACCCTTGATT ATCTGATCAA TTTAGAACTC
GATGTGGGTG GTGCCAGCCT GGATGACATT GATCACCTGG GCAATCGTCG AGTGCGTTCA
GTGGGTGAAC TTTTGCAGAA CCAGGTCCGG GTTGGTCTCA ATCGCCTGGA GAGGATCATC
AAAGAACGGA TGACTGTGGG TGAGACCGAT TCGCTCACGC CAGCTCAGTT GGTGAACCCC
AAGCCTCTCG TTGCAGCTGT TAAGGAGTTC TTCGGTTCCA GTCAGCTGAG TCAGTTCATG
GATCAGACGA ATCCATTGGC TGAGCTCACC CACAAACGTC GTATCTCAGC TCTGGGGCCA
GGGGGCCTAA CTAGGGAGCG TGCTGGCTTC GCGGTGCGTG ATATTCACCC CTCCCATTAT
GGCCGTCTCT GCCCAATTGA GACACCGGAA GGCCCGAATG CAGGTTTGAT CAATTCACTG
GCTACCCACG CTCGGGTCAA TCAGTATGGC TTCATTGAGA CTCCTTTCTG GAAAGTTGAG
AACGGTCGCC TGATCAAGGA GGGAGACCCC ATCTATTTAT CAGCCGATCT TGAGGATGAG
TGTCGTGTGG CTCCTGGTGA TGTGGCCACT GATGCCGATG GACAGATTCT CGCTGAATTA
ATTCCGGTGC GTTATCGCCA GGACTTCGAA AAAGTTCCAC CTGAGCAGGT GGATTATGTG
CAGCTCTCGC CGGTGCAGGT GATCTCTGTC GCCACCTCCT TGATTCCTTT CCTCGAGCAC
GATGACGCCA ATAGGGCTCT GATGGGATCA AATATGCAAC GGCAGGCGGT ACCACTGTTG
CGGCCTGAGC GGCCTTTGGT GGGGACGGGC CTCGAAACTC AGGTGGCTCG TGATTCCGGG
ATGGTGCCCA TCTCCCGTGT TAATGGCATG GTCACATTCG TGGATGCCAC TGCGATCATC
GTCCGGGATG AGGATGGGGT TGATCACACC CACTACCTGC AGAAGTATCA GCGTTCAAAT
CAGGACACCT GTCTCAATCA GCGTCCGATT GTCTGCCAGG GTGATCCGGT GATCGTGGGT
CAGGTTCTTG CGGATGGTTC TGCTTGTGAA GGCGGTGAAA TTGCTCTAGG TCAGAACGTT
TTGGTCGCTT ATATGCCTTG GGAGGGTTAC AACTACGAGG ATGCTATTCT TGTCAGCGAA
CGCCTAGTTA AGGACGACCT CTACACCTCG GTGCATATCG AGAAGTATGA GATCGAGGCA
CGACAGACAA AGCTTGGTCC CGAGGAGATC ACCAGGGAGA TACCCAATGT TGCCGAGGAA
AGCCTTGGCA ATCTTGATGA AATGGGCATC ATTCGCATCG GTGCTTTTGT CGAGAGCGGC
GACATTCTGG TGGGCAAAGT GACACCTAAA GGTGAGTCAG ATCAGCCTCC AGAAGAGAAG
CTTTTGCGTG CCATTTTTGG TGAAAAAGCG CGCGATGTGC GTGATAACTC TCTACGCGTT
CCCAGTACTG AACGAGGACG AGTTGTGGAC GTTCGTATTT ACACTCGTGA ACAGGGTGAT
GAGCTTCCCC CTGGCGCCAA TATGGTTGCT CGAGTGTATG TGGCTCAGCG TCGCAAGATC
CAGGTTGGCG ACAAGATGGC CGGTCGCCAC GGCAATAAAG GCATCATCAG TCGCATTCTT
CCCCGCGAAG ATATGCCTTT TCTGCCCGAT GGCACTCCAG TTGACATCGT TTTGAATCCT
TTGGGCGTGC CCAGCCGGAT GAATGTGGGT CAGGTGTTTG AGTGCTTAAT GGGATGGGCC
GCAGCCAATC TCGACTGTCG GGTCAAGGTG GTGCCTTTTG ATGAAATGTA TGGAGCTGAA
AAGTCCCAGC AGACAGTGGA GGCTTATCTC AAGGAAGCTG CCAAACAGCC AGGTAAGGAG
TGGGTCTACA ACCCTGAAAA TCCTGGCAAG CTTCAATTGA TTGATGGACG CTCGGGTGAA
CCTTTCGACC AGCCGGTGAC CGTTGGTTAT GCACAGATTC TGAAGCTTGT TCATTTGGTT
GATGACAAGA TCCATGCTCG CTCAACAGGT CCCTATTCAC TGGTAACCCA GCAACCTCTT
GGCGGTAAGG CTCAACAAGG TGGCCAACGT TTAGGTGAGA TGGAGGTGTG GGCTCTTGAG
GCCTATGGCG CCGCTTACAC CCTGCAGGAG CTGCTTACTG TTAAATCAGA CGATATGCAG
GGTCGGAATG AAGCCCTCAA CGCAATCGTC AAGGGCAAGC CTATCCCCAG GCCGGGGACA
CCCGAGTCTT TCAAGGTATT GATGCGTGAG CTTCAGTCTC TAGGGCTCGA TATCGCTGTT
TATACCGATG AAGGCAAAGA AGTGGATCTG ATGCAGGATG TGAATCCTCG CCGTAGTACT
CCTAGTCGAC CCACTTACGA ATCCCTTGGA GTAGCGGATT ACGACGAAGA TTAA
 
Protein sequence
MSSSAIQVAK TATYLPDLVE VQRASFKWFL DEGLIEELDS FSPITDYTGK LELHFVGNEY 
RLKRPRHDVE EAKRRDATFA SQMYVTCRLV NKETGEIKEQ EVFIGELPLM TERGTFIING
AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK
INAHVLMRAM GLSDNDVIDK LRHPEYYKKS IEAANEEGIS SEDQALLELY KKLRPGEPPS
VSGGQQLLQT RFFDPKRYDL GRVGRYKINK KLRLTIPDTV RTLTHEDVLS TLDYLINLEL
DVGGASLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP
KPLVAAVKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY
GRLCPIETPE GPNAGLINSL ATHARVNQYG FIETPFWKVE NGRLIKEGDP IYLSADLEDE
CRVAPGDVAT DADGQILAEL IPVRYRQDFE KVPPEQVDYV QLSPVQVISV ATSLIPFLEH
DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISRVNGM VTFVDATAII
VRDEDGVDHT HYLQKYQRSN QDTCLNQRPI VCQGDPVIVG QVLADGSACE GGEIALGQNV
LVAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNVAEE
SLGNLDEMGI IRIGAFVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV
PSTERGRVVD VRIYTREQGD ELPPGANMVA RVYVAQRRKI QVGDKMAGRH GNKGIISRIL
PREDMPFLPD GTPVDIVLNP LGVPSRMNVG QVFECLMGWA AANLDCRVKV VPFDEMYGAE
KSQQTVEAYL KEAAKQPGKE WVYNPENPGK LQLIDGRSGE PFDQPVTVGY AQILKLVHLV
DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ
GRNEALNAIV KGKPIPRPGT PESFKVLMRE LQSLGLDIAV YTDEGKEVDL MQDVNPRRST
PSRPTYESLG VADYDED