Gene PMN2A_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1016 
SymbolrpoB 
ID3606402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1509791 
End bp1513078 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content39% 
IMG OID637687885 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_292209 
Protein GI72382854 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0988562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGAA GCGCGATTCA GGTAGCCAAA GCAGCTACAT ATCTGCCTGA TCTGGTTGAG 
GTGCAAAGAT CAAGTTTTAA GTGGTTTTTG GATAAAGGCT TAATAGAAGA ATTAGACAAT
TTTTCTCCCA TTACTGACTA TACAGGTAAG CTTGAGCTGC ATTTTATTGG AGCAGAATAT
AAGCTTAAGC GTCCTAGGCA TGATGTAGAA GAAGCAAAAA GAAGAGATGC AACGTTTGCT
TCTCAAATGT ATGTCACTTG TCGCTTAGTG AATAAAGAAA CCGGAGAGAT TAAAGAACAA
GAAGTTTTTA TAGGTGAACT ACCTTTAATG ACTGAACGTG GAACTTTTAT AATTAATGGT
GCTGAGAGAG TAATAGTCAA TCAAATCGTT CGCAGTCCAG GAGTTTATTT CAAAGATGAG
CAAGATAAAA ATGGTAGGAG AACTTACAAT GCAAGCGTTA TTCCTAATCG CGGAGCATGG
TTAAAGTTTG AAACTGATAA AAACGATTTG CTGCATGTTC GTGTAGATAA AACAAGAAAA
ATAAATGCCC ATGTTTTAAT GAGGGCAATG GGTTTATCAG ATAATGACGT TATAGATAAA
TTACGACATC CTGAGTTTTA TAAAAAGTCA ATTGATGCGG CAAATGAAGA AGGTATAAGT
TCAGAAGACC AAGCATTGTT AGAGCTTTAT AAAAAGTTAA GACCAGGTGA ACCTCCATCA
GTAAGTGGTG GCCAACAGCT ACTCCAAACA AGATTTTTTG ATCCAAAACG ATATGATTTA
GGAAGGGTTG GTAGATATAA AATAAATAAG AAATTACGCT TAACTATTCC TGATAATTTA
AGAACACTAA CCAACGAAGA TGTTCTTTCT ACCTTAGATT ATTTAATCAA TTTAGAATTA
GATGTTGGTG GAGCTACTTT GGATGATATT GATCATTTAG GTAATAGAAG AGTTAGGTCA
GTTGGTGAAC TTCTACAAAA TCAAGTTCGA GTTGGTTTAA ATAGACTTGA AAGGATAATT
AAAGAGAGGA TGACCGTTGG TGAGACAGAT TCACTTACCC CTGCGCAATT GGTAAATCCA
AAACCTTTAG TAGCAGCAAT AAAAGAATTT TTTGGTTCAA GTCAATTAAG TCAATTCATG
GATCAAACCA ATCCATTAGC TGAATTAACT CATAAAAGAC GTATCTCTGC TTTGGGACCA
GGAGGTTTAA CTCGTGAAAG AGCTGGCTTT GCCGTTCGGG ATATTCATCC ATCTCACTAC
GGAAGACTTT GTCCGATTGA AACACCTGAA GGACCAAATG CAGGACTGAT TAATTCGTTA
GCAACTCACG CACGAGTAAA TGAATACGGT TTTATTGAGA CACCATTTTG GAAAGTGGAA
AATGGTCGAT TAATCAAGGA AGGTGATCCT ATTTATCTTT CTGCAGATCT AGAAGATGAA
TGTAGGGTTG CGCCAGGTGA TGTGGCTACA AACGAAGAAG GGAAAATAAT GGCAGAACTT
GTTCCAGTTA GGTATCGACA AGATTTTGAA ACGGTTTCTC CTGAACAAGT TGATTATGTC
CAACTATCAC CTGTTCAGGT TATTTCTGTG GCTGCATCAT TAATTCCATT TTTGGAACAC
GATGATGCTA ACAGGGCTTT GATGGGTTCC AACATGCAAA GACAAGCAGT TCCTCTTTTG
CGTCCAGAAA GGCCTTTAGT TGGGACTGGT TTAGAGACTC AAGTTGCTAG AGACTCTGGC
ATGGTTCCAA TTTCAAAAGT AAATGGAAAA GTGACTTATG TTGACGCCAA TGCAATTGTC
GTTACTGATG ACGAGGGAAA TGATCACACG CATTACTTGC AAAAATATCA GAGATCTAAT
CAAGATACTT GTTTAAACCA TAGGCCTATA GTGTTTAATG GTGACCCAGT AATTGTTGGT
CAAGTTTTAG CTGATGGTTC GGCATGTGAA GGAGGGGAAA TAGCTCTTGG TCAAAATGTA
TTAATTGCAT ACATGCCATG GGAAGGATAC AACTATGAAG ATGCAATTCT TGTTAGTGAA
AGGTTAGTTA AAGATGATCT ATATACCTCT GTCCATATAG AAAAATATGA AATAGAAGCT
CGTCAAACCA AGCTAGGTCC TGAAGAAATA ACAAGAGAAA TCCCAAATGT TTCAGAAGAA
AACCTTGGGA ATTTGGATGA AATGGGAATA ATACGAATAG GAGCTTACGT TGAGAGTGGT
GACATACTTG TTGGAAAAGT AACCCCTAAA GGAGAATCAG ATCAGCCTCC CGAGGAAAAA
CTTTTAAGAG CTATTTTTGG AGAAAAAGCT AGAGATGTAA GAGATAACTC TCTCCGAGTT
CCTTCAACTG AAAGAGGAAG AGTAGTTGAT GTAAGGATCT ATACAAGAGA GCAAGGTGAT
GAGTTACCAC CTGGCGCAAA TATGGTTGTA AGAGTCTATG TAGCGCAACG CAGGAAAATT
CAAGTTGGCG ATAAAATGGC AGGAAGGCAT GGCAACAAAG GGATTATAAG TAGAATACTG
CCAAGAGAGG ATATGCCTTA TCTTCCTGAT GGCACACCTG TAGACATATG CCTCAATCCA
CTTGGGGTTC CAAGCAGAAT GAATGTAGGA CAAGTCTTTG AGCTTCTTAT GGGTTGGGCT
GCCTCAAACT TGGATTGCAG AGTTAAAATT GTCCCATTTG ATGAGATGTA TGGACCAGAA
ATGTCTAATC AGACTGTACA AGCCTATTTA AAAAAGGCTG CAAAGCAGCC AGGTAAATCA
TGGGTATACA ACCCTAAGGA CCCTGGGAAG TTACTCCTTA AGGATGGTCG AACCGGTGAA
CCTTTTGATC AACCAGTTGC CGTTGGCTAC GCTCACTTCC TAAAACTTGT GCATCTAGTC
GATGATAAAA TTCATGCTCG ATCAACAGGT CCATATTCTT TAGTTACTCA ACAGCCTCTA
GGTGGAAAGG CTCAACAAGG AGGACAGAGG CTTGGGGAAA TGGAGGTATG GGCACTTGAG
GCTTATGGGG CAGCATATAC TTTGCAAGAA CTTTTAACTG TAAAGTCTGA CGACATGCAA
GGTCGGAATG AGGCTCTTAA TTCGATTGTT AAAGGCAAGC CAATTCCAAG GCCTGGGACT
CCAGAATCTT TCAAGGTTTT AATGAGAGAA CTTCAGTCAT TGGGATTAGA TATTGGAGTT
TATACAGATG ATGGAAAAGA AGTTGATCTA ATGCAAGATG TCAATCCTCG TAGAAGCACG
CCAAGTAGAC CTACCTATGA ATCATTAGGT AAAGAATACG AGGAGTAA
 
Protein sequence
MSRSAIQVAK AATYLPDLVE VQRSSFKWFL DKGLIEELDN FSPITDYTGK LELHFIGAEY 
KLKRPRHDVE EAKRRDATFA SQMYVTCRLV NKETGEIKEQ EVFIGELPLM TERGTFIING
AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK
INAHVLMRAM GLSDNDVIDK LRHPEFYKKS IDAANEEGIS SEDQALLELY KKLRPGEPPS
VSGGQQLLQT RFFDPKRYDL GRVGRYKINK KLRLTIPDNL RTLTNEDVLS TLDYLINLEL
DVGGATLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP
KPLVAAIKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY
GRLCPIETPE GPNAGLINSL ATHARVNEYG FIETPFWKVE NGRLIKEGDP IYLSADLEDE
CRVAPGDVAT NEEGKIMAEL VPVRYRQDFE TVSPEQVDYV QLSPVQVISV AASLIPFLEH
DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISKVNGK VTYVDANAIV
VTDDEGNDHT HYLQKYQRSN QDTCLNHRPI VFNGDPVIVG QVLADGSACE GGEIALGQNV
LIAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNVSEE
NLGNLDEMGI IRIGAYVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV
PSTERGRVVD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL
PREDMPYLPD GTPVDICLNP LGVPSRMNVG QVFELLMGWA ASNLDCRVKI VPFDEMYGPE
MSNQTVQAYL KKAAKQPGKS WVYNPKDPGK LLLKDGRTGE PFDQPVAVGY AHFLKLVHLV
DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ
GRNEALNSIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDDGKEVDL MQDVNPRRST
PSRPTYESLG KEYEE