Gene Information |
Locus tag | PMN2A_1016 |
Symbol | rpoB |
ID | 3606402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 1509791 |
End bp | 1513078 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637687885 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_292209 |
Protein GI | 72382854 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
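The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with the listed values. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1509791
END_BP = 1513078
GENE_LENGTH_BP = 3288
PROTEIN_LENGTH_AA = 1095


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field does not affect this check, because the span length is the same on either strand.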
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0988562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGAA GCGCGATTCA GGTAGCCAAA GCAGCTACAT ATCTGCCTGA TCTGGTTGAG GTGCAAAGAT CAAGTTTTAA GTGGTTTTTG GATAAAGGCT TAATAGAAGA ATTAGACAAT TTTTCTCCCA TTACTGACTA TACAGGTAAG CTTGAGCTGC ATTTTATTGG AGCAGAATAT AAGCTTAAGC GTCCTAGGCA TGATGTAGAA GAAGCAAAAA GAAGAGATGC AACGTTTGCT TCTCAAATGT ATGTCACTTG TCGCTTAGTG AATAAAGAAA CCGGAGAGAT TAAAGAACAA GAAGTTTTTA TAGGTGAACT ACCTTTAATG ACTGAACGTG GAACTTTTAT AATTAATGGT GCTGAGAGAG TAATAGTCAA TCAAATCGTT CGCAGTCCAG GAGTTTATTT CAAAGATGAG CAAGATAAAA ATGGTAGGAG AACTTACAAT GCAAGCGTTA TTCCTAATCG CGGAGCATGG TTAAAGTTTG AAACTGATAA AAACGATTTG CTGCATGTTC GTGTAGATAA AACAAGAAAA ATAAATGCCC ATGTTTTAAT GAGGGCAATG GGTTTATCAG ATAATGACGT TATAGATAAA TTACGACATC CTGAGTTTTA TAAAAAGTCA ATTGATGCGG CAAATGAAGA AGGTATAAGT TCAGAAGACC AAGCATTGTT AGAGCTTTAT AAAAAGTTAA GACCAGGTGA ACCTCCATCA GTAAGTGGTG GCCAACAGCT ACTCCAAACA AGATTTTTTG ATCCAAAACG ATATGATTTA GGAAGGGTTG GTAGATATAA AATAAATAAG AAATTACGCT TAACTATTCC TGATAATTTA AGAACACTAA CCAACGAAGA TGTTCTTTCT ACCTTAGATT ATTTAATCAA TTTAGAATTA GATGTTGGTG GAGCTACTTT GGATGATATT GATCATTTAG GTAATAGAAG AGTTAGGTCA GTTGGTGAAC TTCTACAAAA TCAAGTTCGA GTTGGTTTAA ATAGACTTGA AAGGATAATT AAAGAGAGGA TGACCGTTGG TGAGACAGAT TCACTTACCC CTGCGCAATT GGTAAATCCA AAACCTTTAG TAGCAGCAAT AAAAGAATTT TTTGGTTCAA GTCAATTAAG TCAATTCATG GATCAAACCA ATCCATTAGC TGAATTAACT CATAAAAGAC GTATCTCTGC TTTGGGACCA GGAGGTTTAA CTCGTGAAAG AGCTGGCTTT GCCGTTCGGG ATATTCATCC ATCTCACTAC GGAAGACTTT GTCCGATTGA AACACCTGAA GGACCAAATG CAGGACTGAT TAATTCGTTA GCAACTCACG CACGAGTAAA TGAATACGGT TTTATTGAGA CACCATTTTG GAAAGTGGAA AATGGTCGAT TAATCAAGGA AGGTGATCCT ATTTATCTTT CTGCAGATCT AGAAGATGAA TGTAGGGTTG CGCCAGGTGA TGTGGCTACA AACGAAGAAG GGAAAATAAT GGCAGAACTT GTTCCAGTTA GGTATCGACA AGATTTTGAA ACGGTTTCTC CTGAACAAGT TGATTATGTC CAACTATCAC CTGTTCAGGT TATTTCTGTG GCTGCATCAT TAATTCCATT TTTGGAACAC GATGATGCTA ACAGGGCTTT GATGGGTTCC AACATGCAAA GACAAGCAGT TCCTCTTTTG CGTCCAGAAA GGCCTTTAGT TGGGACTGGT TTAGAGACTC AAGTTGCTAG AGACTCTGGC ATGGTTCCAA TTTCAAAAGT AAATGGAAAA GTGACTTATG TTGACGCCAA TGCAATTGTC 
GTTACTGATG ACGAGGGAAA TGATCACACG CATTACTTGC AAAAATATCA GAGATCTAAT CAAGATACTT GTTTAAACCA TAGGCCTATA GTGTTTAATG GTGACCCAGT AATTGTTGGT CAAGTTTTAG CTGATGGTTC GGCATGTGAA GGAGGGGAAA TAGCTCTTGG TCAAAATGTA TTAATTGCAT ACATGCCATG GGAAGGATAC AACTATGAAG ATGCAATTCT TGTTAGTGAA AGGTTAGTTA AAGATGATCT ATATACCTCT GTCCATATAG AAAAATATGA AATAGAAGCT CGTCAAACCA AGCTAGGTCC TGAAGAAATA ACAAGAGAAA TCCCAAATGT TTCAGAAGAA AACCTTGGGA ATTTGGATGA AATGGGAATA ATACGAATAG GAGCTTACGT TGAGAGTGGT GACATACTTG TTGGAAAAGT AACCCCTAAA GGAGAATCAG ATCAGCCTCC CGAGGAAAAA CTTTTAAGAG CTATTTTTGG AGAAAAAGCT AGAGATGTAA GAGATAACTC TCTCCGAGTT CCTTCAACTG AAAGAGGAAG AGTAGTTGAT GTAAGGATCT ATACAAGAGA GCAAGGTGAT GAGTTACCAC CTGGCGCAAA TATGGTTGTA AGAGTCTATG TAGCGCAACG CAGGAAAATT CAAGTTGGCG ATAAAATGGC AGGAAGGCAT GGCAACAAAG GGATTATAAG TAGAATACTG CCAAGAGAGG ATATGCCTTA TCTTCCTGAT GGCACACCTG TAGACATATG CCTCAATCCA CTTGGGGTTC CAAGCAGAAT GAATGTAGGA CAAGTCTTTG AGCTTCTTAT GGGTTGGGCT GCCTCAAACT TGGATTGCAG AGTTAAAATT GTCCCATTTG ATGAGATGTA TGGACCAGAA ATGTCTAATC AGACTGTACA AGCCTATTTA AAAAAGGCTG CAAAGCAGCC AGGTAAATCA TGGGTATACA ACCCTAAGGA CCCTGGGAAG TTACTCCTTA AGGATGGTCG AACCGGTGAA CCTTTTGATC AACCAGTTGC CGTTGGCTAC GCTCACTTCC TAAAACTTGT GCATCTAGTC GATGATAAAA TTCATGCTCG ATCAACAGGT CCATATTCTT TAGTTACTCA ACAGCCTCTA GGTGGAAAGG CTCAACAAGG AGGACAGAGG CTTGGGGAAA TGGAGGTATG GGCACTTGAG GCTTATGGGG CAGCATATAC TTTGCAAGAA CTTTTAACTG TAAAGTCTGA CGACATGCAA GGTCGGAATG AGGCTCTTAA TTCGATTGTT AAAGGCAAGC CAATTCCAAG GCCTGGGACT CCAGAATCTT TCAAGGTTTT AATGAGAGAA CTTCAGTCAT TGGGATTAGA TATTGGAGTT TATACAGATG ATGGAAAAGA AGTTGATCTA ATGCAAGATG TCAATCCTCG TAGAAGCACG CCAAGTAGAC CTACCTATGA ATCATTAGGT AAAGAATACG AGGAGTAA
|
Protein sequence | MSRSAIQVAK AATYLPDLVE VQRSSFKWFL DKGLIEELDN FSPITDYTGK LELHFIGAEY KLKRPRHDVE EAKRRDATFA SQMYVTCRLV NKETGEIKEQ EVFIGELPLM TERGTFIING AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK INAHVLMRAM GLSDNDVIDK LRHPEFYKKS IDAANEEGIS SEDQALLELY KKLRPGEPPS VSGGQQLLQT RFFDPKRYDL GRVGRYKINK KLRLTIPDNL RTLTNEDVLS TLDYLINLEL DVGGATLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP KPLVAAIKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY GRLCPIETPE GPNAGLINSL ATHARVNEYG FIETPFWKVE NGRLIKEGDP IYLSADLEDE CRVAPGDVAT NEEGKIMAEL VPVRYRQDFE TVSPEQVDYV QLSPVQVISV AASLIPFLEH DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISKVNGK VTYVDANAIV VTDDEGNDHT HYLQKYQRSN QDTCLNHRPI VFNGDPVIVG QVLADGSACE GGEIALGQNV LIAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNVSEE NLGNLDEMGI IRIGAYVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV PSTERGRVVD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL PREDMPYLPD GTPVDICLNP LGVPSRMNVG QVFELLMGWA ASNLDCRVKI VPFDEMYGPE MSNQTVQAYL KKAAKQPGKS WVYNPKDPGK LLLKDGRTGE PFDQPVAVGY AHFLKLVHLV DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ GRNEALNSIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDDGKEVDL MQDVNPRRST PSRPTYESLG KEYEE
|
| |
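As a spot check that the gene and protein sequences above correspond, the first and last few codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the pipeline that produced the record. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"), and it uses the standard amino acid assignments, which translation table 11 is understood to share (table 11 differs in which codons may act as start codons). The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence, copied from the record.
GENE_START = "ATGAGCAGAA GCGCGATTCA GGTAGCCAAA"
GENE_END = "AAAGAATACG AGGAGTAA"

print(translate(GENE_START))  # MSRSAIQVAK, the first ten residues of the protein
print(translate(GENE_END))    # KEYEE, the last five residues of the protein
```

Passing the full gene sequence to `translate` should yield the complete protein sequence under the same assumptions.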