Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04361 |
Symbol | rpoB |
ID | 4776363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 436973 |
End bp | 440266 |
Gene Length | 3294 bp |
Protein Length | 1097 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640085940 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_001016453 |
Protein GI | 124022146 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCA GCGCGATTCA GGTCGCAAAG ACCGCCACCT ACCTGCCTGA TCTCGTTGAA GTTCAGCGGG CGAGCTTCAA GTGGTTTCTT GATGAGGGCT TGATCGAGGA GTTGGATAGC TTTTCTCCGA TCACTGACTA CACAGGCAAG CTTGAACTCC ACTTCGTTGG CAATGAATAC CGATTGAAGC GGCCTCGCCA CGATGTTGAA GAGGCCAAGC GCCGTGATGC GACTTTTGCT TCACAGATGT ATGTGACCTG TCGCTTAGTG AATAAGGAAA CAGGTGAGAT TAAAGAGCAG GAGGTATTTA TTGGCGAGCT TCCTCTGATG ACTGAGCGAG GTACGTTCAT TATCAATGGT GCAGAGAGGG TTATTGTTAA TCAGATTGTG CGTAGCCCAG GGGTATATTT TAAGGATGAA CAGGATAAGA ATGGACGACG AACATACAAT GCCAGTGTTA TTCCTAATCG TGGTGCCTGG CTAAAGTTCG AAACAGATAA AAACGACCTT CTGCATGTGC GTGTTGATAA AACACGCAAA ATTAATGCCC ATGTTTTAAT GCGGGCCATG GGGCTATCAG ATAACGATGT AATCGATAAG CTTCGTCACC CTGAGTATTA TAAGAAGTCT ATTGAGGCTG CAAATGAGGA AGGAATTAGT TCAGAGGATC AGGCTTTGCT TGAGCTTTAT AAAAAGCTTC GTCCTGGTGA GCCTCCTTCA GTAAGCGGTG GACAACAGCT TCTGCAGACC CGATTTTTTG ATCCCAAGCG TTATGACCTT GGCCGAGTTG GCCGTTACAA GATTAATAAA AAGCTGCGTC TAACAATCCC GGATACGGTG CGTACCCTCA CCCATGAGGA TGTGCTCTCA ACCCTTGATT ATCTGATCAA TTTAGAACTC GATGTGGGTG GTGCCAGCCT GGATGACATT GATCACCTGG GCAATCGTCG AGTGCGTTCA GTGGGTGAAC TTTTGCAGAA CCAGGTCCGG GTTGGTCTCA ATCGCCTGGA GAGGATCATC AAAGAACGGA TGACTGTGGG TGAGACCGAT TCGCTCACGC CAGCTCAGTT GGTGAACCCC AAGCCTCTCG TTGCAGCTGT TAAGGAGTTC TTCGGTTCCA GTCAGCTGAG TCAGTTCATG GATCAGACGA ATCCATTGGC TGAGCTCACC CACAAACGTC GTATCTCAGC TCTGGGGCCA GGGGGCCTAA CTAGGGAGCG TGCTGGCTTC GCGGTGCGTG ATATTCACCC CTCCCATTAT GGCCGTCTCT GCCCAATTGA GACACCGGAA GGCCCGAATG CAGGTTTGAT CAATTCACTG GCTACCCACG CTCGGGTCAA TCAGTATGGC TTCATTGAGA CTCCTTTCTG GAAAGTTGAG AACGGTCGCC TGATCAAGGA GGGAGACCCC ATCTATTTAT CAGCCGATCT TGAGGATGAG TGTCGTGTGG CTCCTGGTGA TGTGGCCACT GATGCCGATG GACAGATTCT CGCTGAATTA ATTCCGGTGC GTTATCGCCA GGACTTCGAA AAAGTTCCAC CTGAGCAGGT GGATTATGTG CAGCTCTCGC CGGTGCAGGT GATCTCTGTC GCCACCTCCT TGATTCCTTT CCTCGAGCAC GATGACGCCA ATAGGGCTCT GATGGGATCA AATATGCAAC GGCAGGCGGT ACCACTGTTG CGGCCTGAGC GGCCTTTGGT GGGGACGGGC CTCGAAACTC AGGTGGCTCG TGATTCCGGG ATGGTGCCCA TCTCCCGTGT TAATGGCATG GTCACATTCG TGGATGCCAC TGCGATCATC GTCCGGGATG AGGATGGGGT TGATCACACC CACTACCTGC AGAAGTATCA GCGTTCAAAT CAGGACACCT GTCTCAATCA GCGTCCGATT GTCTGCCAGG GTGATCCGGT GATCGTGGGT CAGGTTCTTG CGGATGGTTC TGCTTGTGAA GGCGGTGAAA TTGCTCTAGG TCAGAACGTT TTGGTCGCTT ATATGCCTTG GGAGGGTTAC AACTACGAGG ATGCTATTCT TGTCAGCGAA CGCCTAGTTA AGGACGACCT CTACACCTCG GTGCATATCG AGAAGTATGA GATCGAGGCA CGACAGACAA AGCTTGGTCC CGAGGAGATC ACCAGGGAGA TACCCAATGT TGCCGAGGAA AGCCTTGGCA ATCTTGATGA AATGGGCATC ATTCGCATCG GTGCTTTTGT CGAGAGCGGC GACATTCTGG TGGGCAAAGT GACACCTAAA GGTGAGTCAG ATCAGCCTCC AGAAGAGAAG CTTTTGCGTG CCATTTTTGG TGAAAAAGCG CGCGATGTGC GTGATAACTC TCTACGCGTT CCCAGTACTG AACGAGGACG AGTTGTGGAC GTTCGTATTT ACACTCGTGA ACAGGGTGAT GAGCTTCCCC CTGGCGCCAA TATGGTTGCT CGAGTGTATG TGGCTCAGCG TCGCAAGATC CAGGTTGGCG ACAAGATGGC CGGTCGCCAC GGCAATAAAG GCATCATCAG TCGCATTCTT CCCCGCGAAG ATATGCCTTT TCTGCCCGAT GGCACTCCAG TTGACATCGT TTTGAATCCT TTGGGCGTGC CCAGCCGGAT GAATGTGGGT CAGGTGTTTG AGTGCTTAAT GGGATGGGCC GCAGCCAATC TCGACTGTCG GGTCAAGGTG GTGCCTTTTG ATGAAATGTA TGGAGCTGAA AAGTCCCAGC AGACAGTGGA GGCTTATCTC AAGGAAGCTG CCAAACAGCC AGGTAAGGAG TGGGTCTACA ACCCTGAAAA TCCTGGCAAG CTTCAATTGA TTGATGGACG CTCGGGTGAA CCTTTCGACC AGCCGGTGAC CGTTGGTTAT GCACAGATTC TGAAGCTTGT TCATTTGGTT GATGACAAGA TCCATGCTCG CTCAACAGGT CCCTATTCAC TGGTAACCCA GCAACCTCTT GGCGGTAAGG CTCAACAAGG TGGCCAACGT TTAGGTGAGA TGGAGGTGTG GGCTCTTGAG GCCTATGGCG CCGCTTACAC CCTGCAGGAG CTGCTTACTG TTAAATCAGA CGATATGCAG GGTCGGAATG AAGCCCTCAA CGCAATCGTC AAGGGCAAGC CTATCCCCAG GCCGGGGACA CCCGAGTCTT TCAAGGTATT GATGCGTGAG CTTCAGTCTC TAGGGCTCGA TATCGCTGTT TATACCGATG AAGGCAAAGA AGTGGATCTG ATGCAGGATG TGAATCCTCG CCGTAGTACT CCTAGTCGAC CCACTTACGA ATCCCTTGGA GTAGCGGATT ACGACGAAGA TTAA
|
Protein sequence | MSSSAIQVAK TATYLPDLVE VQRASFKWFL DEGLIEELDS FSPITDYTGK LELHFVGNEY RLKRPRHDVE EAKRRDATFA SQMYVTCRLV NKETGEIKEQ EVFIGELPLM TERGTFIING AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK INAHVLMRAM GLSDNDVIDK LRHPEYYKKS IEAANEEGIS SEDQALLELY KKLRPGEPPS VSGGQQLLQT RFFDPKRYDL GRVGRYKINK KLRLTIPDTV RTLTHEDVLS TLDYLINLEL DVGGASLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP KPLVAAVKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY GRLCPIETPE GPNAGLINSL ATHARVNQYG FIETPFWKVE NGRLIKEGDP IYLSADLEDE CRVAPGDVAT DADGQILAEL IPVRYRQDFE KVPPEQVDYV QLSPVQVISV ATSLIPFLEH DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISRVNGM VTFVDATAII VRDEDGVDHT HYLQKYQRSN QDTCLNQRPI VCQGDPVIVG QVLADGSACE GGEIALGQNV LVAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNVAEE SLGNLDEMGI IRIGAFVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV PSTERGRVVD VRIYTREQGD ELPPGANMVA RVYVAQRRKI QVGDKMAGRH GNKGIISRIL PREDMPFLPD GTPVDIVLNP LGVPSRMNVG QVFECLMGWA AANLDCRVKV VPFDEMYGAE KSQQTVEAYL KEAAKQPGKE WVYNPENPGK LQLIDGRSGE PFDQPVTVGY AQILKLVHLV DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ GRNEALNAIV KGKPIPRPGT PESFKVLMRE LQSLGLDIAV YTDEGKEVDL MQDVNPRRST PSRPTYESLG VADYDED
|
| |