Gene A9601_16891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_16891 
SymbolrpoB 
ID4718419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1430197 
End bp1433490 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content38% 
IMG OID640079415 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001010079 
Protein GI123969221 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGTA GCGCTTTACA GGTAGCAAAA ACAGCTACTT ATCTACCAGA TTTAGTTGAA 
GTACAAAGAG CAAGCTTTAA ATGGTTTTTG GAAAAGGGTT TAATAGAAGA ACTACAAAAT
TTTTCTCCTA TTTCTGATTA CACAGGTAAA TTAGAATTAC ATTTTATCGG TGAGGAGTAC
AGGTTAAAAA GACCTAGACA TGATGTTGAG GAAGCTAAGA GAAGAGATGC TACATTCGCA
TCTCAGATGT ATGTAACTTG CAGGTTAATC AATAAAGAGA CAGGGGAAAT TAAAGAACAA
GAGGTATTTA TTGGCGAATT ACCATTAATG ACTGAAAGAG GAACTTTTAT CATTAATGGC
GCCGAAAGAG TTATTGTTAA TCAAATTGTT CGAAGTCCCG GAGTCTATTT CAAAGATGAA
CTGGATAAAA ATGGTCGAAG AACTTACAAC GCTAACGTTA TTCCTAATAG AGGTGCATGG
TTAAAATTTG AGACCGACAA AAATAATTTA CTTTATGTGA GAGTTGATAA AACTAGAAAA
ATTAATGCTC ATGTTCTTAT GAGAGCAATG GGTCTTTCAG ATAATGATGT GGTTGATAAA
CTTAGGCATC CTGAATTTTA TCAAAACTCA ATTGATTCAG CTAACGACGA GGGTATAAAT
TCAGAAGATC AGGCATTACT TGAGCTGTAT AAGAAGCTTC GTCCTGGTGA ACCGCCCTCT
GTGTCTGGTG GACAACAACT ATTAAATAGT AGATTTTTTG ATCCTAAAAG ATATGATTTA
GGCCGAGTTG GTAGATATAA AATAAATAAA AAATTGAGAC TAACCGTACC AGACGATGTG
AGAACACTTA CCCATGAAGA TGTTCTATCT ACCATTGATT ATTTAATTAA CCTAGAATTG
GATATTGGTG GAGCTAGTTT GGATGATATT GACCATCTTG GTAATCGAAG GGTTAGATCT
GTAGGAGAAC TTCTTCAAAA TCAAGTCAGG GTTGGGCTTA ATCGGTTAGA GAGAATTATC
AAAGAAAGAA TGACTGTAGG AGAAACAGAT TCTTTAACTC CTGCTCAACT AGTCAATCCA
AAACCTTTGG TCGCTGCTAT AAAGGAATTT TTTGGTTCCA GTCAATTAAG TCAGTTCATG
GATCAAACTA ATCCTTTAGC TGAATTAACA CATAAAAGAA GAATCTCTGC ATTAGGTCCA
GGAGGTTTAA CTAGAGAAAG AGCAGGCTTT GCGGTAAGAG ATATACACCC TTCACATTAC
GGTAGATTAT GCCCAATCGA GACTCCTGAA GGTCCTAATG CAGGACTTAT AAATTCTTTA
GCTACCCACG CAAGAGTTAA TGAGTATGGT TTTATTGAAA CACCTTTTTG GGAAGTTAAA
AACGGTAAAG TTAATAAAGA AGGTAATCCT GTTTATCTTT CTGCTGATTT AGAAGATGAG
TGTAGAGTGG CTCCAGGTGA CGTCGCAACT GATAAGGATG GCAATATAAT TGCAGATTTA
ATACCAGTAA GATATAGACA GGATTTTGAA AAAGTACCTC CTCATCAAGT TGATTACGTT
CAGCTTTCTC CTGTTCAGGT AATTTCAGTT GCTACTTCAC TTATTCCTTT CTTGGAACAT
GATGATGCTA ATAGAGCTCT TATGGGATCG AATATGCAAC GCCAAGCGGT TCCATTGCTC
AGGCCAGAAC GGCCTTTAGT TGGTACAGGT TTAGAATCTC AAGTTGCTAG AGATTCGGGT
ATGGTTCCCA TAACAAAAGT TAATGGAACT GTATCTTACG TAGACGCTAA TGAGATTGTC
GTTAAAGACG ATCATGGCAA TGAACATTTT CATTATCTTC AGAAATATCA AAGATCAAAT
CAAGATACTT GCCTCAACCA AAGACCCATA GTGAAAATTG GAGATAAAGT GATATCGGGT
CAGGTTTTAG CAGATGGATC TGCATGTGAA GGTGGTGAAA TAGCCCTTGG CCAAAACGTT
TTAATTGCTT ACATGCCATG GGAGGGGTAC AACTACGAAG ATGCGATACT TGTAAGCGAG
AGGATGGTAA CTGATGATTT ATATACTTCA GTACATATTG AAAAATATGA AATTGAAGCA
AGACAAACGA AGCTAGGACC TGAAGAAATT ACGAGAGAGA TTCCTAACAT CTCGGAAGAA
AGCTTGAATA ATCTTGATGA GATGGGAATT ATTAGGATTG GTGCTTTTGT TGAGAGTGGA
GATATCCTTG TAGGAAAGGT GACACCTAAA GGTGAATCAG ATCAACCACC TGAAGAAAAA
CTGTTAAGAG CTATTTTCGG TGAAAAGGCT CGAGATGTGA GAGACAATTC CCTTAGGGTA
CCCAAAACTG AAAAGGGAAG AGTTTTAGAT GTTCGCATTT ACACTAGAGA ACAAGGAGAT
GAATTACCTC CAGGGGCCAA CATGGTTGTT AGAGTGTATG TGGCTCAGAG AAGGAAAATT
CAAGTAGGCG ATAAAATGGC TGGAAGGCAT GGAAATAAAG GGATTATTAG CAGAATTTTA
CCAAGAGAAG ATATGCCTTA TTTACCTGAT GGAACGCCAG TAGATATAGT TCTTAACCCT
TTAGGAGTTC CAAGTAGGAT GAATGTAGGT CAAGTTTTTG AATTATTGAT GGGTTGGGCA
GCTGCCAACT TAAATTGCAG GGTTAAAGTT GTTCCATTTG ATGAAATGTA TGGAGCTGAA
AAGTCACATC AAACTGTTCA AGCATTTTTA GAGGAAGCTT CAAAACAGCC AGGTAAAGCA
TGGGTTTACA ATCCTGAGGA TCCTGGAAAG TTATTACTTA AAGATGGCAG AACAGGGGAA
CCCTTCGATC AGCCAGTTGC TGTCGGATAC TCTCACTTCC TCAAATTAGT TCATTTGGTG
GATGATAAAA TTCATGCTAG ATCTACTGGT CCTTACTCTT TAGTTACACA GCAACCATTG
GGTGGTAAAG CACAACAAGG TGGACAAAGG CTTGGAGAAA TGGAAGTATG GGCTCTTGAA
GCTTATGGAG CTGCTTATAC TCTTCAGGAA TTGTTAACAG TTAAATCTGA TGACATGCAA
GGAAGAAATG AAGCTCTTAA TGCGATCGTA AAAGGTAAAC CGATCCCAAG GCCAGGTACT
CCTGAGTCAT TTAAAGTTCT TATGAGGGAA TTACAATCTC TAGGCTTGGA TATAGGGGTT
TATACAGATG AAGGAAAAGA GGTAGATTTA ATGCAAGATA TCAATCCGAG AAGAAATACT
CCATCAAGGC CTACTTACGA ATCACTAGGA ACCTCTGAAT ATGAGGAAGA TTAA
 
Protein sequence
MSSSALQVAK TATYLPDLVE VQRASFKWFL EKGLIEELQN FSPISDYTGK LELHFIGEEY 
RLKRPRHDVE EAKRRDATFA SQMYVTCRLI NKETGEIKEQ EVFIGELPLM TERGTFIING
AERVIVNQIV RSPGVYFKDE LDKNGRRTYN ANVIPNRGAW LKFETDKNNL LYVRVDKTRK
INAHVLMRAM GLSDNDVVDK LRHPEFYQNS IDSANDEGIN SEDQALLELY KKLRPGEPPS
VSGGQQLLNS RFFDPKRYDL GRVGRYKINK KLRLTVPDDV RTLTHEDVLS TIDYLINLEL
DIGGASLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP
KPLVAAIKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY
GRLCPIETPE GPNAGLINSL ATHARVNEYG FIETPFWEVK NGKVNKEGNP VYLSADLEDE
CRVAPGDVAT DKDGNIIADL IPVRYRQDFE KVPPHQVDYV QLSPVQVISV ATSLIPFLEH
DDANRALMGS NMQRQAVPLL RPERPLVGTG LESQVARDSG MVPITKVNGT VSYVDANEIV
VKDDHGNEHF HYLQKYQRSN QDTCLNQRPI VKIGDKVISG QVLADGSACE GGEIALGQNV
LIAYMPWEGY NYEDAILVSE RMVTDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNISEE
SLNNLDEMGI IRIGAFVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV
PKTEKGRVLD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL
PREDMPYLPD GTPVDIVLNP LGVPSRMNVG QVFELLMGWA AANLNCRVKV VPFDEMYGAE
KSHQTVQAFL EEASKQPGKA WVYNPEDPGK LLLKDGRTGE PFDQPVAVGY SHFLKLVHLV
DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ
GRNEALNAIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDEGKEVDL MQDINPRRNT
PSRPTYESLG TSEYEED