Gene Information |
Locus tag | NATL1_18861 |
Symbol | rpoB |
ID | 4780040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1546401 |
End bp | 1549688 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085175 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_001015706 |
Protein GI | 124026591 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
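The coordinates and lengths in the Gene Information table can be cross-checked with a short calculation. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1546401
end_bp = 1549688

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3288 1095
```

Under these assumptions the result agrees with the listed Gene Length (3288 bp) and Protein Length (1095 aa).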
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.816976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAA GCGCGATTCA GGTAGCCAAA GCAGCTACAT ATCTGCCTGA TCTGGTTGAG GTGCAAAGAT CAAGTTTTAA GTGGTTTTTG GATAAAGGCT TAATAGAAGA ATTAGACAAT TTTTCTCCCA TTACTGACTA TACAGGTAAG CTTGAGCTGC ATTTTATTGG AGCAGAATAT AAGCTTAAGC GTCCTAGGCA TGATGTAGAA GAAGCAAAAA GAAGAGATGC AACGTTTGCT TCTCAAATGT ATGTCACTTG TCGCTTAGTT AATAAAGAAA CCGGAGAGAT TAAAGAACAA GAAGTTTTTA TAGGTGAACT ACCTTTAATG ACTGAACGTG GAACTTTTAT AATTAATGGT GCTGAGAGAG TAATAGTCAA TCAAATCGTT CGAAGTCCAG GAGTTTATTT CAAAGATGAG CAAGATAAAA ATGGTAGGAG AACTTACAAT GCAAGCGTTA TTCCTAATCG CGGAGCATGG TTAAAGTTTG AAACTGATAA AAACGATTTG CTGCATGTTC GTGTAGATAA AACAAGAAAA ATTAATGCCC ATGTCTTAAT GAGGGCAATG GGTTTATCAG ATAATGACGT TATAGATAAA TTACGACATC CTGAGTTTTA TAAAAAGTCA ATTGATGCGG CAAATGAAGA AGGTATAAGT TCAGAAGACC AAGCATTGTT AGAGCTTTAT AAAAAGTTAA GACCAGGTGA ACCTCCATCA GTAAGTGGTG GCCAACAGCT ACTCCAAACA AGATTTTTTG ATCCAAAACG ATATGATTTA GGAAGGGTTG GTAGATATAA AATAAATAAG AAATTACGCT TAACTATTCC TGATAATTTA AGAACACTAA CCAACGAAGA TGTTCTTTCT ACCTTAGATT ATTTAATCAA TTTAGAATTA GATGTTGGTG GAGCTACTTT GGATGATATT GATCATTTAG GTAATAGAAG AGTTAGGTCA GTCGGTGAAC TTCTACAAAA TCAAGTTCGA GTTGGTTTAA ATAGACTTGA AAGGATAATT AAAGAGAGGA TGACCGTTGG TGAGACAGAT TCACTTACTC CTGCGCAATT GGTAAATCCA AAACCTTTAG TAGCAGCAAT AAAAGAATTT TTTGGTTCAA GTCAATTAAG TCAATTCATG GATCAAACCA ATCCATTAGC TGAATTAACT CATAAAAGAC GTATCTCTGC TTTGGGACCA GGAGGTTTAA CTCGTGAAAG AGCTGGCTTT GCCGTTCGGG ATATTCATCC ATCTCACTAC GGAAGACTTT GTCCGATTGA AACACCTGAA GGACCAAATG CAGGTCTGAT TAATTCGTTA GCAACTCACG CACGAGTAAA TGAATACGGT TTTATTGAGA CACCATTTTG GAAAGTTGAA AATGGTCGAT TAATTAAGGA AGGTGATCCT ATTTATCTTT CTGCAGATCT AGAAGATGAA TGTAGGGTTG CGCCAGGTGA TGTGGCTACA AACGAAGAGG GGAAAATAAT GGCAGAACTT GTTCCAGTTA GGTATCGACA AGATTTTGAA ACGGTTTCTC CCGAACAAGT TGATTATGTC CAACTATCAC CTGTTCAGGT TATTTCTGTA GCTGCATCAT TAATTCCATT TCTGGAACAC GATGATGCTA ACAGGGCTTT GATGGGTTCC AACATGCAAA GACAAGCAGT TCCTCTTTTG CGTCCAGAAA GGCCTTTAGT TGGGACTGGT TTAGAGACTC AAGTTGCTAG AGACTCTGGC ATGGTTCCAA TTTCAAAAGT AAATGGAACA GTGAGTTATG TTGACGCCAA TGCAATTGTC 
GTTACTGATG ATGAGGGAAA TGATCACACG CATTACTTGC AAAAATATCA GAGATCTAAT CAAGATACTT GTTTAAACCA TAGGCCTATA GTTTTTAATG GTGACCCAGT AATTGTGGGG CAAGTTTTAG CTGATGGTTC GGCATGTGAA GGAGGGGAAA TAGCTCTTGG TCAAAATGTA TTAATTGCAT ACATGCCATG GGAAGGGTAC AACTATGAAG ATGCAATTCT TGTTAGTGAA AGGTTAGTTA AAGATGATCT ATATACCTCT GTCCATATAG AAAAATATGA AATAGAAGCT CGTCAAACCA AGCTAGGTCC AGAAGAAATA ACAAGAGAAA TTCCAAATGT TTCAGAAGAA AACCTTGGGA ATTTGGATGA AATGGGAATA ATACGAATAG GGGCTTACGT TGAGAGTGGT GACATACTTG TTGGAAAAGT AACCCCTAAA GGAGAATCAG ATCAGCCTCC TGAGGAAAAA CTTTTAAGAG CTATTTTTGG AGAAAAAGCT AGAGATGTAA GAGATAACTC TCTCCGAGTT CCCTCAACTG AAAGAGGAAG AGTAGTTGAT GTAAGGATCT ATACAAGAGA GCAAGGTGAT GAGTTACCAC CTGGCGCAAA TATGGTTGTA AGAGTCTATG TAGCGCAACG CAGGAAAATA CAAGTTGGCG ATAAAATGGC AGGAAGGCAT GGCAATAAAG GGATTATAAG CAGAATACTG CCAAGAGAGG ATATGCCTTA TCTTCCTGAT GGCACACCTG TAGACATATG CCTCAATCCA CTTGGGGTCC CAAGCAGAAT GAATGTAGGG CAAGTCTTTG AGCTTCTTAT GGGTTGGGCT GCCTCAAACT TGGATTGCAG AGTTAAAATT GTCCCATTTG ATGAGATGTA TGGACCAGAA ATGTCTAATC AGACTGTACA AGCCTATTTA AAAGAGGCTG CAAAGCAGCC AGGTAAATCA TGGGTATACA ACCCTAAGGA CCCTGGGAAG TTACTCCTTA AGGATGGCCG AACCGGTGAA CCTTTTGATC AACCAGTTGC CGTTGGCTAC GCTCACTTCC TAAAACTTGT GCACCTAGTC GATGATAAAA TTCATGCTCG ATCAACAGGT CCATATTCTT TAGTTACTCA ACAGCCTCTA GGTGGAAAGG CTCAACAAGG AGGCCAGAGA CTTGGGGAAA TGGAGGTATG GGCACTTGAG GCTTATGGGG CAGCATACAC TTTACAAGAA CTTTTAACTG TAAAGTCTGA CGATATGCAA GGTCGGAATG AGGCTCTTAA TTCTATTGTT AAAGGCAAGC CAATTCCAAG GCCTGGGACT CCAGAATCTT TCAAGGTTTT AATGAGAGAA CTTCAGTCAT TGGGATTAGA TATTGGAGTT TATACAGATG ATGGAAAAGA AGTTGATCTA ATGCAAGATG TAAATCCTCG TAGAAGCACG CCAAGTAGAC CTACCTATGA ATCATTAGGT AAAGAATACG AGGAGTAA
|
Protein sequence | MSRSAIQVAK AATYLPDLVE VQRSSFKWFL DKGLIEELDN FSPITDYTGK LELHFIGAEY KLKRPRHDVE EAKRRDATFA SQMYVTCRLV NKETGEIKEQ EVFIGELPLM TERGTFIING AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK INAHVLMRAM GLSDNDVIDK LRHPEFYKKS IDAANEEGIS SEDQALLELY KKLRPGEPPS VSGGQQLLQT RFFDPKRYDL GRVGRYKINK KLRLTIPDNL RTLTNEDVLS TLDYLINLEL DVGGATLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP KPLVAAIKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY GRLCPIETPE GPNAGLINSL ATHARVNEYG FIETPFWKVE NGRLIKEGDP IYLSADLEDE CRVAPGDVAT NEEGKIMAEL VPVRYRQDFE TVSPEQVDYV QLSPVQVISV AASLIPFLEH DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISKVNGT VSYVDANAIV VTDDEGNDHT HYLQKYQRSN QDTCLNHRPI VFNGDPVIVG QVLADGSACE GGEIALGQNV LIAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNVSEE NLGNLDEMGI IRIGAYVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV PSTERGRVVD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL PREDMPYLPD GTPVDICLNP LGVPSRMNVG QVFELLMGWA ASNLDCRVKI VPFDEMYGPE MSNQTVQAYL KEAAKQPGKS WVYNPKDPGK LLLKDGRTGE PFDQPVAVGY AHFLKLVHLV DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ GRNEALNSIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDDGKEVDL MQDVNPRRST PSRPTYESLG KEYEE
|
| |
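The gene and protein sequences above can be compared directly, since the gene sequence is given in coding orientation (it begins with ATG). The sketch below translates only the first 30 bases with a deliberately partial codon table, and adds a small helper for GC fraction. The function names `translate` and `gc_fraction` are hypothetical, and the record does not say which sequence its 39% GC figure was computed from, so no value is asserted for it here.

```python
# First 30 bases of the gene sequence above, with the spacing removed.
dna_prefix = "ATGAGCAGAAGCGCGATTCAGGTAGCCAAA"

# Partial codon table: only the codons that occur in this prefix.
# These amino-acid assignments are the same in the standard code and table 11.
CODONS = {
    "ATG": "M", "AGC": "S", "AGA": "R", "GCG": "A", "ATT": "I",
    "CAG": "Q", "GTA": "V", "GCC": "A", "AAA": "K",
}

def translate(dna):
    """Translate complete codons of `dna` using the partial table above."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring spaces and case."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)

print(translate(dna_prefix))  # MSRSAIQVAK
```

The printed peptide matches the first ten residues of the listed protein sequence. Translating the full gene would need a complete codon table, for example the one supplied by a sequence-analysis library.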