Gene NATL1_18861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18861 
SymbolrpoB 
ID4780040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1546401 
End bp1549688 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content39% 
IMG OID640085175 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001015706 
Protein GI124026591 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.816976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAA GCGCGATTCA GGTAGCCAAA GCAGCTACAT ATCTGCCTGA TCTGGTTGAG 
GTGCAAAGAT CAAGTTTTAA GTGGTTTTTG GATAAAGGCT TAATAGAAGA ATTAGACAAT
TTTTCTCCCA TTACTGACTA TACAGGTAAG CTTGAGCTGC ATTTTATTGG AGCAGAATAT
AAGCTTAAGC GTCCTAGGCA TGATGTAGAA GAAGCAAAAA GAAGAGATGC AACGTTTGCT
TCTCAAATGT ATGTCACTTG TCGCTTAGTT AATAAAGAAA CCGGAGAGAT TAAAGAACAA
GAAGTTTTTA TAGGTGAACT ACCTTTAATG ACTGAACGTG GAACTTTTAT AATTAATGGT
GCTGAGAGAG TAATAGTCAA TCAAATCGTT CGAAGTCCAG GAGTTTATTT CAAAGATGAG
CAAGATAAAA ATGGTAGGAG AACTTACAAT GCAAGCGTTA TTCCTAATCG CGGAGCATGG
TTAAAGTTTG AAACTGATAA AAACGATTTG CTGCATGTTC GTGTAGATAA AACAAGAAAA
ATTAATGCCC ATGTCTTAAT GAGGGCAATG GGTTTATCAG ATAATGACGT TATAGATAAA
TTACGACATC CTGAGTTTTA TAAAAAGTCA ATTGATGCGG CAAATGAAGA AGGTATAAGT
TCAGAAGACC AAGCATTGTT AGAGCTTTAT AAAAAGTTAA GACCAGGTGA ACCTCCATCA
GTAAGTGGTG GCCAACAGCT ACTCCAAACA AGATTTTTTG ATCCAAAACG ATATGATTTA
GGAAGGGTTG GTAGATATAA AATAAATAAG AAATTACGCT TAACTATTCC TGATAATTTA
AGAACACTAA CCAACGAAGA TGTTCTTTCT ACCTTAGATT ATTTAATCAA TTTAGAATTA
GATGTTGGTG GAGCTACTTT GGATGATATT GATCATTTAG GTAATAGAAG AGTTAGGTCA
GTCGGTGAAC TTCTACAAAA TCAAGTTCGA GTTGGTTTAA ATAGACTTGA AAGGATAATT
AAAGAGAGGA TGACCGTTGG TGAGACAGAT TCACTTACTC CTGCGCAATT GGTAAATCCA
AAACCTTTAG TAGCAGCAAT AAAAGAATTT TTTGGTTCAA GTCAATTAAG TCAATTCATG
GATCAAACCA ATCCATTAGC TGAATTAACT CATAAAAGAC GTATCTCTGC TTTGGGACCA
GGAGGTTTAA CTCGTGAAAG AGCTGGCTTT GCCGTTCGGG ATATTCATCC ATCTCACTAC
GGAAGACTTT GTCCGATTGA AACACCTGAA GGACCAAATG CAGGTCTGAT TAATTCGTTA
GCAACTCACG CACGAGTAAA TGAATACGGT TTTATTGAGA CACCATTTTG GAAAGTTGAA
AATGGTCGAT TAATTAAGGA AGGTGATCCT ATTTATCTTT CTGCAGATCT AGAAGATGAA
TGTAGGGTTG CGCCAGGTGA TGTGGCTACA AACGAAGAGG GGAAAATAAT GGCAGAACTT
GTTCCAGTTA GGTATCGACA AGATTTTGAA ACGGTTTCTC CCGAACAAGT TGATTATGTC
CAACTATCAC CTGTTCAGGT TATTTCTGTA GCTGCATCAT TAATTCCATT TCTGGAACAC
GATGATGCTA ACAGGGCTTT GATGGGTTCC AACATGCAAA GACAAGCAGT TCCTCTTTTG
CGTCCAGAAA GGCCTTTAGT TGGGACTGGT TTAGAGACTC AAGTTGCTAG AGACTCTGGC
ATGGTTCCAA TTTCAAAAGT AAATGGAACA GTGAGTTATG TTGACGCCAA TGCAATTGTC
GTTACTGATG ATGAGGGAAA TGATCACACG CATTACTTGC AAAAATATCA GAGATCTAAT
CAAGATACTT GTTTAAACCA TAGGCCTATA GTTTTTAATG GTGACCCAGT AATTGTGGGG
CAAGTTTTAG CTGATGGTTC GGCATGTGAA GGAGGGGAAA TAGCTCTTGG TCAAAATGTA
TTAATTGCAT ACATGCCATG GGAAGGGTAC AACTATGAAG ATGCAATTCT TGTTAGTGAA
AGGTTAGTTA AAGATGATCT ATATACCTCT GTCCATATAG AAAAATATGA AATAGAAGCT
CGTCAAACCA AGCTAGGTCC AGAAGAAATA ACAAGAGAAA TTCCAAATGT TTCAGAAGAA
AACCTTGGGA ATTTGGATGA AATGGGAATA ATACGAATAG GGGCTTACGT TGAGAGTGGT
GACATACTTG TTGGAAAAGT AACCCCTAAA GGAGAATCAG ATCAGCCTCC TGAGGAAAAA
CTTTTAAGAG CTATTTTTGG AGAAAAAGCT AGAGATGTAA GAGATAACTC TCTCCGAGTT
CCCTCAACTG AAAGAGGAAG AGTAGTTGAT GTAAGGATCT ATACAAGAGA GCAAGGTGAT
GAGTTACCAC CTGGCGCAAA TATGGTTGTA AGAGTCTATG TAGCGCAACG CAGGAAAATA
CAAGTTGGCG ATAAAATGGC AGGAAGGCAT GGCAATAAAG GGATTATAAG CAGAATACTG
CCAAGAGAGG ATATGCCTTA TCTTCCTGAT GGCACACCTG TAGACATATG CCTCAATCCA
CTTGGGGTCC CAAGCAGAAT GAATGTAGGG CAAGTCTTTG AGCTTCTTAT GGGTTGGGCT
GCCTCAAACT TGGATTGCAG AGTTAAAATT GTCCCATTTG ATGAGATGTA TGGACCAGAA
ATGTCTAATC AGACTGTACA AGCCTATTTA AAAGAGGCTG CAAAGCAGCC AGGTAAATCA
TGGGTATACA ACCCTAAGGA CCCTGGGAAG TTACTCCTTA AGGATGGCCG AACCGGTGAA
CCTTTTGATC AACCAGTTGC CGTTGGCTAC GCTCACTTCC TAAAACTTGT GCACCTAGTC
GATGATAAAA TTCATGCTCG ATCAACAGGT CCATATTCTT TAGTTACTCA ACAGCCTCTA
GGTGGAAAGG CTCAACAAGG AGGCCAGAGA CTTGGGGAAA TGGAGGTATG GGCACTTGAG
GCTTATGGGG CAGCATACAC TTTACAAGAA CTTTTAACTG TAAAGTCTGA CGATATGCAA
GGTCGGAATG AGGCTCTTAA TTCTATTGTT AAAGGCAAGC CAATTCCAAG GCCTGGGACT
CCAGAATCTT TCAAGGTTTT AATGAGAGAA CTTCAGTCAT TGGGATTAGA TATTGGAGTT
TATACAGATG ATGGAAAAGA AGTTGATCTA ATGCAAGATG TAAATCCTCG TAGAAGCACG
CCAAGTAGAC CTACCTATGA ATCATTAGGT AAAGAATACG AGGAGTAA
 
Protein sequence
MSRSAIQVAK AATYLPDLVE VQRSSFKWFL DKGLIEELDN FSPITDYTGK LELHFIGAEY 
KLKRPRHDVE EAKRRDATFA SQMYVTCRLV NKETGEIKEQ EVFIGELPLM TERGTFIING
AERVIVNQIV RSPGVYFKDE QDKNGRRTYN ASVIPNRGAW LKFETDKNDL LHVRVDKTRK
INAHVLMRAM GLSDNDVIDK LRHPEFYKKS IDAANEEGIS SEDQALLELY KKLRPGEPPS
VSGGQQLLQT RFFDPKRYDL GRVGRYKINK KLRLTIPDNL RTLTNEDVLS TLDYLINLEL
DVGGATLDDI DHLGNRRVRS VGELLQNQVR VGLNRLERII KERMTVGETD SLTPAQLVNP
KPLVAAIKEF FGSSQLSQFM DQTNPLAELT HKRRISALGP GGLTRERAGF AVRDIHPSHY
GRLCPIETPE GPNAGLINSL ATHARVNEYG FIETPFWKVE NGRLIKEGDP IYLSADLEDE
CRVAPGDVAT NEEGKIMAEL VPVRYRQDFE TVSPEQVDYV QLSPVQVISV AASLIPFLEH
DDANRALMGS NMQRQAVPLL RPERPLVGTG LETQVARDSG MVPISKVNGT VSYVDANAIV
VTDDEGNDHT HYLQKYQRSN QDTCLNHRPI VFNGDPVIVG QVLADGSACE GGEIALGQNV
LIAYMPWEGY NYEDAILVSE RLVKDDLYTS VHIEKYEIEA RQTKLGPEEI TREIPNVSEE
NLGNLDEMGI IRIGAYVESG DILVGKVTPK GESDQPPEEK LLRAIFGEKA RDVRDNSLRV
PSTERGRVVD VRIYTREQGD ELPPGANMVV RVYVAQRRKI QVGDKMAGRH GNKGIISRIL
PREDMPYLPD GTPVDICLNP LGVPSRMNVG QVFELLMGWA ASNLDCRVKI VPFDEMYGPE
MSNQTVQAYL KEAAKQPGKS WVYNPKDPGK LLLKDGRTGE PFDQPVAVGY AHFLKLVHLV
DDKIHARSTG PYSLVTQQPL GGKAQQGGQR LGEMEVWALE AYGAAYTLQE LLTVKSDDMQ
GRNEALNSIV KGKPIPRPGT PESFKVLMRE LQSLGLDIGV YTDDGKEVDL MQDVNPRRST
PSRPTYESLG KEYEE