Gene P9211_16551 details


Gene Information

Locus tag: P9211_16551
Symbol: rpoA
ID: 5730206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1491597
End bp: 1492535
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 40%
IMG OID: 641286035
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001551540
Protein GI: 159904196
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
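
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the record does not state either convention explicitly); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1491597
end_bp = 1492535

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length implied by the coordinates
print(protein_length, "aa")   # protein length implied by the gene length
```

Under these assumptions the script reproduces the listed values of 939 bp and 312 aa.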


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.271091
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGCAAT ACCAGATTGA TCGGATCGAC CATCAAATTT CCGATGACCG CTCTCAAACA 
GGTTTATTCC TCATAGGACC TCTTGAAAGA GGTCAAGCAA CTACGTTAGG TAACTCTCTT
CGAAGAGTTC TTATGGGAGG TCTTGAAGGA AGCGCTGTTA CGGCAGTTCG CATATCTGGC
GTAAATCATG AATATGCAAC TATCCCTGGA GTGAGAGAAG ATGTTTTAGA TATTCTCCTA
AATTGCAAGC AGCTTTCAGT AAATAGTAGA AGTCCAGAAC TTGAAATAGG CCGTTTAGTG
GTTAATGGAC CTGCAGAAGT TAAAGCCCGT GATGTGCAAT TTTCTTCTCA GGTACAAGTT
GTTGATGGAG ATAGACCAAT AGCCACTGTG CATTCAGGAC ATAGCCTTGA ATTGGAGCTG
CATGTAGAAA GGGGAGTTGG CTATCGTCCA GTTGATCGTC ATAACGAAGA AACAACTTCA
ATTGATTTGC TTCATATTGA TGCTGTTTTT ATGCCAATTA AGAAGGTGAA TTTCAATATT
GACGAAACGG CTGTTGCTGA AGGTGGTTCA ACTAGAGAAA GATTGAAAAT GGAGATAGTA
ACTGATGGAT CCATGTCTCC TGATGATGCT TTAGCAGAGG CGGCAAATCA ATTAATAGAA
CTATTTCAAC CTTTAGCAAC AGTAACTATG GTTGAAGAAA TACCTCAAGA GCCAGAACCT
TCTGCTGAGG CTCAAATTCC TTTAGAAGAA TTGAATCTAT CTGTTAGAGC CTATAACTGT
TTGAAAAGAG CCCAAGTGAA TTCAGTTTCT GATTTGATGG GATTTAGCTA CGAGGATCTA
TTAGAAATTA AGAATTTCGG GTCCAAGTCT GCTGATGAAG TTATTGAAGC TTTAGAACGA
ATTGGAATTT CTATTCCACA GAGTCGGACT TCAGCGTAA
 
Protein sequence
MLQYQIDRID HQISDDRSQT GLFLIGPLER GQATTLGNSL RRVLMGGLEG SAVTAVRISG 
VNHEYATIPG VREDVLDILL NCKQLSVNSR SPELEIGRLV VNGPAEVKAR DVQFSSQVQV
VDGDRPIATV HSGHSLELEL HVERGVGYRP VDRHNEETTS IDLLHIDAVF MPIKKVNFNI
DETAVAEGGS TRERLKMEIV TDGSMSPDDA LAEAANQLIE LFQPLATVTM VEEIPQEPEP
SAEAQIPLEE LNLSVRAYNC LKRAQVNSVS DLMGFSYEDL LEIKNFGSKS ADEVIEALER
IGISIPQSRT SA
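
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an initiating GTG is read as methionine rather than valine. The sketch below illustrates this on the first and last rows of the gene sequence. It is an illustration under stated assumptions, not the database's own pipeline: the codon table is built from the standard amino acid assignments (which table 11 shares), the set of start codons is the one listed for table 11 in the NCBI genetic code tables, and the `translate` helper and its `first_codon_is_start` parameter are hypothetical names introduced here.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for table 11; at the initiation position they code for M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_codon_is_start=True):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (begins with the GTG start codon).
head = "GTGTTGCAAT ACCAGATTGA TCGGATCGAC CATCAAATTT CCGATGACCG CTCTCAAACA"
# Last row of the gene sequence (internal codons followed by the TAA stop).
tail = "ATTGGAATTT CTATTCCACA GAGTCGGACT TCAGCGTAA"

print(translate(head))                               # MLQYQIDRIDHQISDDRSQT
print(translate(tail, first_codon_is_start=False))   # IGISIPQSRTSA
```

The two printed strings match the first 20 and last 12 residues of the protein sequence above. The `first_codon_is_start` flag is set to `False` for the tail because ATT is itself a permitted start codon in table 11 and would otherwise be read as M.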