Gene P9303_23261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_23261 
SymbolrpoA 
ID4777179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2046104 
End bp2047042 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content55% 
IMG OID640087846 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001018326 
Protein GI124024019 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCAAT ACCAGATCGA CCGGATCGAG CATCAAGTTG CCGATGATCG CGCCCAAACA 
GGCGTTTTCC TCATTGGACC GCTCGAACGC GGGCAAGCCA CCACGCTTGG CAATTCCCTC
CGCAGGGTGC TGATGGGGGG CCTTGAGGGG AGCGCCATCA CCGCCGTGCG CATTGCCGGT
GTCAACCACG AGTATGCGAC CATCCCAGGA GTCCGAGAAG ATGTACTCGA CATCCTGTTG
AACTGCAAAC AGCTGTCGGT AAACAGTCGA ACGAGTGAAC TAGAGATTGG GCGACTTGTG
GTTACCGGCC CTGCCCAGGT CAAAGCTAAA GACCTGCAAT TCTCCTCACA AGTCCAAGTG
GTGGATGGAG ACCGCCAAAT TGCCACTGTG CATGAGGGTC ACAGCCTTGA GTTAGAAGTT
CATGTTGAGC GGGGGATCGG TTACCGCCCT GTTGATCGCC ACAACGAAGA TATAAGCGCC
ATTGATCTAC TGCAGATCGA TGCTGTGTTC ATGCCCGTCC GTAGGGTGAA CTTCACCATC
GACGAGACTG CCGTCGCAGA AGGAGGATCT ACCCGTGAGA GGCTGCGCAT AGAAATTGTC
ACCGATGGTT CAACTACACC AGATGATGCC CTTGCAGAGT CAGCCAACCA ACTGATCGAA
CTCTTTCAGC CCCTTGCCAC CGTCACCATG GTGGAAGAAC CAGGCCTGGA ACCCGAGCCT
TCCGCTGAAT CACAGATCCC CCTTGAAGAG CTCAACCTCT CTGTAAGGGC TTACAACTGC
CTCAAGCGAG CCCAGGTGAA CTCAGTTTCT GACCTGATGG GCTTCAGCTA CGAGGACCTG
CTTGAGATCA AGAACTTCGG CTCTAAGTCT GCAGACGAGG TCATCGAAGC CCTAGAGCGC
ATCGGCATTT CAATTCCACA AAGCCGTACC AGCACCTAG
 
Protein sequence
MLQYQIDRIE HQVADDRAQT GVFLIGPLER GQATTLGNSL RRVLMGGLEG SAITAVRIAG 
VNHEYATIPG VREDVLDILL NCKQLSVNSR TSELEIGRLV VTGPAQVKAK DLQFSSQVQV
VDGDRQIATV HEGHSLELEV HVERGIGYRP VDRHNEDISA IDLLQIDAVF MPVRRVNFTI
DETAVAEGGS TRERLRIEIV TDGSTTPDDA LAESANQLIE LFQPLATVTM VEEPGLEPEP
SAESQIPLEE LNLSVRAYNC LKRAQVNSVS DLMGFSYEDL LEIKNFGSKS ADEVIEALER
IGISIPQSRT ST