Gene P9303_09841 details

Gene Information

Locus tag: P9303_09841
Symbol:
ID: 4776657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 893575
End bp: 894537
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 54%
IMG OID: 640086492
Product: Type II alternative RNA polymerase sigma factor, sigma-70 family protein
Protein accession: YP_001016998
Protein GI: 124022691
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
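
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the listed gene sequence ends in TGA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 893575
end_bp = 894537
gene_length_bp = 963
protein_length_aa = 320

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.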


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCCT TGGCGGTGCT TTCAGATGTC GACCTGGTGC GTTCATACCT GCGCGATATC 
GGTCGAGTGC CGCTACTGAG CCATGAGCAG GAGATCACGC TGGGTCGTCA GGTGCAGGAG
TTGATGTCTT TAGAGCAGCT TGAGTCTGAA CTGGAAGGTA AAACAGGTGC GCCAGCGAGT
CGTAAAGAAC TAGCGAAGGC AGCTGGATTG AGTGAGTTGC AGCTCAAGAA GAAGTTGCAG
AGCGGACGAC GTGCGAAGGA GCGGATGGTG TCGGCGAACC TGCGCTTAGT GGTGAGTGTT
GCCAAGAAGT ACACCAAAAG GAATATGGAG CTTCTTGATT TGATCCAAGA GGGAACGATC
GGCTTGGTGA GGGGAGTGGA GAAGTTCGAC CCAACCCGTG GCTACAAGTT TTCGACCTAT
GCGTATTGGT GGATTCGCCA GGGGATCACG CGTGCGATTG CGGAGAAGAG CCGGACGATC
CGTCTGCCGA TCCACATCAC AGAGATGCTG AACAAGCTCA AGAAAGGCCA GCGAGAGTTA
AGTCAGGAGA TGGGGCGCAC GCCAACAGTG AGCGAACTTG CAGAGTTTGT GGAGTTGCCC
GAGGAGGAGG TGAAGGATCT GATGTGCCGT GCACGTCAGC CGATGAGTTT GGAGATGAAG
GTGGGAGATG GGGATGAAAC GGAGTTGCTT GAGTTGCTTG CCGGGGAAGA GGAGTTACCG
AGTGAGAAGG TGGAAGTGGA TTGCATGAAA GGCGATTTAC GTACCTTGCT GGAAAAGTTG
CCCGAGTTGC AGGGTCGTGT GCTGCGGATG CGTTATGGAA TCGACGGAGG CGAGCCGATG
AACCTCACCG GGATTGCTAA GACTTTAGGA ATGAGTCGCG ATCGAACACG CCGTCTGGAG
AGGGAAGGCT TGGCGTTGAT GCGAACCTCC TCGTTTGAAC TTGAGGCTTA TATGGTGGTT
TGA
 
Protein sequence
MAPLAVLSDV DLVRSYLRDI GRVPLLSHEQ EITLGRQVQE LMSLEQLESE LEGKTGAPAS 
RKELAKAAGL SELQLKKKLQ SGRRAKERMV SANLRLVVSV AKKYTKRNME LLDLIQEGTI
GLVRGVEKFD PTRGYKFSTY AYWWIRQGIT RAIAEKSRTI RLPIHITEML NKLKKGQREL
SQEMGRTPTV SELAEFVELP EEEVKDLMCR ARQPMSLEMK VGDGDETELL ELLAGEEELP
SEKVEVDCMK GDLRTLLEKL PELQGRVLRM RYGIDGGEPM NLTGIAKTLG MSRDRTRRLE
REGLALMRTS SFELEAYMVV