Gene P9301_18211 details


Gene Information

Locus tag: P9301_18211
Symbol:
ID: 4911470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1541677
End bp: 1542711
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 37%
IMG OID: 640161425
Product: type II alternative sigma-70 family RNA polymerase sigma factor
Protein accession: YP_001092045
Protein GI: 126697159
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
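
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1541677
end_bp = 1542711

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the Gene Length (1035 bp) and Protein Length (344 aa) fields.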


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTATTGT TTACAAAACA TCATTTAGCA TTCATGTCCT CTGAAACAAT AAGTGAAAAT 
AAACTAACGT CAATCGCAAG TTTAAAAGCA AGTAATGATG TAGATCTTGT TCGATCATAT
TTGAGGGATA TAGGAAGAGT TCCCTTACTA TCGCATGAGC AAGAAATTAC TCTAGGTCGA
CAAGTTCAAG AGTATATGCA AGTTGAAAGA GCTGAATTAG AAATCAGCGA ATTAACAGGA
GATAAACCCA GTATTGATGA ATTATCGACG AAATTAAACT TGTCTACTTC AGTAATAAAA
AAAAGATTGA GAGCTGGACA GAGAGCTAAA GAAAGAATGG TTGCAGCAAA TTTAAGATTG
GTAGTGAGCG TTGCAAAAAA ATATACAAAA AGAAATATGG AACTTTTAGA TTTAATTCAG
GAGGGAACTA TAGGACTTGT TAGAGGAGTT GAAAAATTTG ATCCAGCGAG AGGCTACAAG
TTTTCAACAT ATGCATATTG GTGGATTAGA CAAGGTATTA CTAGAGCAAT AGCTGAAAAG
AGTCGGGCGA TTAGGTTGCC TATCCACATT ACCGAAATGT TGAATAAGTT AAAAAAGGGT
CAAAGAGAGC TCAGTCAAGA AATGTCTAGA ACTCCAACTG TAAGTGAACT TGCGAAATAC
GTAGAGCTTC CAGAAGATGA CGTTAAAGAC TTAATGTGCA AAGCTGGGCA GCCAGTTAGT
CTTGAAACCA AGGTAGGTGA TGGTGAAGAT ACTGTTTTAC TAGATTTACT TGCAGGGGGC
GAGGATTTGC CAGACGAACA AATTGAGATG GATTGTATGA GAGGTGATCT TCATTCTCTT
TTACATCAAT TACCTGATCT GCAATGTAGG GTTTTAAGAA TGAGATACGG GATGGATGGT
GATGAGCCAA TGTCTCTTAC AGGTATAGGA AGGGTCCTAG GAATAAGTAG AGATCGAGTA
AGAAACCTAG AACGTGATGG TTTAAGAGGC TTGAGAAGAC TTAGTCATAA TGTAGAAGCT
TACTTCGTTT CTTGA
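
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch of how a GC percentage could be computed from such text follows; `gc_percent` is a hypothetical helper, not something provided by the source database, and the example input is only the first 10-base block of the sequence rather than the full gene.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (spaces and line breaks used for layout) is ignored.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence, used here only as sample input.
first_block = "ATGGTATTGT"
print(gc_percent(first_block))
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the GC content field of the record.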
 
Protein sequence
MVLFTKHHLA FMSSETISEN KLTSIASLKA SNDVDLVRSY LRDIGRVPLL SHEQEITLGR 
QVQEYMQVER AELEISELTG DKPSIDELST KLNLSTSVIK KRLRAGQRAK ERMVAANLRL
VVSVAKKYTK RNMELLDLIQ EGTIGLVRGV EKFDPARGYK FSTYAYWWIR QGITRAIAEK
SRAIRLPIHI TEMLNKLKKG QRELSQEMSR TPTVSELAKY VELPEDDVKD LMCKAGQPVS
LETKVGDGED TVLLDLLAGG EDLPDEQIEM DCMRGDLHSL LHQLPDLQCR VLRMRYGMDG
DEPMSLTGIG RVLGISRDRV RNLERDGLRG LRRLSHNVEA YFVS