Gene Information |
Locus tag | P9301_18211 |
Symbol | |
ID | 4911470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1541677 |
End bp | 1542711 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640161425 |
Product | type II alternative sigma-70 family RNA polymerase sigma factor |
Protein accession | YP_001092045 |
Protein GI | 126697159 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family; [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
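
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (this matches the reported 1035 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1541677, 1542711)
print(length, protein_length_aa(length))  # 1035 344
```

Because the record lists the gene as not spliced, no intron correction is needed between the genomic span and the coding length.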
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATTGT TTACAAAACA TCATTTAGCA TTCATGTCCT CTGAAACAAT AAGTGAAAAT AAACTAACGT CAATCGCAAG TTTAAAAGCA AGTAATGATG TAGATCTTGT TCGATCATAT TTGAGGGATA TAGGAAGAGT TCCCTTACTA TCGCATGAGC AAGAAATTAC TCTAGGTCGA CAAGTTCAAG AGTATATGCA AGTTGAAAGA GCTGAATTAG AAATCAGCGA ATTAACAGGA GATAAACCCA GTATTGATGA ATTATCGACG AAATTAAACT TGTCTACTTC AGTAATAAAA AAAAGATTGA GAGCTGGACA GAGAGCTAAA GAAAGAATGG TTGCAGCAAA TTTAAGATTG GTAGTGAGCG TTGCAAAAAA ATATACAAAA AGAAATATGG AACTTTTAGA TTTAATTCAG GAGGGAACTA TAGGACTTGT TAGAGGAGTT GAAAAATTTG ATCCAGCGAG AGGCTACAAG TTTTCAACAT ATGCATATTG GTGGATTAGA CAAGGTATTA CTAGAGCAAT AGCTGAAAAG AGTCGGGCGA TTAGGTTGCC TATCCACATT ACCGAAATGT TGAATAAGTT AAAAAAGGGT CAAAGAGAGC TCAGTCAAGA AATGTCTAGA ACTCCAACTG TAAGTGAACT TGCGAAATAC GTAGAGCTTC CAGAAGATGA CGTTAAAGAC TTAATGTGCA AAGCTGGGCA GCCAGTTAGT CTTGAAACCA AGGTAGGTGA TGGTGAAGAT ACTGTTTTAC TAGATTTACT TGCAGGGGGC GAGGATTTGC CAGACGAACA AATTGAGATG GATTGTATGA GAGGTGATCT TCATTCTCTT TTACATCAAT TACCTGATCT GCAATGTAGG GTTTTAAGAA TGAGATACGG GATGGATGGT GATGAGCCAA TGTCTCTTAC AGGTATAGGA AGGGTCCTAG GAATAAGTAG AGATCGAGTA AGAAACCTAG AACGTGATGG TTTAAGAGGC TTGAGAAGAC TTAGTCATAA TGTAGAAGCT TACTTCGTTT CTTGA
|
Protein sequence | MVLFTKHHLA FMSSETISEN KLTSIASLKA SNDVDLVRSY LRDIGRVPLL SHEQEITLGR QVQEYMQVER AELEISELTG DKPSIDELST KLNLSTSVIK KRLRAGQRAK ERMVAANLRL VVSVAKKYTK RNMELLDLIQ EGTIGLVRGV EKFDPARGYK FSTYAYWWIR QGITRAIAEK SRAIRLPIHI TEMLNKLKKG QRELSQEMSR TPTVSELAKY VELPEDDVKD LMCKAGQPVS LETKVGDGED TVLLDLLAGG EDLPDEQIEM DCMRGDLHSL LHQLPDLQCR VLRMRYGMDG DEPMSLTGIG RVLGISRDRV RNLERDGLRG LRRLSHNVEA YFVS
|
| |
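
The sequences above are printed in blocks of 10 characters, and the gene sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch for working with these fields follows: it strips the block spacing, computes a GC fraction, and translates codons. The helper names are hypothetical. The codon-to-residue mapping is the standard one, which translation table 11 shares with table 1 for amino acid assignments (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the 10-character block spacing and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = strip_blocks(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    seq = strip_blocks(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record above.
first_30 = "ATGGTATTGT TTACAAAACA TCATTTAGCA"
print(translate(first_30))  # MVLFTKHHLA
```

The printed peptide matches the first block of the protein sequence field. Running `gc_fraction` and `translate` on the full gene sequence is one way to cross-check the GC content and Protein Length fields against the sequence itself.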