Gene Information |
Locus tag | A9601_05531 |
Symbol | |
ID | 4717252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 480292 |
End bp | 481476 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640078265 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001008946 |
Protein GI | 123968088 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family; [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
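The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed in this record, but the record itself does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(480292, 481476)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields.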
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTCCAG TAGCAGCAGA ATCAAAGAAT TCCAAGCCTA GTTCAAAAAA AAAGATCAAT AAAAAAATGA ATACAAATTT AGAAACAGTA GTTGAAGAAG ATATTAATAA AAATAATCAA ACTTTACAGA CATCTTCTGA GTCAAACGAA AGCAATATTG AAAATAACAA TAATGAATTT AGTGATTCTG AAGAAGAAGA TAAAGGGCTA GGGAATATAA AACTTGGGCC AAAGGGTATC TACACTGAAG ATTCAATAAG AGTTTATCTC CAAGAAATTG GAAGAATTAG ACTTTTAAGA CCAGATGAAG AAATTGAGCT TGCAAGAAAA ATTGCTGACT TACTCCAATT AGAAGAGCTA GCAACTCAAT ATGAGTCAGA AAAAGGACAT TTCCCATCTG TAAGAGAATG GGCCGAGTTA ATAGATATGC CTCTTGCCAA ATTTAGAAGA AGGCTTCTCT TAGGGCGGAG AGCTAAAGAA AAAATGGTTC AATCAAATTT AAGATTAGTT GTTTCCATTG CTAAAAAATA TATGAACAGA GGTTTATCAT TTCAAGATTT AATCCAAGAA GGAAGTTTGG GTTTAATTAG GGCAGCAGAA AAATTCGACC ATGAAAAAGG TTATAAGTTC TCTACATATG CAACTTGGTG GATTCGCCAA GCAATTACAA GAGCAATTGC GGATCAAAGT AGAACTATTA GATTGCCAGT TCACTTATAC GAGACAATAT CCAGAATTAA AAAAACCACA AAAGTTCTTA GCCAAGAATT TGGCAGGAAA CCTAGTGAAG AAGAAATCGC TGAGAGTATG GAAATGACAA TCGAAAAATT AAGATTTATA GCTAAAAGTG CGCAACTTCC TATTTCTTTA GAAACTCCAA TAGGGAAAGA AGAAGACTCA AGACTAGGGG ACTTTATAGA GGCTGATATA GAAAATCCAG AGCAAGATGT TTCCAAAACT TTACTAAGAG AAGATTTGGA AGGAGTATTA GCCACTCTTA GTCCAAGAGA AAGAGATGTT CTCAGATTGA GATATGGAAT TGACGATGGA AGAATGAAAA CTCTTGAAGA AATTGGCCAG ATTTTTGATG TGACGAGAGA AAGAATTAGA CAAATAGAGG CAAAAGCTCT AAGAAAACTT AGACATCCAA ATCGAAATGG GGTTTTAAAA GAATATATAA AATAA
|
Protein sequence | MCPVAAESKN SKPSSKKKIN KKMNTNLETV VEEDINKNNQ TLQTSSESNE SNIENNNNEF SDSEEEDKGL GNIKLGPKGI YTEDSIRVYL QEIGRIRLLR PDEEIELARK IADLLQLEEL ATQYESEKGH FPSVREWAEL IDMPLAKFRR RLLLGRRAKE KMVQSNLRLV VSIAKKYMNR GLSFQDLIQE GSLGLIRAAE KFDHEKGYKF STYATWWIRQ AITRAIADQS RTIRLPVHLY ETISRIKKTT KVLSQEFGRK PSEEEIAESM EMTIEKLRFI AKSAQLPISL ETPIGKEEDS RLGDFIEADI ENPEQDVSKT LLREDLEGVL ATLSPRERDV LRLRYGIDDG RMKTLEEIGQ IFDVTRERIR QIEAKALRKL RHPNRNGVLK EYIK
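The relationship between the two sequence fields can be sketched in a few lines of Python. The example below is a minimal illustration, not the database's own method: it strips the 10-base grouping spaces, translates codon by codon until the first stop, and computes GC content. It assumes the gene sequence is shown 5' to 3' on the coding strand (the listing starts with ATG and ends with TAA even though the gene is on the minus strand). The amino acid assignments of translation table 11 are the same as those of the standard code; the table's alternative start codons are not handled here because this gene starts with ATG. All function and variable names are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the amino acids of translation table 11
# (identical amino acid assignments to the standard genetic code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the grouping whitespace used in the listing and upper-case it."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last blocks of the gene sequence listed above.
print(translate("ATGTGTCCAG TAGCAGCAGA ATCAAAGAAT"))  # MCPVAAESKN
print(translate("GAATATATAA AATAA"))                  # EYIK
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison against the Protein sequence and GC content fields.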
|
| |