Gene P9301_05231 details


Gene Information

Locus tag: P9301_05231
Symbol:
ID: 4912354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 454709
End bp: 455893
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 34%
IMG OID: 640160103
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_001090747
Protein GI: 126695861
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
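
The Start bp, End bp, Gene Length and Protein Length fields agree with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon counted in the gene length but not in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 454709
end_bp = 455893

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1185, matching "Gene Length"
print(protein_length)  # 394, matching "Protein Length"
```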


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTCCAG TCGCAGCAGA ATCAAAGAAT TCCAAGCCCA GTTCCAAAAA AAAGATCAAT 
AAAAAAATTA ATACAAATTT AGAAACCGTA GTTGAAGAAG ATAGTAATAA AAATAGCCAA
CCCTTAAAAA CATCTGCTGA GTTAAACGAA AGCAGTATTG AAAATAACAA TAATGAATTT
AGTGATTCTG AAGAAGAAGA TAAAGGGCTC GGGAATATAA AACTTGGGCC AAAAGGTATC
TATACTGAAG ATTCAATAAG AGTTTATCTC CAAGAAATCG GAAGAATTAG ACTTTTAAGA
CCAGATGAAG AAATTGAGCT TGCAAGAAAA ATTGCTGACT TACTCCAATT AGAAGAGCTA
GCAACTCAAT ATGAGTCAGA AAAAGGGCAT TTCCCATCTG TAAGAGAATG GGCCGAGTTA
ATAGATATGC CTCTTCCCAA ATTTAGAAGA AGACTGCTCC TAGGGCGGAG AGCAAAAGAA
AAAATGGTTC AATCAAATTT AAGATTAGTT GTTTCAATTG CTAAAAAATA TATGAACAGA
GGTTTATCAT TTCAAGATTT AATCCAAGAA GGAAGTTTGG GTCTAATTAG AGCAGCGGAA
AAATTCGACC ATGAAAAAGG TTATAAGTTC TCTACATATG CAACTTGGTG GATTCGCCAA
GCTATTACAA GAGCAATTGC GGATCAAAGT AGAACTATTA GATTGCCAGT TCACTTATAC
GAGACAATAT CCAGAATTAA AAAAACCACA AAAGTTCTTA GTCAAGAATT TGGCAGGAAA
CCTAGTGAAG AAGAAATCGC TGAGAGTATG GAAATGACAA TCGAAAAATT AAGATTTATA
GCTAAAAGTG CGCAACTTCC TATTTCTTTA GAAACTCCAA TAGGGAAAGA AGAAGACTCA
AGACTAGGAG ACTTTATAGA GGCTGATATA GAAAATCCAG AGCAAGATGT ATCTAAAACT
TTACTAAGAG AAGATTTGGA AGGAGTATTA GCCACTCTTA GTCCAAGAGA AAGAGATGTT
CTCAGATTGA GATATGGAAT TGATGATGGA AGAATGAAAA CTCTTGAAGA AATTGGCCAA
ATTTTTGATG TGACGAGAGA AAGAATTAGA CAAATAGAGG CAAAAGCCCT AAGAAAACTT
AGACATCCAA ATCGAAATGG GGTTTTAAAA GAATATATAA AATAA
 
Protein sequence
MCPVAAESKN SKPSSKKKIN KKINTNLETV VEEDSNKNSQ PLKTSAELNE SSIENNNNEF 
SDSEEEDKGL GNIKLGPKGI YTEDSIRVYL QEIGRIRLLR PDEEIELARK IADLLQLEEL
ATQYESEKGH FPSVREWAEL IDMPLPKFRR RLLLGRRAKE KMVQSNLRLV VSIAKKYMNR
GLSFQDLIQE GSLGLIRAAE KFDHEKGYKF STYATWWIRQ AITRAIADQS RTIRLPVHLY
ETISRIKKTT KVLSQEFGRK PSEEEIAESM EMTIEKLRFI AKSAQLPISL ETPIGKEEDS
RLGDFIEADI ENPEQDVSKT LLREDLEGVL ATLSPRERDV LRLRYGIDDG RMKTLEEIGQ
IFDVTRERIR QIEAKALRKL RHPNRNGVLK EYIK
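
Both sequences are displayed in blocks of 10 characters separated by spaces, so they need light cleaning before use. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_content` and `translate` are hypothetical, and the codon table uses the amino-acid assignments of NCBI translation table 11 (the table named in this record). It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. Only the first line of the gene sequence is used as input.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, with codons ordered
# by base T, C, A, G at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the display spaces and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate whole codons from position 0 until the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied with its display spacing.
first_line = "ATGTGTCCAG TCGCAGCAGA ATCAAAGAAT TCCAAGCCCA GTTCCAAAAA AAAGATCAAT"
print(translate(first_line))  # MCPVAAESKNSKPSSKKKIN
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content field.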