Gene A9601_05531 details

Gene Information

Locus tag: A9601_05531
Symbol:
ID: 4717252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 480292
End bp: 481476
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 34%
IMG OID: 640078265
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_001008946
Protein GI: 123968088
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
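
The reported lengths can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon, which is consistent with the figures in this record. The function names are hypothetical and are not part of any database tool.

```python
# Coordinates as reported in the record above.
START_BP = 480292
END_BP = 481476


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1185 394
```

Both values agree with the Gene Length and Protein Length fields.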


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTCCAG TAGCAGCAGA ATCAAAGAAT TCCAAGCCTA GTTCAAAAAA AAAGATCAAT 
AAAAAAATGA ATACAAATTT AGAAACAGTA GTTGAAGAAG ATATTAATAA AAATAATCAA
ACTTTACAGA CATCTTCTGA GTCAAACGAA AGCAATATTG AAAATAACAA TAATGAATTT
AGTGATTCTG AAGAAGAAGA TAAAGGGCTA GGGAATATAA AACTTGGGCC AAAGGGTATC
TACACTGAAG ATTCAATAAG AGTTTATCTC CAAGAAATTG GAAGAATTAG ACTTTTAAGA
CCAGATGAAG AAATTGAGCT TGCAAGAAAA ATTGCTGACT TACTCCAATT AGAAGAGCTA
GCAACTCAAT ATGAGTCAGA AAAAGGACAT TTCCCATCTG TAAGAGAATG GGCCGAGTTA
ATAGATATGC CTCTTGCCAA ATTTAGAAGA AGGCTTCTCT TAGGGCGGAG AGCTAAAGAA
AAAATGGTTC AATCAAATTT AAGATTAGTT GTTTCCATTG CTAAAAAATA TATGAACAGA
GGTTTATCAT TTCAAGATTT AATCCAAGAA GGAAGTTTGG GTTTAATTAG GGCAGCAGAA
AAATTCGACC ATGAAAAAGG TTATAAGTTC TCTACATATG CAACTTGGTG GATTCGCCAA
GCAATTACAA GAGCAATTGC GGATCAAAGT AGAACTATTA GATTGCCAGT TCACTTATAC
GAGACAATAT CCAGAATTAA AAAAACCACA AAAGTTCTTA GCCAAGAATT TGGCAGGAAA
CCTAGTGAAG AAGAAATCGC TGAGAGTATG GAAATGACAA TCGAAAAATT AAGATTTATA
GCTAAAAGTG CGCAACTTCC TATTTCTTTA GAAACTCCAA TAGGGAAAGA AGAAGACTCA
AGACTAGGGG ACTTTATAGA GGCTGATATA GAAAATCCAG AGCAAGATGT TTCCAAAACT
TTACTAAGAG AAGATTTGGA AGGAGTATTA GCCACTCTTA GTCCAAGAGA AAGAGATGTT
CTCAGATTGA GATATGGAAT TGACGATGGA AGAATGAAAA CTCTTGAAGA AATTGGCCAG
ATTTTTGATG TGACGAGAGA AAGAATTAGA CAAATAGAGG CAAAAGCTCT AAGAAAACTT
AGACATCCAA ATCGAAATGG GGTTTTAAAA GAATATATAA AATAA
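
A GC content figure such as the one reported for this gene can be recomputed from the sequence itself. The following is a minimal sketch, assuming the sequence is pasted as shown above, in space-separated blocks of ten bases. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence above, used as a small example.
print(gc_percent("ATGTGTCCAG TAGCAGCAGA"))  # 50.0
```

Passing the full gene sequence to the same function gives a value to compare with the GC content field.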
 
Protein sequence
MCPVAAESKN SKPSSKKKIN KKMNTNLETV VEEDINKNNQ TLQTSSESNE SNIENNNNEF 
SDSEEEDKGL GNIKLGPKGI YTEDSIRVYL QEIGRIRLLR PDEEIELARK IADLLQLEEL
ATQYESEKGH FPSVREWAEL IDMPLAKFRR RLLLGRRAKE KMVQSNLRLV VSIAKKYMNR
GLSFQDLIQE GSLGLIRAAE KFDHEKGYKF STYATWWIRQ AITRAIADQS RTIRLPVHLY
ETISRIKKTT KVLSQEFGRK PSEEEIAESM EMTIEKLRFI AKSAQLPISL ETPIGKEEDS
RLGDFIEADI ENPEQDVSKT LLREDLEGVL ATLSPRERDV LRLRYGIDDG RMKTLEEIGQ
IFDVTRERIR QIEAKALRKL RHPNRNGVLK EYIK
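
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a simplified illustration, not the pipeline that produced this record. It uses the codon assignments of translation table 11, which match the standard code for internal codons. It does not handle alternative start codons, which table 11 permits, because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid string in TCAG order (same assignments for tables 1 and 11).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence above.
print(translate("ATGTGTCCAG TAGCAGCAGA A"))  # MCPVAAE
print(translate("GAATATATAA AATAA"))         # EYIK
```

The two fragments reproduce the first seven and last four residues of the protein sequence listed above.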