Gene P9211_04961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04961 
Symbol 
ID5730552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp464840 
End bp466117 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content39% 
IMG OID641284855 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001550381 
Protein GI159903037 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.147427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCCTG TCGCCTCCAA AAAAACAGTT GCAAAGGCTA AAACCAAAGC AACAAAAAAG 
TCCAGCACAA AGGCTGGTGA GAGTAAAACA TTGAAAAAAG GGAAATCAAT CTCCGTTACA
AAAAAATCAA AGCCTAGCTC TACGAAGAAA GCTTCCTCAT CGCAAGACAA AGACTTAGAT
CTCGCAGCGG ACCAATTACT TGCAGAAGCT AATAACAATG AAATCAATCC AGAAACAGAA
ATAACCTTGA ATATGAATTC TCTGGATGAT TCAAATGACC AAGAGGCTAG TGACAAAACT
CTTGCAAGTA TAAAAATTGG ACCAAAGGGA GTTTATACAG AAGACTCAAT CAGAGTGTAT
TTACAAGAAA TTGGCCGCAT TAGATTACTT AGGCCTGATG AAGAAATTGA ACTCGCAAGG
AAGATTGCAG ATCTTCTAAA TTTAGAAGAA ATCGCTATCC AGTTTGAAAG CGAAAATGGA
CATTACCCCT CCAAAAAAGA GTGGGCTGCG TTGGTTGAAA TGCCAGTCAT AAGGTTTCGC
CGGAGACTAA TGCTTGGAAG GCGTGCGAAA GAAAAAATGG TTCAATCAAA CCTTCGCTTA
GTTGTTTCAA TAGCTAAAAA ATATATGAAT AGAGGACTTT CATTCCAAGA TCTGATTCAA
GAAGGAAGTC TGGGGCTTAT TCGTGCAGCA GAGAAATTCG ACCATGAAAA AGGCTATAAG
TTCTCCACTT ATGCAACTTG GTGGATTCGT CAAGCAATTA CAAGAGCGAT TGCAGATCAA
AGTAGAACAA TTAGGCTTCC TGTCCACCTT TACGAAACAA TTTCAAGAAT CAAAAAAACA
ACCAAAGTTC TAAGTCAAGA ATTTGGGAGA AAACCTTCAG AAGAAGAAAT TGCAGAGAGT
ATGGAGATGA CTATTGAAAA GCTTCGCTTT ATTGCGAAGA GTGCTCAGTT ACCGATTTCT
TTAGAAACTC CTATAGGTAA AGAAGAAGAC TCACGTCTTG GTGATTTCAT TGAATCAGAC
TCAGAGAACC CTGAATTAGA TGTTGCCAAA ACACTACTTC GCGAGGACCT AGAAGGAGTT
CTCGCAACTC TGAGCCCAAG AGAACGAGAT GTCCTAAGAC TTCGTTATGG ACTTGATGAT
GGTCGTATGA AGACCCTTGA AGAAATTGGG CAAATATTTG ATGTAACCCG CGAGCGTATC
CGTCAAATTG AAGCAAAAGC TCTACGCAAA CTTCGTCATC CAAATCGCAA TGGAGTGCTT
AAAGAATACA TTAAATAA
 
Protein sequence
MSPVASKKTV AKAKTKATKK SSTKAGESKT LKKGKSISVT KKSKPSSTKK ASSSQDKDLD 
LAADQLLAEA NNNEINPETE ITLNMNSLDD SNDQEASDKT LASIKIGPKG VYTEDSIRVY
LQEIGRIRLL RPDEEIELAR KIADLLNLEE IAIQFESENG HYPSKKEWAA LVEMPVIRFR
RRLMLGRRAK EKMVQSNLRL VVSIAKKYMN RGLSFQDLIQ EGSLGLIRAA EKFDHEKGYK
FSTYATWWIR QAITRAIADQ SRTIRLPVHL YETISRIKKT TKVLSQEFGR KPSEEEIAES
MEMTIEKLRF IAKSAQLPIS LETPIGKEED SRLGDFIESD SENPELDVAK TLLREDLEGV
LATLSPRERD VLRLRYGLDD GRMKTLEEIG QIFDVTRERI RQIEAKALRK LRHPNRNGVL
KEYIK