Gene P9303_29561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_29561 
Symbol 
ID4777820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2614605 
End bp2615735 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content53% 
IMG OID640088480 
Productputative type II alternative RNA polymerase sigma factor 
Protein accessionYP_001018951 
Protein GI124024644 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAGCA GCCCCCAGCA AACCAAGACT TCCCTAGTCC GCAGAAGCGG CAGCACTGAT 
CCTGTGCGTC TCTACTTACA GGACATTGGA CGCGTAGAGC TACTGACCCA TGAAGAGGAA
GTAACCCTGG CGCGACTTGT GCAGCGCAGG GAAGCCCTAC TCAAACAGGA AAGGCAACTG
GCCTCCAGCC AAGAAGCGAT CAAAGAATTA CAAAGACTGG AGGAGTTACA GCAGCGAGAA
GCGAACCATT CCTGCCACTG GCCCACCAAA CAGGAATGGG CTATGGCTGC TGGGCTCACC
CTGGCCGAGC TGCAAGACAA AATCGAGACT GGTTACAAAA CCTGGGGAGC CCTGACTGGT
CTTGACCCCT TGGAACTCAA GCGAAGTTTG CGAGCTGGTC GGCGTGCCAA GGATCAGATG
ATCCAGGCCA ACCTTCGGCT TGTGGTGGCT GTAGCCAAGA AATATCAACA ACGGGGCATA
GAACTGCTTG ATCTGGTGCA AGAAGGCACC CTGGGCTTGG AACGCGCAGT AGAGAAATTT
GACCCGGCTA GAGGTTTCCG CTTCAGCACC TACGCCTACT GGTGGATCCG TCAGGGCATC
ACAAGGGCCA TTGCGACGCA AAGTCGGACG ATCCGACTGC CGATGCACAT CACCGAAAAA
CTAAACCGCA TCAAACGGGT TCAACAGGAG ATTGCTAGCA ACCAAGGACG ATTAGCTTCG
ATTGCCGATC TCGCCAAGGC ACTTGGCCTT AGTGAAGAAA CAGTGCGCCT AACCCTAATG
AGGGTCCCCC GTTCGATCTC CTTGGAAACT CGAATAGGCC AAGAACAAGA CAGCCAACTA
GGCGATCTGC TGGAAGACAG CAACGCGACC CCAGAGGAGA AACTCACCCG CGATCAATTG
CACAACGACC TTGAAATCTT GCTGGATGAA CTAAGCAACC GCGAAGCGAC AGTGATCAGA
CGACGTTTTG GACTTGAAGA CGACACTCCT CAAACCCTGA CGCAAATTGG CGAGGCAATG
CATCTATCGC GAGAACGAGT TCGTCAGATC GAAAGCCATG CCCTCTTGAA ATTGCGTCAA
CCACAACGTC GCTGCAAGGT ACGGGACTAC ATTCAAAATC TCGATTCCTG A
 
Protein sequence
MKSSPQQTKT SLVRRSGSTD PVRLYLQDIG RVELLTHEEE VTLARLVQRR EALLKQERQL 
ASSQEAIKEL QRLEELQQRE ANHSCHWPTK QEWAMAAGLT LAELQDKIET GYKTWGALTG
LDPLELKRSL RAGRRAKDQM IQANLRLVVA VAKKYQQRGI ELLDLVQEGT LGLERAVEKF
DPARGFRFST YAYWWIRQGI TRAIATQSRT IRLPMHITEK LNRIKRVQQE IASNQGRLAS
IADLAKALGL SEETVRLTLM RVPRSISLET RIGQEQDSQL GDLLEDSNAT PEEKLTRDQL
HNDLEILLDE LSNREATVIR RRFGLEDDTP QTLTQIGEAM HLSRERVRQI ESHALLKLRQ
PQRRCKVRDY IQNLDS