Gene P9303_01641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01641 
Symbol 
ID4776823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp181490 
End bp182542 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content55% 
IMG OID640085663 
ProductType II alternative RNA polymerase sigma factor, sigma-70 family protein 
Protein accessionYP_001016184 
Protein GI124021877 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGACTGCT CGGGAGTAAG CTTTACAAAA CGCAATAGAG TTGTCGTGGT CGACTCCGCA 
GTCTCCAAAG CCCTTGTTAA ATCAGCGGTT GTGCCTGCTC GTCAGTTGCC CGCAGATGTC
GACCTGGTGC GTTCATACCT GCGCGATATC GGTCGAGTGC CGCTACTGAG CCATGAGCAG
GAGATCACGC TGGGTCGTCA GGTGCAGGAG TTGATGTCTT TAGAGCAGCT TGAGTCTGAA
CTGGAAGGTA AAACAGGCGC GCCAGCGAGT CGTAAAGAAC TAGCGAAGGC AGCTGGATTG
AGTGAGTTGC AGCTCAAGAA GAAGTTGCAG ATCGGACGAC GTGCGAAGGA GCGGATGGTG
TCGGCGAACC TGCGCTTAGT GGTGAGTGTT GCCAAGAAGT ACACCAAAAG GAATATGGAG
CTTCTTGATT TGATCCAAGA GGGAACGATC GGCTTGGTGA GGGGAGTGGA GAAGTTCGAC
CCAACCCGTG GCTACAAGTT TTCGACCTAT GCGTATTGGT GGATTCGCCA GGGGATCACG
CGTGCGATTG CGGAGAAGAG CCGGACGATC CGTCTGCCGA TCCACATCAC AGAGATGCTG
AACAAGCTCA AGAAAGGCCA GCGAGAGTTA AGTCAGGAGA TGGGGCGCAC GCCAACAGTG
AGCGAACTTG CAGAGTTTGT GGAGTTGCCC GAGGAGGAGG TGAAGGATCT GATGTGCCGT
GCACGTCAGC CGATGAGTTT GGAGATGAAG GTGGGAGATG GGGATGAAAC GGAGTTGCTT
GAGTTGCTTG CCGGGGAAGA GGAGTTACCG AGTGAGAAGG TGGAAGTGGA TTGCATGAAA
GGCGATTTAC GTACCTTGCT GGAAAAGTTG CCCGAGTTGC AGGGTCGTGT GCTGCGGATG
CGTTATGGAA TCGACGGAGG CGAGCCGATG AACCTCACCG GGATTGGTCG CATCCTCGAC
ATCAGTCGTG ACCGTGTTCG CAATCTGGAG CGCCATGGAC TCAATGGTCT GCGCCAGTTG
AGTGAAACGG TTGAGGCCTA TGCGGCTTGC TGA
 
Protein sequence
MDCSGVSFTK RNRVVVVDSA VSKALVKSAV VPARQLPADV DLVRSYLRDI GRVPLLSHEQ 
EITLGRQVQE LMSLEQLESE LEGKTGAPAS RKELAKAAGL SELQLKKKLQ IGRRAKERMV
SANLRLVVSV AKKYTKRNME LLDLIQEGTI GLVRGVEKFD PTRGYKFSTY AYWWIRQGIT
RAIAEKSRTI RLPIHITEML NKLKKGQREL SQEMGRTPTV SELAEFVELP EEEVKDLMCR
ARQPMSLEMK VGDGDETELL ELLAGEEELP SEKVEVDCMK GDLRTLLEKL PELQGRVLRM
RYGIDGGEPM NLTGIGRILD ISRDRVRNLE RHGLNGLRQL SETVEAYAAC