Gene P9303_18251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18251 
Symbol 
ID4776089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1585070 
End bp1585996 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content51% 
IMG OID640087334 
Productputative type II alternative sigma factor, sigma70 family 
Protein accessionYP_001017832 
Protein GI124023525 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0401007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGTT CTTTAAGTGC CTATCTTGGG GAGATCGGGC GCCACCAACT TCTGACACCT 
GAACAGGAAC TCACAATGGG TCGGAAGGTT CAGGCCATGG TTGCGCTTAC TGACCGCTGC
CATCTTGCTG GTGGAGCAGG GCCCGAATGC GAATACTCAG ATGATCAGCG CCTCACCATT
CGTCGTGGTG AAAAAGCCAA GAATCAGATG ATCACTGCCA ATTTAAGACT TGTAGTGAAT
CTTGCTAAGC GATACCAGGG CAAGGGTCTT GATCTTCTCG ATCTGATTCA AGAAGGCACG
CTTGGACTTA CCAGAGCAGT GGAGAAATAT GACCCAACCC GTGGCCATAG ATTTTCGACC
TATGCCTATT GGTGGATTCG CCAGGGGCTT AACCGGGCAC TATCCACCCA GAGCAGGACG
ATACGAATTC CCGTCAATGT GAATGAGAAG CTAACCAAAT TAAGGGCAGC AAAGTCAAGA
CTGATGCAGA GCAACGGCTT GCCTCCCACT GCTGAGCAAT TGGTAAAAAC CATGCGACTG
CCGATGGCCG AAGTGGAAGA TCTGCTCGCT TGTGAATTAC GCAGTGTGAC AGTGAGCCTT
CAAGGAGTTG TGAAGTCAAA ATCCGACCCT TCAGAACTTG TGGATGTCCT TCCCAGCGAA
GAGATCCCTC CGATGGAGCG TGCCGAAATG GCGGAAAGGA CTGCATCGGT ATGGACTTTG
CTCAATCGTG CGAACCTCAC TCCTAAAGAA CGGATGGTGG TCACACTTCG TTTTGGACTC
GACGGATCTC ATGAATGGCG CACCCTCGCA GAAGTTGCAC GACATATGAG TTGCAGTCGC
GAATACTGCA GGCAGGTGGT ACAAAGGGCA CTGCGAAAAC TTCGCAAAAC AGGAATTCAG
AGTGGTTTGG TAGAAAGCAC CCTCTAA
 
Protein sequence
MVSSLSAYLG EIGRHQLLTP EQELTMGRKV QAMVALTDRC HLAGGAGPEC EYSDDQRLTI 
RRGEKAKNQM ITANLRLVVN LAKRYQGKGL DLLDLIQEGT LGLTRAVEKY DPTRGHRFST
YAYWWIRQGL NRALSTQSRT IRIPVNVNEK LTKLRAAKSR LMQSNGLPPT AEQLVKTMRL
PMAEVEDLLA CELRSVTVSL QGVVKSKSDP SELVDVLPSE EIPPMERAEM AERTASVWTL
LNRANLTPKE RMVVTLRFGL DGSHEWRTLA EVARHMSCSR EYCRQVVQRA LRKLRKTGIQ
SGLVESTL