Gene P9211_03471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_03471 
Symbolrps1a 
ID5731488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp327371 
End bp328486 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content39% 
IMG OID641284695 
Product30S ribosomal protein S1 
Protein accessionYP_001550232 
Protein GI159902888 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAAA ACTCACCTGA GGCTTCACAA AAGCTAGAGA CAGAAGAACA GGCAGCTAAG 
AGTATTGATG AGAGTCAAAC CTCTTCACCA GAAATTAATA ACAAGACTCC TGAAGAAACA
GGTAACCAAG TAGACACAGA TATTCCTGAA GATATCCCAA CAGCAGATGA TCCTTCTAGC
AGAGTAAAAA AACACGATTT TGATGGAGTA GGTTTTACTC TTGAAGAGTT TGACTCACTT
TTAAGCAAAT ACGATTACAA CTTCAAGCCT GGCGACATTG TCAATGGAAC AGTTTTTGCT
CTTGAAACAA AGGGAGCGAT GATTGATATA GGAGCAAAAA CGGCAGCTTT CATGCCAATG
CAAGAAGTAT CCATTAATCG TGTTGAAGGT TTAAGCGATG TACTCCAACC TTCAGAAGTA
AGACAATTTT TTATAATGAG TGAAGAGAAT GAAGATGGTC AACTTTCACT TTCTATCCGG
AGGATTGAAT ATCAGCGCGC ATGGGAAAGA GTGAGACAAC TTCAAAAGGA AGATGCAACT
ATTTACTCAG AAGTATTTGC AACAAATAGA GGCGGAGCAC TTGTAAGAGT AGAAGGACTT
AGAGGCTTTA TACCTGGTTC TCATATTAGT ACTAGGAAAG CTAAAGAAGA ATTAGTCGCA
GAGTTCTTAC CACTAAAGTT TTTAGAAGTT GACGAAGAGA GAAATAGATT AGTACTAAGT
CATAGGCGTG CCTTAGTCGA AAGAAAAATG AATCGATTAG AAGTTGGTGA AGTTGTTGTT
GGTGCTGTGA GAGGAATAAA ACCATATGGA GCTTTCATAG ATATAGGAGG GGTAAGTGGA
CTGCTTCATA TCTCGGAAAT TAGCCATGAA CATATTGAAA CCCCTCATTC AGTTCTAAAT
GTCAATGATC AGATGAAAGT GATGATTATT GACCTAGATG CAGAGAGAGG ACGTATTTCT
CTTTCGACAA AAGCACTTGA GCCTGAGCCT GGGGATATGC TGAGCGACCC ACAGAAAGTA
TTTGACAAAG CCGAAGAAAT GGCTGCTAAA TACAAGGAAA TGTTACTTGA GCAAGCAGAG
GAAGGTGAAA ACCCAATAGC AACAATGGAA ATTTAG
 
Protein sequence
MVENSPEASQ KLETEEQAAK SIDESQTSSP EINNKTPEET GNQVDTDIPE DIPTADDPSS 
RVKKHDFDGV GFTLEEFDSL LSKYDYNFKP GDIVNGTVFA LETKGAMIDI GAKTAAFMPM
QEVSINRVEG LSDVLQPSEV RQFFIMSEEN EDGQLSLSIR RIEYQRAWER VRQLQKEDAT
IYSEVFATNR GGALVRVEGL RGFIPGSHIS TRKAKEELVA EFLPLKFLEV DEERNRLVLS
HRRALVERKM NRLEVGEVVV GAVRGIKPYG AFIDIGGVSG LLHISEISHE HIETPHSVLN
VNDQMKVMII DLDAERGRIS LSTKALEPEP GDMLSDPQKV FDKAEEMAAK YKEMLLEQAE
EGENPIATME I