Gene P9301_03371 details

Gene Information

Locus tag: P9301_03371
Symbol: rps1a
ID: 4912480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 308720
End bp: 309811
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 36%
IMG OID: 640159907
Product: 30S ribosomal protein S1
Protein accession: YP_001090561
Protein GI: 126695675
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID:
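
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(308720, 309811)
print(length_bp)                  # 1092
print(protein_length(length_bp))  # 363
```

Both values agree with the Gene Length and Protein Length fields of the record.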


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAA ATTCTTCTCA AACCATTAAA GAACTTTCTG AGAATCAAGA AATTAAAAAT 
TCGTCTGAGT TAGATAATGA TGCAGCCTCT CAAAATGAGG AGGATTTATC ATTCGAAAAG
AGCGATATAC CTTCAGCAGA TTCTTCCTCT AGCAGAACAA ATACTGACTT TGACAATGCA
GGATTTACAC AAGAAGAATT TGCATCACTT TTGGGTAAGT ATGACTATAA CTTTAAGCCT
GGCGATCTAG TTAAAGGCAC CGTTTTTGCT CTAGAACCCA AAGGGGCCAT GATAGATATA
GGGGCAAAAA CAGCTGCTTT TATGCCTGTT CAGGAGGTTT CAATAAATAG AGTTGAAGGA
CTTAATGATG TTTTGCAGCC TTCTGAAAGT AGAGAATTTT TCATAATGAG CGAAGAAAAT
GAAGATGGCC AATTAGCTCT CTCCATTAGA AGAATTGAAT ATCAAAGAGC ATGGGAAAGG
GTTAGACAAC TCCAAAAAGA AGATGCCACT ATCTATTCTG AAGTTTTTGC AACAAACAGA
GGCGGGGCAC TTGTTAGGGT AGAAGGTTTG AGAGGTTTTA TCCCAGGCTC ACATATAAGT
GCTCGAAAAA TCAAAGATGA CTTAGAAGGT GAATATTTAC CTTTAAAGTT TCTTGAAGTT
GATGAAGAGA GAAATAGATT AGTACTAAGT CATAGAAGAG CTTTGGTTGA GAAAAAGATG
AACCGACTTG AGGTAGGAGA AGTTGTTGTT GGTAATGTAA AAGGTATTAA ACCTTATGGA
GCTTTCATTG ATATTGGTGG AGTTAGTGGT CTATTGCACA TTTCTGAGAT TAGTCATGAA
CATATTGAGA CTCCTCATAA TGTTTTAAAT GTGAATGACC AAATGAAAGT TATGATAATT
GACCTTGATT CAGAAAGAGG ACGTATTTCA TTATCTACTA AAGCACTTGA GCCTGAACCA
GGAGATATGC TAACTGACCC TCAAAAAGTT TTTAGTAAAG CTGAAGAAAT GGCTGCGAAA
TACAAACAAA TGTTATTTGA ACAAACTGAC GATATTGAAG AGATTCCCAC AGCGTCAAAT
GAAGCAGAAT AA
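
The sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content reported in the record. A minimal sketch follows, assuming the blocks contain only A, C, G and T; the helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "ATGAACGAAA ATTCTTCTCA AACCATTAAA GAACTTTCTG AGAATCAAGA AATTAAAAAT"
print(len(clean_sequence(first_line)))
print(round(100 * gc_fraction(first_line)))
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.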
 
Protein sequence
MNENSSQTIK ELSENQEIKN SSELDNDAAS QNEEDLSFEK SDIPSADSSS SRTNTDFDNA 
GFTQEEFASL LGKYDYNFKP GDLVKGTVFA LEPKGAMIDI GAKTAAFMPV QEVSINRVEG
LNDVLQPSES REFFIMSEEN EDGQLALSIR RIEYQRAWER VRQLQKEDAT IYSEVFATNR
GGALVRVEGL RGFIPGSHIS ARKIKDDLEG EYLPLKFLEV DEERNRLVLS HRRALVEKKM
NRLEVGEVVV GNVKGIKPYG AFIDIGGVSG LLHISEISHE HIETPHNVLN VNDQMKVMII
DLDSERGRIS LSTKALEPEP GDMLTDPQKV FSKAEEMAAK YKQMLFEQTD DIEEIPTASN
EAE
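
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as printed.
first_line = "ATGAACGAAA ATTCTTCTCA AACCATTAAA GAACTTTCTG AGAATCAAGA AATTAAAAAT"
print(translate(first_line))  # MNENSSQTIKELSENQEIKN
```

The output matches the first 20 residues of the protein sequence; translating the final line of the gene sequence, `GAAGCAGAAT AA`, gives `EAE`, the last three residues.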