Gene A9601_03361 details

Gene Information

Locus tag: A9601_03361
Symbol: rps1a
ID: 4717024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 309140
End bp: 310231
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 37%
IMG OID: 640078039
Product: 30S ribosomal protein S1
Protein accession: YP_001008731
Protein GI: 123967873
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID:
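
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 309140
END_BP = 310231
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1092
    print(protein_length(GENE_LENGTH_BP))  # 363
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.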


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAA ATTCTTCCCA AACCATTAAA GAAATTTCTG AGGATCAAGA AATTAAAAAT 
TCGTCTGAGT TAGATAATAA TTCAGCATCT CAAAATGAGG AAGATTTATC ATTCGAGAAG
AGCGATATAC CTCAAGCAGA TTCTTCCTCT AGCAGAACCA ATACGGATTT TGACAACGCA
GGATTCACTC AAGAAGAATT TGCATCACTT TTGGGTAAGT ATGACTATAA CTTTAAGCCT
GGCGATCTAG TTAAAGGTAC CGTTTTTGCT CTAGAGCCCA AAGGGGCCAT GATAGATATA
GGGGCAAAAA CAGCTGCTTT TATGCCTGTT CAGGAGGTTT CAATAAATAG AGTTGAAGGA
CTTAATGATG TTTTACAACC TTCAGAAAGT AGAGAATTTT TCATAATGAG CGAAGAAAAT
GAAGATGGCC AGTTAGCCCT CTCCATTAGA AGAATTGAAT ATCAAAGAGC ATGGGAAAGG
GTTAGACAAC TCCAAAAAGA AGATGCCACT ATATATTCTG AAGTTTTTGC AACAAACAGA
GGTGGGGCTC TTGTGAGAGT GGAGGGCTTG AGAGGCTTTA TCCCAGGCTC ACACATAAGT
GCACGAAGAA TTAAAGATGA CTTAGAAGGT GAATATTTAC CTTTAAAATT TCTTGAAGTC
GATGAAGAGA GAAACAGATT AGTACTAAGC CATAGAAGAG CTTTGGTTGA GAAAAAAATG
AACCGACTCG AGGTAGGCGA AGTTGTTGTT GGTTCTGTAA AAGGTATTAA ACCTTATGGG
GCCTTTATTG ATATTGGTGG AGTTAGTGGT CTATTGCATA TTTCTGAGAT TAGTCATGAA
CATATTGAAA CTCCGCATAA TGTTTTAAAT GTGAGTGACC AAATGAAAGT GATGATAATT
GACCTTGATT CAGAAAGAGG ACGAATTTCA TTATCTACTA AAGCACTTGA ACCTGAACCA
GGAGATATGC TAACTGACCC TCAAAAAGTT TTTAGTAAAG CTGAAGAAAT GGCTGCTAAA
TATAAACAAA TGTTATTCGA ACAGACTGAC GAGAACGAAG AGATCGCCAC AGCTTCAGCT
GAAACACTAT AA
 
Protein sequence
MNENSSQTIK EISEDQEIKN SSELDNNSAS QNEEDLSFEK SDIPQADSSS SRTNTDFDNA 
GFTQEEFASL LGKYDYNFKP GDLVKGTVFA LEPKGAMIDI GAKTAAFMPV QEVSINRVEG
LNDVLQPSES REFFIMSEEN EDGQLALSIR RIEYQRAWER VRQLQKEDAT IYSEVFATNR
GGALVRVEGL RGFIPGSHIS ARRIKDDLEG EYLPLKFLEV DEERNRLVLS HRRALVEKKM
NRLEVGEVVV GSVKGIKPYG AFIDIGGVSG LLHISEISHE HIETPHNVLN VSDQMKVMII
DLDSERGRIS LSTKALEPEP GDMLTDPQKV FSKAEEMAAK YKQMLFEQTD ENEEIATASA
ETL
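
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the method used to generate this record. It relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard genetic code (the two differ only in permitted start codons), and it assumes the sequence contains only the unambiguous bases A, C, G and T. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAACGAAA ATTCTTCCCA AACCATTAAA"))  # MNENSSQTIK
    # Last 12 bases of the gene sequence, ending in the TAA stop codon.
    print(translate("GAAACACTAT AA"))  # ETL
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content field of the record.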