Gene A9601_05861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05861 
Symbolrps1b 
ID4717286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp511516 
End bp512721 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content34% 
IMG OID640078298 
Product30S ribosomal protein S1 protein B, putative Nbp1 
Protein accessionYP_001008979 
Protein GI123968121 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.974619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCAA GTAATAAAAA TGCCCAAGAT AATATCCAAC CAAAGGGCAA TAAAAAACCT 
CTTCAGGTAC TTCACATAAG CAAGAAAGAT TCTCAAGAGA TAAATAATGA GCAAAACAAT
TCGCAAGAAG ATATAAAAAA AGAAAATATT GCAATCAAAC CTCAAATCAT TAAAGATAAT
TCAGTAAAAG AAATTGAGGA CAGTAATGAA AAGACTAAGG ATTTTGATAT TTCTCAACAA
GATTTAACTC AACAAGATTT AAATAGACCG CTTAATTTTT CTGAGCAAAA AATAGATTTT
CAATTAGAAA GAACAGTTGA TGAATTCGAT TTTGATGAAA GTGCTTTTTT GGAGGCTTTA
AATGCAAATG AGCCAATTGG GGCTACTGGA GAAACAATTT CAGGTAAGGT TATAGCAATC
GAAAGTGATG GATTATATGT TGACATTGGC GGAAAGGCAC CTGGTTATAT GCCCAAAAAA
GAATGTGGTT TGGGTGTCAT AACTAACTTT AAAGAAAAGT TTTCTATAGG CCTTGAAATG
GAAGTTTTGG TTATCAAAGA ACAAAATGCT GATGGAATGG TAACAGTGAG CGCTCGGGCA
TTAATTCTCA GGCAAAGTTG GGAGAAAGTA TCAAATTCCG CAAAAAATGG AGAATTAATT
AACGTTTTAA TTAATGGATT TAACAGAGGT GGGCTTACTT GTGACGTAGA TGGATTAAGA
GGATTTATCC CCAGATCCCA ACTTGAAGAT GGTCAAGATT ATCAATCTTT TGTTGGCAAA
AATCTAAAAG TGGCGTTTCT TGAGGTTAAT CCAGAATCCA GAAAATTAGT TCTCTCTGAG
AAGAAAGCAT CATTAGTCTC TAAACTTACA AGTCTTGAAT TAGGTCAATT AATTGAAGGA
GAAGTTTTAG CTGTAAAACC ATATGGCTTT TTTATTGATT TAGGGGGAGC TAGTGGACTT
CTTCATCAAT CCTCACTAAC AAATGGATCG ATTCGTTCTT TACGAGAAGT TTTTAGAGAA
GGGGAAGTTA TAAAAGCTTT GATATCTGAA ATAGACCTCG AAAAAGGGCG CATTGGTCTC
AATACAGCAC TCCTAGAAAA CTCTGCGGGA GAATTAATTA TTGATAAGCA AAAAGTTATG
CAAGAAGCCA CAGAGAGAGC ACTAAAAACT AAAGCACTCT TCGATAAAAA AGAACAAGAT
AAATGA
 
Protein sequence
MGASNKNAQD NIQPKGNKKP LQVLHISKKD SQEINNEQNN SQEDIKKENI AIKPQIIKDN 
SVKEIEDSNE KTKDFDISQQ DLTQQDLNRP LNFSEQKIDF QLERTVDEFD FDESAFLEAL
NANEPIGATG ETISGKVIAI ESDGLYVDIG GKAPGYMPKK ECGLGVITNF KEKFSIGLEM
EVLVIKEQNA DGMVTVSARA LILRQSWEKV SNSAKNGELI NVLINGFNRG GLTCDVDGLR
GFIPRSQLED GQDYQSFVGK NLKVAFLEVN PESRKLVLSE KKASLVSKLT SLELGQLIEG
EVLAVKPYGF FIDLGGASGL LHQSSLTNGS IRSLREVFRE GEVIKALISE IDLEKGRIGL
NTALLENSAG ELIIDKQKVM QEATERALKT KALFDKKEQD K