Gene Information |
Locus tag | P9303_07751 |
Symbol | rps1b |
ID | 4778351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 713694 |
End bp | 715139 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640086284 |
Product | 30S ribosomal protein S1 protein B, putative Nbp1 |
Protein accession | YP_001016791 |
Protein GI | 124022484 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | |
|
|
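The coordinate and length fields above are related by simple arithmetic. The sketch below checks them against each other. It assumes the start and end coordinates are 1-based and inclusive, which the record does not state but which matches the reported gene length. It also assumes the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 713694
end_bp = 715139

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per amino acid, plus one terminal stop codon
# that is not counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1446 481
```

Both results agree with the Gene Length (1446 bp) and Protein Length (481 aa) fields.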
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.761988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTCA CAGGCCCCGA ATGGCCAAAC CAAGGTGGAT CCCTCCAGCC CAACCGCCTC TGCAGCCTCT GGCCAGCTGA GATGGGCAAA GCGCCTGCAA GAAATAGACA TCAGCACCAA TCTGCGAACA GAACCACACC CTGCCTGTCA GAATGCTTGA CCGACGACCC CAAGGCCCCC ATGGCCGGAT CAGGCAGTCC GCAGCCCAAT AGACCAAAGC CCCCTAAACC CGCAGCGGAA GCCCCCCGCA AGCCCCTGCA GGTCATGCAC ATCAGCAGGC GTGGGGAGCA AGAAAAGCTA GTACGAGAAG CCGCCGAAAC AACTTCTCCT GGCAGCGAAG CAACTACAGG ATCAGGCCAG CTCAGCAACG CCCCCAATCG ATCCGTCTCA GCTGATGCAG CCTCAGATGA GAGTCGTTTT GATCTCGGCG AGCTGCAAAA CATGACGATG GCCGATCTAC TTGGTCCGGC CGATCAGTCA CGCCGAAGCG GTGCGGCCCC CAAGGGCAAC GATTATCGAA ACGAAGAAGG GCAATCCAAC CCAGCACGCA GTGTCGACGA TTTCGACTTC GATGAAGACG CCTTCTTGGC TGCTCTAGAT GAAAACGAAC CAATTGGAAC CACAGGGGAA GTGGCTACAG GCAAGGTCAT CGCTTTGGAA AGTGATGGTG TTTACGTCGA CATCGGCGGG AAAGCACCAG GCTTTATGCC CAAAAATGAA TGCGGCCTTG GCGTAATTAC CAACCTCAAA GAGCGCTTCC CAAAGGGTTT AGAGGTCGAA GTACTCGTCA CCCGAGAGCA AAATGCCGAT GGGATGGTCA CCATCAGCTG TCGAGCTCTA GAGCTGCGTA AGAGCTGGAG CAAGGTGCAG CAGATGGAGA AGGAAGGCAA GGTCGCCCAG GTCAAGGTCA ATGGATTCAA CCGTGGTGGA GTGACCTGCG ACCTTGAAGG CCTGAGAGGA TTTATCCCCC GCTCTCAGCT CCAAAATGGA GAGAATCACG AAGCGCTTGT CGGAAAAACC CTTGGTGTGG CCTTCTTAGA AGTCAATCCA GAAACCCGCA AGCTGGTGCT TTCAGAGAAG CGGGCCGCTA CCGCCGCCCG CTTCTCTGAA CTTGAAGTAG GACAGCTCGT GGAAGGTCAA GTCGTAGCAG TGAAGCCCTA CGGTTTCTTC ATAGACCTAG GCGGTGTGAG TGGCCTCCTT CACCAATCCA TGATCACCGG TGGCAGTCTT AGATCCCTGC GGGAGGTATT CAACCAAGGC GATCGAGTCA AAGCCTTGAT CACCGAAATG GACCCTGGTC GCGGACGCAT TGCCCTGAAC ACAGCCCTAC TGGAAGGACA ACCGGGCGAA CTCCTAATCG AAAAAGATAA GGTTATGGCT GAAGCGACTG ATCGAGCCAA CAAAGCTCGT AACGTCCTTA GGCAGCAGGA ACAGTCAGCA GGATGA
|
Protein sequence | MLFTGPEWPN QGGSLQPNRL CSLWPAEMGK APARNRHQHQ SANRTTPCLS ECLTDDPKAP MAGSGSPQPN RPKPPKPAAE APRKPLQVMH ISRRGEQEKL VREAAETTSP GSEATTGSGQ LSNAPNRSVS ADAASDESRF DLGELQNMTM ADLLGPADQS RRSGAAPKGN DYRNEEGQSN PARSVDDFDF DEDAFLAALD ENEPIGTTGE VATGKVIALE SDGVYVDIGG KAPGFMPKNE CGLGVITNLK ERFPKGLEVE VLVTREQNAD GMVTISCRAL ELRKSWSKVQ QMEKEGKVAQ VKVNGFNRGG VTCDLEGLRG FIPRSQLQNG ENHEALVGKT LGVAFLEVNP ETRKLVLSEK RAATAARFSE LEVGQLVEGQ VVAVKPYGFF IDLGGVSGLL HQSMITGGSL RSLREVFNQG DRVKALITEM DPGRGRIALN TALLEGQPGE LLIEKDKVMA EATDRANKAR NVLRQQEQSA G
|
| |
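As a consistency check between the two sequence fields, the sketch below translates the first 30 bases of the gene sequence and compares the result with the start of the protein sequence. It assumes the gene sequence is listed in coding orientation (5' to 3' of the coding strand) even though the gene lies on the minus strand, which the matching first codons support. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the permitted start codons, so the standard codon layout is used here. The helper name `translate` is hypothetical and not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid mapping, with '*' marking stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence and the first 10-residue
# group of the protein sequence, copied from the record above.
gene_head = "ATGCTGTTCA CAGGCCCCGA ATGGCCAAAC"
protein_head = "MLFTGPEWPN"

print(translate(gene_head) == protein_head)  # prints: True
```

The same function can be applied to the full gene sequence. In that case the final codon, TGA, translates to `*` and should be dropped before comparing against the 481-residue protein sequence.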