Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_04221 |
Symbol | sun |
ID | 5730806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 397512 |
End bp | 398810 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284779 |
Product | Sun protein (Fmu protein) |
Protein accession | YP_001550307 |
Protein GI | 159902963 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCAAGAA TGGCTGCTTG GAAGGTTTTG CAGGCTGTCT CTGCTGGTGC TTATGCAGAG ACGGCCCTTG ACCAGGTCTT AAACAAATAC TCTATGAAAG CAATTGATAA AGCTCTGACA ACAGAGATTG CTTACGGTTC AATTCGACAA AGGAAGTATT TAGATTCTTG GATTGATAAC TTGGCAAAGA TTTCGGCCTT AAAACAACCT CCTAGGCTGA GATGGCTACT GCATATAGGC CTTTACCAAA TCTTTTTAAT GGAGAGAATA CCTGTTTCTG CAGTAGTGAA TACAACCGTT CAATTGGCCA AAAACAATAA TTTAAATAAG CTTTCTTCGG TAGTAAATGG AATTCTTCGC AATGCGATTC GAATTAGAGA GGCTGGGCAA GGCTTACCAT TCAAGTCAAA TGCTTCGGAA GAGTTAGCAC AGTCTTTCTC GATCCCATTA TGGTTGGCTA ATTCATTGAT CACTTGGCGT GGCGAACAAG GTGCAAAGAG TATTGCTATG GCTTTTAATC AGCCCCCTGC CTTTGATTTA AGAATTAATC GATGTAAAAC AAACCCTAGG AGTGTGCAAG AGATTTTTGA TAAATTTGGG ATTACGAGTC TACCTATTAA AGGATGCACT TCAGGATTGC AAATAACCTC AGGGATGGGT GACTTACGCA AATGGCCTGG ATATGAAGGA GGTGAATGGT CAGTTCAGGA TAGATCATCT CAGTGGATTG CTCCATTGCT TGAAGCTGAA CCTGGCGATC GAATTTTAGA TGCATGCTCT GCTCCAGGAG GCAAGGCAAC TCATCTTGCG GAATTGATTG ACGATAATGG TGAGATATGG GCAGTTGATC GCTCTCCTAA ACGTCTACAG AAAGTGTCTG AGAACGCGAC GCGTTTAGGC TTGAATTCTC TTAAATGCTT GGCTGCTGAT GCCTCAATGT TATTAGACTG TAAGCCCCAC TGGAAGGGCT ATTTTCAAAG AATTTTGGTT GATGCCCCTT GCTCGGGTTT GGGAACATTG AGTAGAAATC CAGATGCTCG TTGGCGAATG ACTCCCGAAA AAATTGATGA GTTGATTATT TTACAAGCCC GGTTACTGAG AGGAGTTCTA CCTTTATTGT CTCCTGGAGG GAGAATCGTA TATTCAACCT GCACTATGCA CCCAGAAGAA AACTTCAAAC AAGTTGGGGA ATTTTTAGCA TTGCACCCCA AAGTAAAACT TAAGTATCAA AATCAAATTT GGCCAGATGA TGCACAATCA GGAGATGGTT TCTATGCGGC AGTTATTGAT ATAGATTAA
|
Protein sequence | MPRMAAWKVL QAVSAGAYAE TALDQVLNKY SMKAIDKALT TEIAYGSIRQ RKYLDSWIDN LAKISALKQP PRLRWLLHIG LYQIFLMERI PVSAVVNTTV QLAKNNNLNK LSSVVNGILR NAIRIREAGQ GLPFKSNASE ELAQSFSIPL WLANSLITWR GEQGAKSIAM AFNQPPAFDL RINRCKTNPR SVQEIFDKFG ITSLPIKGCT SGLQITSGMG DLRKWPGYEG GEWSVQDRSS QWIAPLLEAE PGDRILDACS APGGKATHLA ELIDDNGEIW AVDRSPKRLQ KVSENATRLG LNSLKCLAAD ASMLLDCKPH WKGYFQRILV DAPCSGLGTL SRNPDARWRM TPEKIDELII LQARLLRGVL PLLSPGGRIV YSTCTMHPEE NFKQVGEFLA LHPKVKLKYQ NQIWPDDAQS GDGFYAAVID ID
|
| |