Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_04881 |
Symbol | sun |
ID | 4720001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 435559 |
End bp | 436872 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640080163 |
Product | Sun protein (Fmu protein) |
Protein accession | YP_001010804 |
Protein GI | 123965723 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.929228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCAACAG GATACCTTCA AAGAAAAGCA TCTTGGGAAA TTTTATTAAA GGTAAGTTCT GGTATCTATT CAGACCATGC ACTTGAGAAG GTACTAAAAA ATTATGAGTT TAATTCATTA GATATAGCCT TCATAACTGA ATTATCTTTT GGATGTATAA GATATAGAAA ATTTCTTGAT ACTTGGATAG ATCACATCTC AAAACTTTCT CATCAAAAAC AACCTCCAAA ACTAAGATGG CTTTTACATA TTGGTTTATA TCAATTATTG AAAATGGATA AAATACCTTT TCCTGCCGCT ATTTCCTCCA CAGTAGAAGT AGCAAAGAGA ACTGATTTAA AAGGTTTAGC TGGTACCGTG AATGCAATAT TAAGAAATAC TGTTAGAAAT ATAGAAAGAG ATAATTGCCC AAAAATATCT ACTGACGAAA TGGAAAAGTT ATCTTGTCTT GAGTCATTGC CATTATGGCT TGTAACTGAA ATTGTTAATT GGGTAGGAAT AAAAGAAGCT AAAAATATCT TTAAAGCATT TAATAAAAAA CCTACGATTG ATTTACGAAT TAATTCACTG AAAACTAATT TCAATAAAAT TTTGAAAGAA CTTAATGAAT GTAATATTCA GGCAGAGCCT ATAAATCAAT TAAATAATGG AGTTGCTTTA AATTCAAATC CAAGATCTAT AAAAAATCTT CCAGGGTATA AAGATGGCCT ATGGGTGGTT CAAGATAGAT CTTCTCAATG GGTGGCTCCT CTTTTAAATC CGAAGAAAGG AGAAAAGATT CTGGATGCTT GTTCTGCTCC TGGAAGCAAA ACAACTCATT TAGCTGCATT AGTAAATGAT GATGCTGAAA TTCTAGCTGT TGATAGATCA GAAAAAAGAT TGAAAATACT ACAGTCAAAT TTAGAAAGAT TAAATATAAA AAGTGTAAAA ACCTTAGAAG CAGATGCCAC AACTTTGATT GATATTAAGC CAAATCTCGC ATCTTATTTT GATAAGATTT TAATTGATGC TCCTTGCTCA GGAATCGGAA CCTTTGCAAG AAATCCAGAT ACAAGATGGT CTTTAAGTAA AGATAAAATA AATCAATTAA TTATTCTTCA GGAAGGATTA CTAGACAGTA TTTTCCCTCT TTTAAAAAAA AATGGAACTT TAGTTTATTC AACGTGTACA ATTTGCCCTG ATGAGAATAA CTTACTCATT AGAAGATTTT TGTCAAAAAA TAAGGAACTT AAATTAGATA GTGAAAGACA AATTTTACCA AGATTTGATA AGCCAGGAGA TGGATTCTAT GCAGCAACAA TATCCTATAA GTAA
|
Protein sequence | MATGYLQRKA SWEILLKVSS GIYSDHALEK VLKNYEFNSL DIAFITELSF GCIRYRKFLD TWIDHISKLS HQKQPPKLRW LLHIGLYQLL KMDKIPFPAA ISSTVEVAKR TDLKGLAGTV NAILRNTVRN IERDNCPKIS TDEMEKLSCL ESLPLWLVTE IVNWVGIKEA KNIFKAFNKK PTIDLRINSL KTNFNKILKE LNECNIQAEP INQLNNGVAL NSNPRSIKNL PGYKDGLWVV QDRSSQWVAP LLNPKKGEKI LDACSAPGSK TTHLAALVND DAEILAVDRS EKRLKILQSN LERLNIKSVK TLEADATTLI DIKPNLASYF DKILIDAPCS GIGTFARNPD TRWSLSKDKI NQLIILQEGL LDSIFPLLKK NGTLVYSTCT ICPDENNLLI RRFLSKNKEL KLDSERQILP RFDKPGDGFY AATISYK
|
| |