Gene Information |
Locus tag | P9301_04501 |
Symbol | sun |
ID | 4911296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 390493 |
End bp | 391809 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640160028 |
Product | Sun protein (Fmu protein) |
Protein accession | YP_001090674 |
Protein GI | 126695788 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCATAG GATATTTACA AAGAAAGGCA GCTTGGGAAA TTTTATTAAA AGTTAGTTCT GGTGATTTTT CTGATCATGC TCTTGAAAAG GTTTTAAAAA ATTATCAATT TAATCCTCTT GATATAGCTT TTATTACGGA ATTATCTTTT GGATGCATAA GGTATAGAAA ATTTCTTGAT CTTTGGACGG ATCACACATC AAAAATTACT CATAAAAAGC AGCCTCCAAA GTTAAGATGG CTTCTACATA TAGGTTTGTA TCAACTATTG AAAATGGATA AAATCCCATT TCCTGCTGCT ATTTCTACCA CTGTAGAAGT AGCTAAAAAA ACTGATTTAA ATGGTTTAGC GGGAACTGTA AATGCGATAT TGAGAAATGC ATCAAGAAAA TTAGAACAAA AAATCTTTCC GGAATTATCT TCTGATAGAA AAGAAAGAAT TTCATATCTT GAATCATTTC CATTATGGCT TGTAAAGGAT CTTTATAAAT GGGTCGGCAA TAGCGAGGGT GAAAATATCA TTAAGGCATT AAATAAAAAA CCATCAATTG ATTTGAGAAT TAACAAATTA AAAACTAATT TAGATAACTT TTTGAAAGTA CTTCATGAAA ATAAAATTGA TGCTGAAATT ATTAAAGATT TACATAATGG AATTACTTTA AAATCTAATC CAAGATCTAT AAAAAATTTA CCAGGATATA GTGATGGACT TTGGACAATT CAAGATAGAT CTTCTCAGTG GATAGCTCCT CTCTTGAATC CAAAACAAGG TGAAAAGATT TTAGATGCTT GTGCAGCTCC AGGAAGTAAG TCTACCCACC TTGCAGAATT AACAAATGAT AGTTCTGAAA TCATTGCCGT AGATAGATCG GCAAAAAGAT TGAAAATACT GCAATCAAAT TTAGAAAGGT TAAATTTGAA ATCTGTTAAT ACCCTTAAGG CTGATGCTAC GAGTTTGATT GAATTAAATC CTAAGTTTAT ATCTTATTTT GATAAGATTT TATTAGATGC TCCATGTTCA GGCATTGGAA CTCTTTCCAG GAATCCAGAT TCTAGATGGT CTTTAAGTAA AGAAAAAATA AAATCTTTAA CTTTATTGCA GGAAAAACTT TTGGACAGTA TTTTCCCTCT TTTGAAAAAA GATGGTACTT TAGTTTATTC AACTTGTACA ATTTGTCCCG ATGAAAATAA TCTATTAATT GAACGATTTA TTGAAAAAAA CAAAACTTTA AAATTGGTTA GCGAAAAGCA AATTTTACCT AGCTTGGATT ACCCTGGCGA TGGATTTTAT TCTGCAATAA TTTCTTATAA ATCTTAA
|
Protein sequence | MSIGYLQRKA AWEILLKVSS GDFSDHALEK VLKNYQFNPL DIAFITELSF GCIRYRKFLD LWTDHTSKIT HKKQPPKLRW LLHIGLYQLL KMDKIPFPAA ISTTVEVAKK TDLNGLAGTV NAILRNASRK LEQKIFPELS SDRKERISYL ESFPLWLVKD LYKWVGNSEG ENIIKALNKK PSIDLRINKL KTNLDNFLKV LHENKIDAEI IKDLHNGITL KSNPRSIKNL PGYSDGLWTI QDRSSQWIAP LLNPKQGEKI LDACAAPGSK STHLAELTND SSEIIAVDRS AKRLKILQSN LERLNLKSVN TLKADATSLI ELNPKFISYF DKILLDAPCS GIGTLSRNPD SRWSLSKEKI KSLTLLQEKL LDSIFPLLKK DGTLVYSTCT ICPDENNLLI ERFIEKNKTL KLVSEKQILP SLDYPGDGFY SAIISYKS
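The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon (the gene sequence above ends in TAA); the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 390493
END_BP = 391809
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the record.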