Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_04811 |
Symbol | sun |
ID | 4717179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 416073 |
End bp | 417389 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640078193 |
Product | Sun protein (Fmu protein) |
Protein accession | YP_001008876 |
Protein GI | 123968018 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCATAG GATATTTACA AAGAAAGGCA GCTTGGGAAA TTTTATTAAA AGTTAGTTCG GGTGATTTTT CTGATCATGC TCTTGAAAAG GTTTTAAAAA ATTATCAATT TAATCCTCTT GATATAGCTT TTATTACGGA ATTATCTTTT GGATGCATAA GGTATAGAAA ATTTCTTGAT CTTTGGACGG ATCATACATC AAAAATTACT CATAAAAAGC AGCCTCCAAA GTTAAGATGG CTTCTACATA TAGGTTTATA TCAACTATTG AAAATGGATA AAATTCCATT TCCTGCTGCT ATTACTACGA CTGTAGAAGT AGCTAAAAAA ACAGATTTAA ATGGTTTAGC GGGAACTGTA AATGCGATAT TGAGAAATGC ATCAAGAAAA TTAGAACAAA AAATATTTCC GGAATTATCA TCTGATAGAA AAGAAAGAAT TTCATATCTT GAATCATTCC CATTATGGCT TGTGAAGGAT CTTTATAAAT GGGTCGGTAA TAGTGAGGGT GAAAATATCA TTAGGGCATT TAATAAAAAA CCATCAATTG ATTTGAGAAT TAACCAATTA AAAACTAATT TAGATAACTT TTTGAAAGTA CTTCATGAAA ATAAAATTGA TGCTGAAATT ATTAATGATT TAAATAATGG AATTACTTTA AAATCTAATC CAAGATCTAT AAAAAATTTA CCAGGATATA GTGATGGGCT TTGGACAATT CAAGATAGAT CTTCTCAATG GATAGCACCT CTCTTAAATC CAAAAGAAGG TGAAAAGATT TTAGATGCTT GTGCAGCTCC AGGAAGTAAG TCTACCCACC TTGCAGAATT AACAAATGAT AGTGCTGAAA TAATTGCCGT AGATAGATCA GCAAAAAGAT TAAAAATACT GCAATCAAAT TTAGAAAGGT TAAATTTGAA ATCTGTTAAT ACCCTTAAGG CTGATGCTAC GAGGTTGATT GAATTAAATC CTAAGTTTAT TTCTTATTTT GATAAGATTT TATTAGATGC TCCATGTTCA GGCATTGGAA CTCTTTCCAG GAATCCAGAT TCTAGATGGT CTTTAAGTAA AGAAAAAATA AAATCTTTAA CTTTATTACA GGGAAAACTT TTGGAGAGTA TTTTACCTCT TTTGAAAAAA GATGGCACTT TAGTTTATTC AACTTGTACT ATTTGTCCCG ATGAAAATAA TCTATTAATT GAACGATTTA TTGAAAAAAA CAAAACTTTA AAATTGGTTA GCCAAAAGCA AATTTTACCT AGCTTGGATT ATCCTGGTGA TGGATTTTAT TCTGCAATAA TTTCTTATAA ATCTTAA
|
Protein sequence | MSIGYLQRKA AWEILLKVSS GDFSDHALEK VLKNYQFNPL DIAFITELSF GCIRYRKFLD LWTDHTSKIT HKKQPPKLRW LLHIGLYQLL KMDKIPFPAA ITTTVEVAKK TDLNGLAGTV NAILRNASRK LEQKIFPELS SDRKERISYL ESFPLWLVKD LYKWVGNSEG ENIIRAFNKK PSIDLRINQL KTNLDNFLKV LHENKIDAEI INDLNNGITL KSNPRSIKNL PGYSDGLWTI QDRSSQWIAP LLNPKEGEKI LDACAAPGSK STHLAELTND SAEIIAVDRS AKRLKILQSN LERLNLKSVN TLKADATRLI ELNPKFISYF DKILLDAPCS GIGTLSRNPD SRWSLSKEKI KSLTLLQGKL LESILPLLKK DGTLVYSTCT ICPDENNLLI ERFIEKNKTL KLVSQKQILP SLDYPGDGFY SAIISYKS
|
| |