Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04801 |
Symbol | sun |
ID | 4781299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 437072 |
End bp | 438418 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083757 |
Product | Sun protein (Fmu protein) |
Protein accession | YP_001014309 |
Protein GI | 124025193 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.273951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTTTGA GTGAAAACCT GTCATCCATA AAAGGCTTAG ATGCCAGAAA AGCTGCTTGG GAGGTTATCC AAGCCGTAGG TGGAGGTTCA TTCGCAGATG TTGCTTTGGA AAGGATTTTT AATCTTTATT CCTTTAAGTC GATAGATAAA GCTTTGATAA CTGAACTTTC TTATGGTGCA ATTCGCCAAA GATATTACTT GGATTGTTGG ATTGATCATT TAGGGAAAGT ACCCGCTAAA AAACAACCTC CTTTATTGAG ATGGCTATTG CATCTTGGGC TTTATCAGGT TCTAAAAATG AAGAGAATAC CTCCAGCTGC TGCAATTAAC ACAACTGTAG AGCTTGCGAA AACTCATCAT TTAAGAAAGC TAGCCCCCGT TGTTAATGGA ATCTTGCGAT CTGCTCTTAG AAGCAAAGAG AGAGGACTCT TGTTGCCTAA ATCGAATAAT CCAAGTTTGG AATTGGCAAA AAACGAGTCA CTTCCTGTTT GGTTTGCAGA GGAATTGATT GCTTGGAAGG GAGTCGAACA TGCTCAGCAG ATTGCTAAAG CATTCAACAG CGTTAGTCCT ATTGATATAA GAGTGAATAA ATTGCGTGCA GATTTAAAAG ATGTAAAAGA ACTTTTTGAT ACCTGCGGTA TTCAAAATCA ATTAATCCCA AACTGTCCTT CCGGATTGGA GGTACGAGCT GGTATAGGTG AACCTAGACA ATGGCCTGGT TATGAAGAAG GTAAATGGAG TGTTCAAGAT AGATCTTCAC AGTTAATTGC CCCATCATTA GGACCTCTAC CTGGAGAAAA GATTCTTGAT GCTTGTGCTG CACCAGGCGG AAAATCAACA CATATTGCTG AATTAATTAA TAATGAGGGC AATCTCTGGT CTGTTGATCG ATCATCCAGA AGATCTAAAA AAATATTAGC TAACTCAGAG AGGCTTGGGA CTAAATGCTT GCAAATATTG GTTGCTGATT CTAATGAGCT ATTACTCAAA AAGCCAGATT GGAAAGGTTT TTTTGATCGT ATATTAATAG ATGCTCCATG CTCAGGATTG GGTACTCTTG CTCGCCACCC TGATGCAAGA TGGAGGATGA ATCAAGATAA TATTCAGCAG CTTGTCGCTG TTCAAAGTCA GTTGCTTAAC TCGTTAGCGC CTTTATTGAA AAATGGAGGG AAGTTGGTTT ATTCCACTTG TACTATTCAC CCTGAAGAAA ATTCTCATCA GATAAAAAAT TTTCTTCAAT CTAAGTCTGA GTTTTTATTG GAATATGAAA AACAGATCTG GCCTGGAGAG GGAGATAATG GAGATGGTTT TTATATTGCT GTTTTAAATA AATTAAAAAA TCAATAA
|
Protein sequence | MLLSENLSSI KGLDARKAAW EVIQAVGGGS FADVALERIF NLYSFKSIDK ALITELSYGA IRQRYYLDCW IDHLGKVPAK KQPPLLRWLL HLGLYQVLKM KRIPPAAAIN TTVELAKTHH LRKLAPVVNG ILRSALRSKE RGLLLPKSNN PSLELAKNES LPVWFAEELI AWKGVEHAQQ IAKAFNSVSP IDIRVNKLRA DLKDVKELFD TCGIQNQLIP NCPSGLEVRA GIGEPRQWPG YEEGKWSVQD RSSQLIAPSL GPLPGEKILD ACAAPGGKST HIAELINNEG NLWSVDRSSR RSKKILANSE RLGTKCLQIL VADSNELLLK KPDWKGFFDR ILIDAPCSGL GTLARHPDAR WRMNQDNIQQ LVAVQSQLLN SLAPLLKNGG KLVYSTCTIH PEENSHQIKN FLQSKSEFLL EYEKQIWPGE GDNGDGFYIA VLNKLKNQ
|
| |