Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20681 |
Symbol | sun |
ID | 4776628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1820678 |
End bp | 1822030 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640087577 |
Product | Sun protein (Fmu protein) |
Protein accession | YP_001018069 |
Protein GI | 124023762 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.398676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCTT CTTCCGTTGC TGCCGCTGAT GGATCTGTAC CTGTACCGGG ACTGCTGCCG CGGCGGGTGG CATGGGAGCT GTTACAGGCA GTGGCGGCAG GGGCCTATGC AGATGTCGCT CTCGAACGAG CTCTTCGTCA GAACCCCATG AGCGGTGCCG ACCGTGGCCT GGTGATGGAA TTGGCTTATG GCGCAATCCG TCAGCGGCAA TGGCTCGATG CTTGGTTGGA TCGTCTTGGC AAGGTGCCTG CCTGCAAACA GCCACCAGTG CTGCGCTGGT TGCTGCATTT GGGGCTCTAT CAGATTCTGC GTATGCAGCG GATTCCAGCT GCAGCGGCAG TGAACACCAG CGTTGAACTT GCTAAGACCG GCAAGCTTGC CCGATTAGCT CCGGTGGTGA ATGGCATTTT GCGGGCGGCA TTGCGTGCAC GCGATGCCGG TATGGTGCTC CTCGAGCCGG AGGACTCTGC CGCTCGGTTG GCTCAAGCGG AATCTCTACC TTTGTGGTTG GTGGAGCAAT TACTTGTTTG GCGAGGCGAG GTGGGAGCTG AGCTGTTTGC TCGTGCCAGC AACCAGGTGC CAACTCTTGA TCTGCGGATC AATCGACGTC GTACAAGCCG TGAGAACGTA AGGCTGGCGC TTGAGGCTAT TGGAGTGGAG AGCACTCCGA TCGAGAGCTG CCCTGATGGT TTGATGGTGA CTGGTAGTGC TGGTGACCTA AGCCAGTGGC CTGGCTATCA GCAAGGACAT TGGTGTGTGC AGGATCGCTC TGCACAGTTG GTCGCACCGC TGTTGGGGCC ACAGCCTGGG GATCGGATTC TTGATGCCTG CGCAGCACCA GGGGGTAAGG CCACTCATCT TGTTGAGCTG ATGGGTGGTT CGGGAGAGGT GTGGGCTGTG GATCGTTCCG CTGGCCGACT CAAGCGCTTG GCGGAGAATG CTGCTCGCTT GGGGGGTGAC TGCCTCCATG CTCTAGTCGC AGATGCCACG AATCTGTTGG CGGTGAAGCC CAGCTGGCGA GGATCCTTCC AGCGCATTCT TGTGGATGCA CCATGTTCTG GTTTAGGTAC TTTGGCCCGT CATGCGGACG CACGTTGGCG AGTCACTCCG TTGCAGGTTG AGGGGCTGGT GATCTTGCAG TCCAAGCTGC TTGAAGGCCT TCTGCCTCTG CTTAGCTCTG GAGGCCGATT GGTTTACGCC ACTTGCACCA TCCATCCGGC CGAGAACTTT GATCAGATCA AGGCCTTCCT TGGTCGGCAT CCTGAATTGA GCTTGTCTCA GGAACAGCAA CTATGGCCTG ATCCTGAGCA TGGTGGTGAT GGTTTTTATT CAGCCGTGTT GGATCTCAGC TGA
|
Protein sequence | MLSSSVAAAD GSVPVPGLLP RRVAWELLQA VAAGAYADVA LERALRQNPM SGADRGLVME LAYGAIRQRQ WLDAWLDRLG KVPACKQPPV LRWLLHLGLY QILRMQRIPA AAAVNTSVEL AKTGKLARLA PVVNGILRAA LRARDAGMVL LEPEDSAARL AQAESLPLWL VEQLLVWRGE VGAELFARAS NQVPTLDLRI NRRRTSRENV RLALEAIGVE STPIESCPDG LMVTGSAGDL SQWPGYQQGH WCVQDRSAQL VAPLLGPQPG DRILDACAAP GGKATHLVEL MGGSGEVWAV DRSAGRLKRL AENAARLGGD CLHALVADAT NLLAVKPSWR GSFQRILVDA PCSGLGTLAR HADARWRVTP LQVEGLVILQ SKLLEGLLPL LSSGGRLVYA TCTIHPAENF DQIKAFLGRH PELSLSQEQQ LWPDPEHGGD GFYSAVLDLS
|
| |