Gene P9211_04221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04221 
Symbolsun 
ID5730806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp397512 
End bp398810 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content42% 
IMG OID641284779 
ProductSun protein (Fmu protein) 
Protein accessionYP_001550307 
Protein GI159902963 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCAAGAA TGGCTGCTTG GAAGGTTTTG CAGGCTGTCT CTGCTGGTGC TTATGCAGAG 
ACGGCCCTTG ACCAGGTCTT AAACAAATAC TCTATGAAAG CAATTGATAA AGCTCTGACA
ACAGAGATTG CTTACGGTTC AATTCGACAA AGGAAGTATT TAGATTCTTG GATTGATAAC
TTGGCAAAGA TTTCGGCCTT AAAACAACCT CCTAGGCTGA GATGGCTACT GCATATAGGC
CTTTACCAAA TCTTTTTAAT GGAGAGAATA CCTGTTTCTG CAGTAGTGAA TACAACCGTT
CAATTGGCCA AAAACAATAA TTTAAATAAG CTTTCTTCGG TAGTAAATGG AATTCTTCGC
AATGCGATTC GAATTAGAGA GGCTGGGCAA GGCTTACCAT TCAAGTCAAA TGCTTCGGAA
GAGTTAGCAC AGTCTTTCTC GATCCCATTA TGGTTGGCTA ATTCATTGAT CACTTGGCGT
GGCGAACAAG GTGCAAAGAG TATTGCTATG GCTTTTAATC AGCCCCCTGC CTTTGATTTA
AGAATTAATC GATGTAAAAC AAACCCTAGG AGTGTGCAAG AGATTTTTGA TAAATTTGGG
ATTACGAGTC TACCTATTAA AGGATGCACT TCAGGATTGC AAATAACCTC AGGGATGGGT
GACTTACGCA AATGGCCTGG ATATGAAGGA GGTGAATGGT CAGTTCAGGA TAGATCATCT
CAGTGGATTG CTCCATTGCT TGAAGCTGAA CCTGGCGATC GAATTTTAGA TGCATGCTCT
GCTCCAGGAG GCAAGGCAAC TCATCTTGCG GAATTGATTG ACGATAATGG TGAGATATGG
GCAGTTGATC GCTCTCCTAA ACGTCTACAG AAAGTGTCTG AGAACGCGAC GCGTTTAGGC
TTGAATTCTC TTAAATGCTT GGCTGCTGAT GCCTCAATGT TATTAGACTG TAAGCCCCAC
TGGAAGGGCT ATTTTCAAAG AATTTTGGTT GATGCCCCTT GCTCGGGTTT GGGAACATTG
AGTAGAAATC CAGATGCTCG TTGGCGAATG ACTCCCGAAA AAATTGATGA GTTGATTATT
TTACAAGCCC GGTTACTGAG AGGAGTTCTA CCTTTATTGT CTCCTGGAGG GAGAATCGTA
TATTCAACCT GCACTATGCA CCCAGAAGAA AACTTCAAAC AAGTTGGGGA ATTTTTAGCA
TTGCACCCCA AAGTAAAACT TAAGTATCAA AATCAAATTT GGCCAGATGA TGCACAATCA
GGAGATGGTT TCTATGCGGC AGTTATTGAT ATAGATTAA
 
Protein sequence
MPRMAAWKVL QAVSAGAYAE TALDQVLNKY SMKAIDKALT TEIAYGSIRQ RKYLDSWIDN 
LAKISALKQP PRLRWLLHIG LYQIFLMERI PVSAVVNTTV QLAKNNNLNK LSSVVNGILR
NAIRIREAGQ GLPFKSNASE ELAQSFSIPL WLANSLITWR GEQGAKSIAM AFNQPPAFDL
RINRCKTNPR SVQEIFDKFG ITSLPIKGCT SGLQITSGMG DLRKWPGYEG GEWSVQDRSS
QWIAPLLEAE PGDRILDACS APGGKATHLA ELIDDNGEIW AVDRSPKRLQ KVSENATRLG
LNSLKCLAAD ASMLLDCKPH WKGYFQRILV DAPCSGLGTL SRNPDARWRM TPEKIDELII
LQARLLRGVL PLLSPGGRIV YSTCTMHPEE NFKQVGEFLA LHPKVKLKYQ NQIWPDDAQS
GDGFYAAVID ID