Gene Information |
Locus tag | P9211_12801 |
Symbol | |
ID | 5731695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1154922 |
End bp | 1155842 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641285649 |
Product | putative tetrapyrrole methylase family protein |
Protein accession | YP_001551165 |
Protein GI | 159903821 |
COG category | [R] General function prediction only |
COG ID | [COG0313] Predicted methyltransferases |
TIGRFAM ID | [TIGR00096] probable S-adenosylmethionine-dependent methyltransferase, YraL family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.105851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0426516 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTTT TAGAAAACGA AGCTTTAGGA AAATCGAGCG AACCAACCTC AGGGAATCTA TACATTATTG GAACACCTAT AGGAAATCTT GGAGATTTAT CTCCAAGAGC AAAATCTATA CTTCAAAAAG TCTCACTTAT AGCTTGTGAA GACACGCGTC ATAGTGGGCA GCTTCTAAAA AAGTTAGGAA TTAAAAATAA TCTTATAAGT TTTCACAAGC ACAACACTCA AAGCAGACTC CCGAAGCTAT TGAAATGCCT AAAAGAGGGA CAAAATATTG GATTGATTAG TGATGCTGGA CTTCCGGGCA TTAGCGACCC TGGCGAGGAG CTTGTTAAAG CGGCTAAAGA AGCCGGTTAT TCAGCGATTT GCATACCGGG TCCCTGTGCA ATAACTACAG CGTTAGTCAG TAGTGGCTTA CCTTCGCAAA AGTTTTGCTT CGAGGGATTT CTTCCATCAA AGACAAAAGA TCGCAACAAA GCTCTTTCTT CTATTGCAAA TGAAGAAAGA ACAACTGTTA TTTATGAATC TCCTAAAAAG TTAATAAAGC TATTAGAACA ACTATATGAA TTATGCGGAG AAGACAGGCC GGTTCAAGTC GCCCGAGAAT TGACCAAAAA ATATGAGGAG CATATTGGTC CAACTCTTGG AGAAGTACTA AAACATTTCA AAGAAAATAA GCCTAAAGGT GAATGCACAA TTGTCTTGGG AGGCACTGAA AAGTATAAAA AGAAAATAGC CAATCAAAGC CAAACTGAGT TGCTTAAAAA GATGGAAGCT ATAATCAAAA CAGGTGCAAG CGCAAATTTT GCTGCTAAAC AAATCTCAAA TGAAACTAAA TTATCAAAAA GGTTTCTTTA TGAATTACTT CACAATAAAT CCAATCTTGA TAGTCAAATA GACACTAAAG AGGCAAAATG A
|
Protein sequence | MDFLENEALG KSSEPTSGNL YIIGTPIGNL GDLSPRAKSI LQKVSLIACE DTRHSGQLLK KLGIKNNLIS FHKHNTQSRL PKLLKCLKEG QNIGLISDAG LPGISDPGEE LVKAAKEAGY SAICIPGPCA ITTALVSSGL PSQKFCFEGF LPSKTKDRNK ALSSIANEER TTVIYESPKK LIKLLEQLYE LCGEDRPVQV ARELTKKYEE HIGPTLGEVL KHFKENKPKG ECTIVLGGTE KYKKKIANQS QTELLKKMEA IIKTGASANF AAKQISNETK LSKRFLYELL HNKSNLDSQI DTKEAK
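The fields in this record can be cross-checked against each other: the coordinate span (End bp minus Start bp, plus one) should equal the gene length, the gene length divided by three should equal the protein length plus one stop codon, and translating the gene sequence should reproduce the protein sequence. The following is a minimal Python sketch of those checks, not part of the original record. The function names (`clean`, `translate`, `gc_percent`, `span_length`) are hypothetical helpers introduced for illustration. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons, so a standard codon table is used for elongation.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base
# varies slowest). Assumption: table 11 shares these assignments and
# differs from the standard code only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def span_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Coordinates and lengths taken from the record above.
    length = span_length(1154922, 1155842)
    print(length)            # 921
    print(length // 3 - 1)   # 306 (codons minus the stop codon)
    # First three blocks of the gene sequence from the record.
    print(translate("ATGGATTTTT TAGAAAACGA AGCTTTAGGA"))  # MDFLENEALG
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows the same comparison against the Protein sequence and GC content fields.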