Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_03531 |
Symbol | hemK |
ID | 5731474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 332481 |
End bp | 333362 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641284701 |
Product | putative protein methyltransferase |
Protein accession | YP_001550238 |
Protein GI | 159902894 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.854566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATGA AGTCAAAAGA AAAAAAATCA GCAAGGGATA TCTTGAACTG GAGAATGGCT CAACTTGCCC TTGGAGGGAG AGTTGTAGAT ATTGATTGGT TGCTAGATGT GGGGGGAGGC CTGGGATGGG AGTCTCTTCA AAGATTAAAA ATTTTTCAAA ATAATCACTA TGAACTCCAA AAATCTTTAG ATGAGCTTTC ATTTATATGG CATAGACATA TAAATGAGAA TGAGCCACTT CAATATCTTG TAGGGAAATG CCCTTGGAGA GATTTTCAGT TAGAAATCAA TTCTTCCGTA TTTATTCCGC GACAGGAGAC AGAGATCCTT GTTGAATTAG CCTTAAAGAA ATGCAATGGA ATAAGTGTTG GCCGATGGGC TGACTTAGGA ACTGGTTCAG GCGTATTGGC CGTAGCTTTG GCTAGATCTT TACCTGGTTG GATTGGAGAT GCAGTGGATT GTAGTAAGGA TGCATTGTCT TTGGCAAAAA AAAATTTAGC TAATTTAGCC AATAATTCAC ATGTCCATTT TCACTTGGGG CATTGGTGGC AGCCTCTAAA ATCTTGGTGG GGAACATATG ATTTGGTATT AGCGAACCCT CCATATATTC CAAGCGCAGT GTTAAGTGAA TTACATCCAA TTGTGAGAGA TAATGAGCCA CATTTGGCAC TTTCTGGAGG CCTGGATGGA ATGAATTGTT GTCGTGAGAT TATTCGAGGA GCAAAGAAAG GACTTGGAAC AGGAGGGTGG TTAATTTTCG AACATCATTA TGACCAAAGT GAACGGCTGT TGAATGAGTT GATTGCTAAT GGTTTTAAGG AAGTTAATTT TGAGAATGAT CTTGAAGGGG TTAGACGTTT TGCTATAGGA CGTAAGTCTT GA
|
Protein sequence | MNMKSKEKKS ARDILNWRMA QLALGGRVVD IDWLLDVGGG LGWESLQRLK IFQNNHYELQ KSLDELSFIW HRHINENEPL QYLVGKCPWR DFQLEINSSV FIPRQETEIL VELALKKCNG ISVGRWADLG TGSGVLAVAL ARSLPGWIGD AVDCSKDALS LAKKNLANLA NNSHVHFHLG HWWQPLKSWW GTYDLVLANP PYIPSAVLSE LHPIVRDNEP HLALSGGLDG MNCCREIIRG AKKGLGTGGW LIFEHHYDQS ERLLNELIAN GFKEVNFEND LEGVRRFAIG RKS
|
| |