Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119513 |
Symbol | HemK |
ID | 5000443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 466062 |
End bp | 467258 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 49% |
IMG OID | 640415864 |
Product | protein methyltransferase |
Protein accession | XP_001416155 |
Protein GI | 145342148 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.132002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTGCG TTTTACTCGA ACCCTTTCTA GTAGGGCGTA TCGCCGTAAA ACGGGGCCTC GCTAAAACAA AGCGAGCCAA ACACATGAAA GGAGCACACG CGAGCGCACT TTGCGACGCA TCTGATATCC CTATACGTTT AGTCGACCGC CCTGGTAAGT ACCCCGTGAA CAGCTCTCAA CTGGATGACG TTAGGAAATG GGGTCAAGAC GTTGCGGCAC GAAATATTGC GTATTTCGAA GCTTCTAATG GAAGTCCGAT GCTAAAAGAG CTATATCAAG AACTCGAATG GCTTATAACG GATAGTACCG CCGAAAGAGT CGAATGTTCT CCCAGACTGA AAGTAGCAAC CTCTGGCGAC GATGGTTTCA GCGCATCAAG TACCACCCGA TCCGCGATTT TGCGTCAGTC GATACCCGAG CTTCAGCAGC TCTGGATGCG AAGAATAATA GATAGGGTGC CTTTGCAGTA TCTCACAAAC ACGGCGCATT GGCGAGATAT GGAATTCACT GTGAATACTT CTGTCCTCAT ACCACGACCT GAGACTGAAT TACTGATAGA TTTTGCGTGC GAATGGCTTC GTGAACTGGA GTCAAATACT GAAAATCATA CCATGAATTA TAATCTATTG TCAGGGCCAT GGCTTGATCT TGGAACCGGA TCTGGAATTT TGGCCATCGC ACTTGCGAAA GAACTACAGC GTAAGTGCGC AGATGCCTCC AGTGTGTACG CAGTAGACGT TTCTGTAGCC GCACTCGAAC TCGCAAGGGA CAATGCGCGT CGTAACGGAG TCCAAGATTC CATCAAAACC TTGCACGGAT CATGGTTTAA CCCGATCAAA AAAGATGTAC GCTTTACCGG AATCTTGACC AACCCTCCGT ACATCCCGAC AGATTTGCTT GAGTCTCTTC AGCCGGAGGT TTGTTCGCAT GAGCCATGGC TCGCACTCGA TGGGGGGGGC GGGGACGGTT CAGCACACTT AGTCACTATC TGTAGGGACG TCAAGAACTT TTTACTCCCC GGTGGTCTGT TTGCAGTTGA AACCCACGGT CTAGAACAGG CTCGTTTGGT TCAACATTTA TTGAACAGCA CCGAGGCTTT TCGAGACGTC CACCTAAAAG CAGACTACTC TGGTATAGTT CGTTATGTAA CGGCTAGAAA AGTGAACGAT CCTGGAGTCA TGGATGGCGC ATGCTGA
|
Protein sequence | MQCVLLEPFL VGRIAVKRGL AKTKRAKHMK GAHASALCDA SDIPIRLVDR PGKYPVNSSQ LDDVRKWGQD VAARNIAYFE ASNGSPMLKE LYQELEWLIT DSTAERVECS PRLKVATSGD DGFSASSTTR SAILRQSIPE LQQLWMRRII DRVPLQYLTN TAHWRDMEFT VNTSVLIPRP ETELLIDFAC EWLRELESNT ENHTMNYNLL SGPWLDLGTG SGILAIALAK ELQRKCADAS SVYAVDVSVA ALELARDNAR RNGVQDSIKT LHGSWFNPIK KDVRFTGILT NPPYIPTDLL ESLQPEVCSH EPWLALDGGG GDGSAHLVTI CRDVKNFLLP GGLFAVETHG LEQARLVQHL LNSTEAFRDV HLKADYSGIV RYVTARKVND PGVMDGAC
|
| |