Gene OSTLU_119513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119513 
SymbolHemK 
ID5000443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp466062 
End bp467258 
Gene Length1197 bp 
Protein Length398 aa 
Translation table 
GC content49% 
IMG OID640415864 
Productprotein methyltransferase 
Protein accessionXP_001416155 
Protein GI145342148 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.132002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTGCG TTTTACTCGA ACCCTTTCTA GTAGGGCGTA TCGCCGTAAA ACGGGGCCTC 
GCTAAAACAA AGCGAGCCAA ACACATGAAA GGAGCACACG CGAGCGCACT TTGCGACGCA
TCTGATATCC CTATACGTTT AGTCGACCGC CCTGGTAAGT ACCCCGTGAA CAGCTCTCAA
CTGGATGACG TTAGGAAATG GGGTCAAGAC GTTGCGGCAC GAAATATTGC GTATTTCGAA
GCTTCTAATG GAAGTCCGAT GCTAAAAGAG CTATATCAAG AACTCGAATG GCTTATAACG
GATAGTACCG CCGAAAGAGT CGAATGTTCT CCCAGACTGA AAGTAGCAAC CTCTGGCGAC
GATGGTTTCA GCGCATCAAG TACCACCCGA TCCGCGATTT TGCGTCAGTC GATACCCGAG
CTTCAGCAGC TCTGGATGCG AAGAATAATA GATAGGGTGC CTTTGCAGTA TCTCACAAAC
ACGGCGCATT GGCGAGATAT GGAATTCACT GTGAATACTT CTGTCCTCAT ACCACGACCT
GAGACTGAAT TACTGATAGA TTTTGCGTGC GAATGGCTTC GTGAACTGGA GTCAAATACT
GAAAATCATA CCATGAATTA TAATCTATTG TCAGGGCCAT GGCTTGATCT TGGAACCGGA
TCTGGAATTT TGGCCATCGC ACTTGCGAAA GAACTACAGC GTAAGTGCGC AGATGCCTCC
AGTGTGTACG CAGTAGACGT TTCTGTAGCC GCACTCGAAC TCGCAAGGGA CAATGCGCGT
CGTAACGGAG TCCAAGATTC CATCAAAACC TTGCACGGAT CATGGTTTAA CCCGATCAAA
AAAGATGTAC GCTTTACCGG AATCTTGACC AACCCTCCGT ACATCCCGAC AGATTTGCTT
GAGTCTCTTC AGCCGGAGGT TTGTTCGCAT GAGCCATGGC TCGCACTCGA TGGGGGGGGC
GGGGACGGTT CAGCACACTT AGTCACTATC TGTAGGGACG TCAAGAACTT TTTACTCCCC
GGTGGTCTGT TTGCAGTTGA AACCCACGGT CTAGAACAGG CTCGTTTGGT TCAACATTTA
TTGAACAGCA CCGAGGCTTT TCGAGACGTC CACCTAAAAG CAGACTACTC TGGTATAGTT
CGTTATGTAA CGGCTAGAAA AGTGAACGAT CCTGGAGTCA TGGATGGCGC ATGCTGA
 
Protein sequence
MQCVLLEPFL VGRIAVKRGL AKTKRAKHMK GAHASALCDA SDIPIRLVDR PGKYPVNSSQ 
LDDVRKWGQD VAARNIAYFE ASNGSPMLKE LYQELEWLIT DSTAERVECS PRLKVATSGD
DGFSASSTTR SAILRQSIPE LQQLWMRRII DRVPLQYLTN TAHWRDMEFT VNTSVLIPRP
ETELLIDFAC EWLRELESNT ENHTMNYNLL SGPWLDLGTG SGILAIALAK ELQRKCADAS
SVYAVDVSVA ALELARDNAR RNGVQDSIKT LHGSWFNPIK KDVRFTGILT NPPYIPTDLL
ESLQPEVCSH EPWLALDGGG GDGSAHLVTI CRDVKNFLLP GGLFAVETHG LEQARLVQHL
LNSTEAFRDV HLKADYSGIV RYVTARKVND PGVMDGAC