Gene PHATRDRAFT_51700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_51700 
SymbolCMK 
ID7198117 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1098916 
End bp1100067 
Gene Length1152 bp 
Protein Length355 aa 
Translation table 
GC content48% 
IMG OID 
Product4-diphosphocytidyl-2c-methyl-d-erythritol kinase 
Protein accessionXP_002178363 
Protein GI219115135 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTTCG CCTCAGCACA CGTATTATTC TTAGCGGCAC TGTCTCGAGA GGCTGTAGCC 
TTCACTGACA ATCGCTTTCC GATCTTCAAC CATCCGTCTG CCAAGGTCAC CACGACCAGC
AGTCTCAGCG CTTATACAGA AACAAATGAA TCGTCGAGGG GAAAAATCTC TGAAAGAACA
TTGACGCTCT TCAGTCCGTG CAAGATCAAT CTCTTCTTGA GGATAATTCG AAAACGAGAA
GACGGTTTCC ATGATCTTGC CAGTCTATTT CAAGCGATTG GTTTTGGCGA CACTCTGGAG
CTGACATCAA TCGACTCCGA TGCAGACGAA TTTACATGCA ACATGCCGGG TGTTCCCGTA
GACAACTCAA ATCTAGTGCT ACGAGCTATC CAACTAATGC GAGAAAAAAC GGGTGAAAAG
CAATCCTTTC GAGCCAATCT GATCAAACAA GTACCCGCCC AAGCGGGACT TGGTGGGGGA
TCGGCAAACG CTGCCACGGC GATGTGGGGC GTCAACGAAC TGATGGGGCG TCCTGCATCA
TTGGACCAAG TAAGAATACA ATGGACTTTA CAGTAAAAGA GATCATTGCC AACACAGTCG
CTGACATCTT TCTTTTCCAC GCTCATACAT CAGATGGTTG AATGGTCAGG GGCTCTTGGA
AGTGACATTA CGTTCTTTCT GAGTAGAGGA ACCGCATACT GTACTGGTCG CGGCGAGATC
ATGTCCTCCA TTGATCCGCC TCTTCCTTCA GGAACGAAGA TATGCATAGT AAAAATCGAT
ATTGGCTTGT CGACGCCATC TGTTTTCAAA GCACTGGACT ACGATAAATT GAGTAATCTG
AATGCTGACG ACGTTTTATT GCCTACTTTT CTGCGAGGAA TTGACCAGGT ACCTGACTCT
TACTTTGTAA ACGACTTGGA GACACCGGCT TTTAAATGTA TTCCGGAGCT GCGTAGTCTC
AAAGAAGAAT TATTGGGAGT GGCCGGATTC GACCACGTGA TGATGTCGGG GAGCGGTACG
AGCATCTTTT GCCTGGGCGA GCCGAAGGAT AACAAGGACT TTCACGACAG ATTCGTAAGT
CGGAAAGGGC TACAGGTATT CTTCTCGGAG TTCATTAGTA GGCCGGAAGG GGTCTGGTTT
GAGAAACCCT AG
 
Protein sequence
MYFASAHVLF LAALSREAVA FTDNRFPIFN HPSAKVTTTS SLSAYTETNE SSRGKISERT 
LTLFSPCKIN LFLRIIRKRE DGFHDLASLF QAIGFGDTLE LTSIDSDADE FTCNMPGVPV
DNSNLVLRAI QLMREKTGEK QSFRANLIKQ VPAQAGLGGG SANAATAMWG VNELMGRPAS
LDQMVEWSGA LGSDITFFLS RGTAYCTGRG EIMSSIDPPL PSGTKICIVK IDIGLSTPSV
FKALDYDKLS NLNADDVLLP TFLRGIDQVP DSYFVNDLET PAFKCIPELR SLKEELLGVA
GFDHVMMSGS GTSIFCLGEP KDNKDFHDRF VSRKGLQVFF SEFISRPEGV WFEKP