Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51700 |
Symbol | CMK |
ID | 7198117 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1098916 |
End bp | 1100067 |
Gene Length | 1152 bp |
Protein Length | 355 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | 4-diphosphocytidyl-2c-methyl-d-erythritol kinase |
Protein accession | XP_002178363 |
Protein GI | 219115135 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTCG CCTCAGCACA CGTATTATTC TTAGCGGCAC TGTCTCGAGA GGCTGTAGCC TTCACTGACA ATCGCTTTCC GATCTTCAAC CATCCGTCTG CCAAGGTCAC CACGACCAGC AGTCTCAGCG CTTATACAGA AACAAATGAA TCGTCGAGGG GAAAAATCTC TGAAAGAACA TTGACGCTCT TCAGTCCGTG CAAGATCAAT CTCTTCTTGA GGATAATTCG AAAACGAGAA GACGGTTTCC ATGATCTTGC CAGTCTATTT CAAGCGATTG GTTTTGGCGA CACTCTGGAG CTGACATCAA TCGACTCCGA TGCAGACGAA TTTACATGCA ACATGCCGGG TGTTCCCGTA GACAACTCAA ATCTAGTGCT ACGAGCTATC CAACTAATGC GAGAAAAAAC GGGTGAAAAG CAATCCTTTC GAGCCAATCT GATCAAACAA GTACCCGCCC AAGCGGGACT TGGTGGGGGA TCGGCAAACG CTGCCACGGC GATGTGGGGC GTCAACGAAC TGATGGGGCG TCCTGCATCA TTGGACCAAG TAAGAATACA ATGGACTTTA CAGTAAAAGA GATCATTGCC AACACAGTCG CTGACATCTT TCTTTTCCAC GCTCATACAT CAGATGGTTG AATGGTCAGG GGCTCTTGGA AGTGACATTA CGTTCTTTCT GAGTAGAGGA ACCGCATACT GTACTGGTCG CGGCGAGATC ATGTCCTCCA TTGATCCGCC TCTTCCTTCA GGAACGAAGA TATGCATAGT AAAAATCGAT ATTGGCTTGT CGACGCCATC TGTTTTCAAA GCACTGGACT ACGATAAATT GAGTAATCTG AATGCTGACG ACGTTTTATT GCCTACTTTT CTGCGAGGAA TTGACCAGGT ACCTGACTCT TACTTTGTAA ACGACTTGGA GACACCGGCT TTTAAATGTA TTCCGGAGCT GCGTAGTCTC AAAGAAGAAT TATTGGGAGT GGCCGGATTC GACCACGTGA TGATGTCGGG GAGCGGTACG AGCATCTTTT GCCTGGGCGA GCCGAAGGAT AACAAGGACT TTCACGACAG ATTCGTAAGT CGGAAAGGGC TACAGGTATT CTTCTCGGAG TTCATTAGTA GGCCGGAAGG GGTCTGGTTT GAGAAACCCT AG
|
Protein sequence | MYFASAHVLF LAALSREAVA FTDNRFPIFN HPSAKVTTTS SLSAYTETNE SSRGKISERT LTLFSPCKIN LFLRIIRKRE DGFHDLASLF QAIGFGDTLE LTSIDSDADE FTCNMPGVPV DNSNLVLRAI QLMREKTGEK QSFRANLIKQ VPAQAGLGGG SANAATAMWG VNELMGRPAS LDQMVEWSGA LGSDITFFLS RGTAYCTGRG EIMSSIDPPL PSGTKICIVK IDIGLSTPSV FKALDYDKLS NLNADDVLLP TFLRGIDQVP DSYFVNDLET PAFKCIPELR SLKEELLGVA GFDHVMMSGS GTSIFCLGEP KDNKDFHDRF VSRKGLQVFF SEFISRPEGV WFEKP
|
| |