Gene OSTLU_24843 details

Gene Information

Locus tag: OSTLU_24843
Symbol: CMS
ID: 5002890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 586545
End bp: 587416
Gene Length: 872 bp
Protein Length: 275 aa
Translation table:
GC content: 64%
IMG OID: 640418311
Product: 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase (4-diphosphocytidyl-2C-methyl-D-erythritol synthase) (MEP cytidylyltransferase) (MCT) (ISPD)
Protein accession: XP_001418750
Protein GI: 145348632
COG category: [I] Lipid transport and metabolism
COG ID: [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase
TIGRFAM ID: [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.56499
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0645065
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTCGCCGAGC GCGCGCGCGC ACGCCTCGCG ACCATGCTCT CGCGAAGAGC GACCGTGATC 
GCGCCCGCGC GCGCGGCGCC CGAGAGCGCG CGACGCGTCG CGCGGACGCG CGCGCGGCGC
GGCGCGAACG TCCGAACGCG CGCGGCGAGC GAGGTCACGC AGGACGTCGC CGACGGCGCC
GTGTCCTTCG TCCTGCTCGC GGGCGGCGTG GGCAAGCGCA TGGGAGCGGA CATGCCGAAG
CAGTACCTGC CGCTCATGGG CACGCCGATC GCGCTGTGGT CGCTTCGGAA GTTTGCGAAG
ATGGCTGAGG TCGGGGAGAT CGTCGTCGTG TGCGACCCGA GCTACGACGA CGTGTTTCAG
AGCGAAGCGA TCGATAAGCC GCTGGTGTTC GCGAGACCGG GGAAAGAGCG ACAAGATAGT
GTGTATAATG GCATGCAAGC GGCGCGGGCG GGGGCGGAGT TGTTGGCGAT TCACGATAGC
GCGCGACCGC TGTGCGCGGC GACGGATGCG AGGCGGTGCT TCAACGACGC GAAAAAGTAC
GGTGCGGCGG TTTTGGCGGT GCAGAGTAAG GCGACGATTA AGGAAGTGAA TAAGGATTTG
AGCATCGATA AGGGGCTCGA TCGCAGTCGG CTTTGGGAGA TGCAAACGCC GCAAGTGATG
CGACCCGAGT TGTTGCGAGC GGGATACGAT CTCGTGAATA GTAAGGGACT TGAGGTGACG
GACGATGTAT CCATCGTCGA AGCCTTAGGT GAGCGCGTGC AAGTGACGCC GGGGAGTTAT
TTCAACTTGA AGGTCACGAC GCCGGAGGAC ATGTTCATCG CGGAACGGCT GATGACGGAG
CAGGGCGACG CCGTCGCATA AATAATTCTA AT
 
Protein sequence
MLSRRATVIA PARAAPESAR RVARTRARRG ANVRTRAASE VTQDVADGAV SFVLLAGGVG 
KRMGADMPKQ YLPLMGTPIA LWSLRKFAKM AEVGEIVVVC DPSYDDVFQS EAIDKPLVFA
RPGKERQDSV YNGMQAARAG AELLAIHDSA RPLCAATDAR RCFNDAKKYG AAVLAVQSKA
TIKEVNKDLS IDKGLDRSRL WEMQTPQVMR PELLRAGYDL VNSKGLEVTD DVSIVEALGE
RVQVTPGSYF NLKVTTPEDM FIAERLMTEQ GDAVA
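
Consistency check (illustrative)

The record gives the sequences in space-separated blocks of ten and reports a gene length alongside start and end coordinates. The sketch below shows one way a reader might check those fields against each other. It assumes the coordinates are 1-based and inclusive, which is consistent with the 872 bp reported here. The helper names (clean, span_length, gc_fraction) are hypothetical and are not part of any database tool.

```python
def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the residues."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# First line of the gene sequence, copied from the record above.
first_line = "CTCGCCGAGC GCGCGCGCGC ACGCCTCGCG ACCATGCTCT CGCGAAGAGC GACCGTGATC"

print(span_length(586545, 587416))  # 872, matching the Gene Length field
print(len(clean(first_line)))       # 60 bases per full line
```

Pasting the whole gene sequence into clean() and taking its length should reproduce the Gene Length field, and gc_fraction() on the same string can be compared with the reported GC content.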