Gene NATL1_09481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_09481 
SymbolispE 
ID4781268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp874226 
End bp875185 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content33% 
IMG OID640084225 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_001014771 
Protein GI124025655 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.782765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0943702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCTT CCAAAGCAAA TGAAGATTTT CTTATCGCAA AAGCACATGC AAAAATTAAT 
CTACATTTAG AGGTTTTAGG TATTAGGAGC GATGGCTTTC ATGAATTAGC AATGGTCATG
CAAAGTATTA ATTTAAGTGA TCAGTTGAAG ATGATAAAAA GAGTAGATAA TACTATTAAT
CTAAAATCTA ATAATAAAGA AATTAGTAAT GGTGACGATA ATCTAATAAT AAAAGCTTCA
AAGCTGTTGA GAAATAAAGT AGAAAATCAA GAATTAGGTG TTGATATTGA ACTTGAGAAA
AACATTCCTA TTGGAGCAGG ATTGGCAGGG GGATCTACAG ATGCAGCTGC AACCTTACTT
GGATTAAATA AACTCTGGAA GCTAAATCTT AAGACTGATG AATTAGAGAA CCTATCAAAA
GAAATAGGAT CAGATATCCC TTTTTGCATA TCAGGAGGGA GGCAAATATG TTTTGGTAGA
GGTGAAATTT TAGAAAAATT GAAATTTGAT CAAATTCAGT TAGGTCTTAT TTTGGTTAAA
GACCCTTCAA TACAAGTATC TACTCCAGTT GCATACAAAA AATATAAAGA TCAGTTTGGT
GAAAGCTATC TTGAAGATGA TAGGGATTTT GAAATCAAAA GAAACTCTAT TAGATCTATT
GACTGGTCTG ATCAGTCGCT TTTTGATAAT CGTAAAGAAA TACAAAATGA TTTACAAAAA
AGCGTTCGGC CTATAACACC AGAGGTTGAG AAGTCATTGG ATTTATTGTC TAGTTTGCCA
GATTCACGTC TTGTTTCAAT GAGTGGTTCT GGTCCAAGTT GTTTTGCCTT GTTTCAAAAT
TATGACCAAG CAAATAAAGT ACTCAAAGAA CATGTTAATG AATTTGAAAG GGCTGGTTTA
TCAGCTTGGG CATGTTCAAT GATGTCTAAT GGAGTTGAAT TAAGAAATGA ATTCATCTAG
 
Protein sequence
MEPSKANEDF LIAKAHAKIN LHLEVLGIRS DGFHELAMVM QSINLSDQLK MIKRVDNTIN 
LKSNNKEISN GDDNLIIKAS KLLRNKVENQ ELGVDIELEK NIPIGAGLAG GSTDAAATLL
GLNKLWKLNL KTDELENLSK EIGSDIPFCI SGGRQICFGR GEILEKLKFD QIQLGLILVK
DPSIQVSTPV AYKKYKDQFG ESYLEDDRDF EIKRNSIRSI DWSDQSLFDN RKEIQNDLQK
SVRPITPEVE KSLDLLSSLP DSRLVSMSGS GPSCFALFQN YDQANKVLKE HVNEFERAGL
SAWACSMMSN GVELRNEFI