Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_09481 |
Symbol | ispE |
ID | 4781268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 874226 |
End bp | 875185 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640084225 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001014771 |
Protein GI | 124025655 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.782765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0943702 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCTT CCAAAGCAAA TGAAGATTTT CTTATCGCAA AAGCACATGC AAAAATTAAT CTACATTTAG AGGTTTTAGG TATTAGGAGC GATGGCTTTC ATGAATTAGC AATGGTCATG CAAAGTATTA ATTTAAGTGA TCAGTTGAAG ATGATAAAAA GAGTAGATAA TACTATTAAT CTAAAATCTA ATAATAAAGA AATTAGTAAT GGTGACGATA ATCTAATAAT AAAAGCTTCA AAGCTGTTGA GAAATAAAGT AGAAAATCAA GAATTAGGTG TTGATATTGA ACTTGAGAAA AACATTCCTA TTGGAGCAGG ATTGGCAGGG GGATCTACAG ATGCAGCTGC AACCTTACTT GGATTAAATA AACTCTGGAA GCTAAATCTT AAGACTGATG AATTAGAGAA CCTATCAAAA GAAATAGGAT CAGATATCCC TTTTTGCATA TCAGGAGGGA GGCAAATATG TTTTGGTAGA GGTGAAATTT TAGAAAAATT GAAATTTGAT CAAATTCAGT TAGGTCTTAT TTTGGTTAAA GACCCTTCAA TACAAGTATC TACTCCAGTT GCATACAAAA AATATAAAGA TCAGTTTGGT GAAAGCTATC TTGAAGATGA TAGGGATTTT GAAATCAAAA GAAACTCTAT TAGATCTATT GACTGGTCTG ATCAGTCGCT TTTTGATAAT CGTAAAGAAA TACAAAATGA TTTACAAAAA AGCGTTCGGC CTATAACACC AGAGGTTGAG AAGTCATTGG ATTTATTGTC TAGTTTGCCA GATTCACGTC TTGTTTCAAT GAGTGGTTCT GGTCCAAGTT GTTTTGCCTT GTTTCAAAAT TATGACCAAG CAAATAAAGT ACTCAAAGAA CATGTTAATG AATTTGAAAG GGCTGGTTTA TCAGCTTGGG CATGTTCAAT GATGTCTAAT GGAGTTGAAT TAAGAAATGA ATTCATCTAG
|
Protein sequence | MEPSKANEDF LIAKAHAKIN LHLEVLGIRS DGFHELAMVM QSINLSDQLK MIKRVDNTIN LKSNNKEISN GDDNLIIKAS KLLRNKVENQ ELGVDIELEK NIPIGAGLAG GSTDAAATLL GLNKLWKLNL KTDELENLSK EIGSDIPFCI SGGRQICFGR GEILEKLKFD QIQLGLILVK DPSIQVSTPV AYKKYKDQFG ESYLEDDRDF EIKRNSIRSI DWSDQSLFDN RKEIQNDLQK SVRPITPEVE KSLDLLSSLP DSRLVSMSGS GPSCFALFQN YDQANKVLKE HVNEFERAGL SAWACSMMSN GVELRNEFI
|
| |