Gene A9601_09281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_09281 
SymbolispE 
ID4717635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp798259 
End bp799194 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content28% 
IMG OID640078641 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_001009319 
Protein GI123968461 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGATT TTGCTAAAAA GAAAATTAAT ATAAAATCTC CTGCCAAAAT AAATTTGCAC 
CTTGAAGTGA TTGGTAAAAG AGAGGATGGA TTTCACGAGT TAGCAATGAT TATGCAAAAT
ATCGATCTTG CTGATTATTT AGAATTTGAA ATTAATAATG AAGGTTTAAT TAAACTTGAG
TCTGATTGTA ATGATTTAAG CCTATCTGAT GATAACTTAA TTGTTAAATC GGCAAACCTA
TTAAGGAAAA AATCAAATAT AGATTACGGT GCGAATATAT TTTTAAGAAA AAATATCCCA
ATTGGTGCAG GATTAGCTGG TGGATCCAGT AATGCAGCAG CAACATTAAT TGGTCTTAAT
AATTTATGGG ATTTGAAATT AGATCAAGAA ACTTTATGTT CATTAGCATC AACTTTAGGA
TCTGATATTC CCTTTTTTAT AAATGGTGGT ATTCAATTAT GTTTTGGAAG AGGCGAAATT
TTGGAGAAAT TAGATTCAAC CCTTGAATAT GGAGCAATTC TTTTAAAAAA TCCTAATGTA
TCAGTATCCA CAGCTGAAAC TTATAAAAAA TATAGTAATA GATTTTGTGA TCAATATCTT
ACTGATAGAG AAATGATTGA GAACATAAGA AAAAATTTAA GAGATAATGG TTTAAATAAC
TTAAATTTTG ATAATCAACA TTTATCTATT AAAAATGATT TGCAGTTAGT TGTTGAAAAT
GAAAATGATT CTGTAAAGCA GGCATTATAT TTACTTTCTA AATTAGAAAA TTGTCTAACA
TTTTCAATGA GTGGATCAGG ACCTACATGC TTTGCACTCT TTAAAGATAA AGAGACTGCT
AAAAAAGAAT TAACTGCAAA TTCTAAATTA TTTAAAGATA AAGGCTATGA TTCATGGGTT
TGCACTTTCC TTGAAAAGGG AATAACATTC ATATAA
 
Protein sequence
MQDFAKKKIN IKSPAKINLH LEVIGKREDG FHELAMIMQN IDLADYLEFE INNEGLIKLE 
SDCNDLSLSD DNLIVKSANL LRKKSNIDYG ANIFLRKNIP IGAGLAGGSS NAAATLIGLN
NLWDLKLDQE TLCSLASTLG SDIPFFINGG IQLCFGRGEI LEKLDSTLEY GAILLKNPNV
SVSTAETYKK YSNRFCDQYL TDREMIENIR KNLRDNGLNN LNFDNQHLSI KNDLQLVVEN
ENDSVKQALY LLSKLENCLT FSMSGSGPTC FALFKDKETA KKELTANSKL FKDKGYDSWV
CTFLEKGITF I