Gene P9303_16181 details


Gene Information

Locus tag: P9303_16181
Symbol: ispE
ID: 4778340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1418555
End bp: 1419514
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 52%
IMG OID: 640087127
Product: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase
Protein accession: YP_001017627
Protein GI: 124023320
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
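
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1418555
end_bp = 1419514
gene_length_bp = 960
protein_length_aa = 319


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```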


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.122507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATAT CTAGCCCTTC TGCTGCTTCA GGGCATCTGG TTAGCGTCTC GGCGCCAGCC 
AAGATCAATC TCCATCTTGA GGTGCTTGGC CTCAGGTCTG ATGGTTTTCA TGAGCTAGCG
ATGGTGATGC AAAGCATCGA ACTCGCTGAT CAACTTCATT TCCGCAATAC GGCTGATGGC
ACCATCAGCC TGCGCTGCGA TGATTCCAGC CTCAGCACTG CTGGCGATAA TTTGATCGTG
CAAGCTGCGC ATTTATTACG TGAGCGCTCA GGGTTCTCTG AACTTGGTGC CGCGATTGAA
TTGCAAAAAC GCATCCCAAT TGGAGCTGGT CTTGCGGGCG GCTCAAGTGA TGGTGCAGCA
ACACTGGTGG GGTTAAACGG TCTCTGGAAT CTCAATTTTT CTCAGGGTCA ACTTGAAGGT
TTTGCGGCTG AGCTTGGCTC CGATATGCCC TTTTGCCTGG CAGGTGGAAG CCAATTGTGT
TTCGGTCGTG GGGAAAGGTT GGAATCGCTA CAAGCGATGC AAGCATCAAT GGCCGTGGTG
TTGGTGAAGG ATCCATCAGT GAGCGTTTCA ACCCCTTGGG CTTATGGACG CTGTAAGGAA
CTTTTCAGGA GTCGTTATCT TTCACAGGAA TCTGATTTTG AGCAACGTCG TCAGCAGCTC
AGAGAATCTT CTTGGCTGAA TCCTTTGCGG GCTGATGATC CACCACCTCT GCACAACGAT
CTTCAGGCTG TGGTTGCACC CGAAGTATTT GCTGTGCAAA CCACATTGAA GTTGCTCAGT
GATTTGCCTG GTTCTCTTGC TGTAGCGATG AGTGGATCTG GTCCAAGCTG TTTTGCCCTT
TTTGCTGACG TTGATTCAGC TCAGGCAGCC CTTAAGCGCC AACAGCCTGC CTTCGACGCA
GCTGGTTTAA GCAGTTGGTG CTGCGCGTTC CGCTCTGAAG GCATCAAACT GGAAGCATGA
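
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the "GC content" field. The helper names are hypothetical, and only the first displayed row is used as sample input, so the printed percentage describes that row alone, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as displayed above.
first_row = "ATGAGCATAT CTAGCCCTTC TGCTGCTTCA GGGCATCTGG TTAGCGTCTC GGCGCCAGCC"
row_seq = clean_sequence(first_row)

print(len(row_seq), "bases;", round(100 * gc_fraction(row_seq)), "% GC in this row")
```

To reproduce the reported gene-level value, the same two functions would be applied to all rows of the gene sequence joined together.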
 
Protein sequence
MSISSPSAAS GHLVSVSAPA KINLHLEVLG LRSDGFHELA MVMQSIELAD QLHFRNTADG 
TISLRCDDSS LSTAGDNLIV QAAHLLRERS GFSELGAAIE LQKRIPIGAG LAGGSSDGAA
TLVGLNGLWN LNFSQGQLEG FAAELGSDMP FCLAGGSQLC FGRGERLESL QAMQASMAVV
LVKDPSVSVS TPWAYGRCKE LFRSRYLSQE SDFEQRRQQL RESSWLNPLR ADDPPPLHND
LQAVVAPEVF AVQTTLKLLS DLPGSLAVAM SGSGPSCFAL FADVDSAQAA LKRQQPAFDA
AGLSSWCCAF RSEGIKLEA
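
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The function name and the "X" placeholder for unrecognized codons are choices made for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(text: str) -> str:
    """Translate a nucleotide string, stopping at the first stop codon.

    Whitespace is ignored, so the blocked layout shown above can be
    pasted in directly. Codons with unexpected characters become "X".
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied as displayed above.
first_row = "ATGAGCATAT CTAGCCCTTC TGCTGCTTCA GGGCATCTGG TTAGCGTCTC GGCGCCAGCC"
print(translate(first_row))  # prints MSISSPSAASGHLVSVSAPA
```

The printed residues match the first twenty residues of the protein sequence listed above.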