Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_16181 |
Symbol | ispE |
ID | 4778340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1418555 |
End bp | 1419514 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087127 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001017627 |
Protein GI | 124023320 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.122507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATAT CTAGCCCTTC TGCTGCTTCA GGGCATCTGG TTAGCGTCTC GGCGCCAGCC AAGATCAATC TCCATCTTGA GGTGCTTGGC CTCAGGTCTG ATGGTTTTCA TGAGCTAGCG ATGGTGATGC AAAGCATCGA ACTCGCTGAT CAACTTCATT TCCGCAATAC GGCTGATGGC ACCATCAGCC TGCGCTGCGA TGATTCCAGC CTCAGCACTG CTGGCGATAA TTTGATCGTG CAAGCTGCGC ATTTATTACG TGAGCGCTCA GGGTTCTCTG AACTTGGTGC CGCGATTGAA TTGCAAAAAC GCATCCCAAT TGGAGCTGGT CTTGCGGGCG GCTCAAGTGA TGGTGCAGCA ACACTGGTGG GGTTAAACGG TCTCTGGAAT CTCAATTTTT CTCAGGGTCA ACTTGAAGGT TTTGCGGCTG AGCTTGGCTC CGATATGCCC TTTTGCCTGG CAGGTGGAAG CCAATTGTGT TTCGGTCGTG GGGAAAGGTT GGAATCGCTA CAAGCGATGC AAGCATCAAT GGCCGTGGTG TTGGTGAAGG ATCCATCAGT GAGCGTTTCA ACCCCTTGGG CTTATGGACG CTGTAAGGAA CTTTTCAGGA GTCGTTATCT TTCACAGGAA TCTGATTTTG AGCAACGTCG TCAGCAGCTC AGAGAATCTT CTTGGCTGAA TCCTTTGCGG GCTGATGATC CACCACCTCT GCACAACGAT CTTCAGGCTG TGGTTGCACC CGAAGTATTT GCTGTGCAAA CCACATTGAA GTTGCTCAGT GATTTGCCTG GTTCTCTTGC TGTAGCGATG AGTGGATCTG GTCCAAGCTG TTTTGCCCTT TTTGCTGACG TTGATTCAGC TCAGGCAGCC CTTAAGCGCC AACAGCCTGC CTTCGACGCA GCTGGTTTAA GCAGTTGGTG CTGCGCGTTC CGCTCTGAAG GCATCAAACT GGAAGCATGA
|
Protein sequence | MSISSPSAAS GHLVSVSAPA KINLHLEVLG LRSDGFHELA MVMQSIELAD QLHFRNTADG TISLRCDDSS LSTAGDNLIV QAAHLLRERS GFSELGAAIE LQKRIPIGAG LAGGSSDGAA TLVGLNGLWN LNFSQGQLEG FAAELGSDMP FCLAGGSQLC FGRGERLESL QAMQASMAVV LVKDPSVSVS TPWAYGRCKE LFRSRYLSQE SDFEQRRQQL RESSWLNPLR ADDPPPLHND LQAVVAPEVF AVQTTLKLLS DLPGSLAVAM SGSGPSCFAL FADVDSAQAA LKRQQPAFDA AGLSSWCCAF RSEGIKLEA
|
| |