Gene Information |
Locus tag | Hhal_0990 |
Symbol | ipk |
ID | 4709377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1062255 |
End bp | 1063121 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639855461 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001002568 |
Protein GI | 121997781 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
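The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions are consistent with the reported values but are not stated by the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1062255
END_BP = 1063121


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an unspliced feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 867 288
```

Both results match the "Gene Length" and "Protein Length" fields, which suggests the record counts the stop codon as part of the gene.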
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.276134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAT GGAGCGTCTG GCCGGCCCCG GCCAAGATCA ACCTCTTTCT CCACGTCCTC GGCCGGCGTG CCGACGGCTA CCACGAGCTG CAGACCGCCT TCCAGCTGCT CGGCTACGGG GATCGCATCT GGCTGCGCCC GCGTGTCGAC GGACGGGTGG AGCGCTGCAG CCATCTGCCC GGCGTGGCCG CGGAGGACGA CCTCACCGTT CGCGCAGCGC GGGCGCTTCA GGCGGCCACC GGCAGCGCGG GCGGGGTCGA TATCCACGTG GACAAGCGCC TGCCGGCCGG CGGGGGAGTC GGTGGCGGTT CCTCGGACGC CGCCACCGTG CTGGTCGCCC TGAACCATCT GTGGCAGACG GGGCTGGATG AGGAGGCGCT GGCCGGCATC GGTCTGGCCC TGGGGGCCGA CGTCCCGGTG TTCGTGCGCG GGCGCAGCGC CTGGGGCGAG GGGGTGGGCG AGCAGCTGCG CCCCATGCCG GTGGACCCGG CGGGTAGCCG CTGGTATCTG GTGGTCGACC CGGGGGCGAG TATCTCCACC GCCGAAGTCT TTGGCGCGCC GGAATTGACA CGGGATACGT CTCCGATCAC AATACCCGAC CTTCGCGCCG GCGCCGTTCG CAACGACTGT GAGCCGGTGG TTCGGAGCCG CTGGCCGGCA GTCGCCGAGG CGCTCGAGTG GCTCGGCCGG CACGGTCAGG CGCGAATGAG TGGTACCGGC AGCTGCTGTT TCGTCGGCTT CCCCGGTGAG GCCGAGGCGC AAGCTGCCCT GGAGCGTCTA CCCGGAGCGT GGTCCGGGTT CGTCGCTCAG GCCGTGGAAC GGTCTCCGCT GCACCTTACG CTCGCGGAGG CCGCCCGTAC CGGCTGA
|
Protein sequence | MTEWSVWPAP AKINLFLHVL GRRADGYHEL QTAFQLLGYG DRIWLRPRVD GRVERCSHLP GVAAEDDLTV RAARALQAAT GSAGGVDIHV DKRLPAGGGV GGGSSDAATV LVALNHLWQT GLDEEALAGI GLALGADVPV FVRGRSAWGE GVGEQLRPMP VDPAGSRWYL VVDPGASIST AEVFGAPELT RDTSPITIPD LRAGAVRNDC EPVVRSRWPA VAEALEWLGR HGQARMSGTG SCCFVGFPGE AEAQAALERL PGAWSGFVAQ AVERSPLHLT LAEAARTG
|
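The gene sequence is printed in blocks of ten bases and, although the gene lies on the minus strand, it appears to be shown in coding orientation (it begins with ATG and ends with TGA). A minimal sketch of how the sequence fields could be checked against each other follows: it strips the spacing, computes GC content, and translates codon by codon. The helper names are hypothetical, and the codon table is the standard one, whose codon-to-residue assignments translation table 11 shares.

```python
BASES = "TCAG"
# Residues for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Translation table 11 uses the same codon-to-residue assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between ten-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGACTGAAT GGAGCGTCTG GCCGGCCCCG"
print(translate(first_blocks))  # MTEWSVWPAP
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.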
| |