Gene Information |
Locus tag | EcSMS35_1934 |
Symbol | ipk |
ID | 6144276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1954218 |
End bp | 1955069 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616810 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001743986 |
Protein GI | 170681054 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
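The coordinate, gene length and protein length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed length of 852 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1954218
END_BP = 1955069


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 852 283
```

Both values agree with the Gene Length and Protein Length fields of the record.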
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000109905 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.323322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGACAC AGTGGCCCTC TCCGGCAAAA CTTAATCTGT TTTTATACAT TACCGGTCAG CGTGCGGATG GTTACCACAC GCTGCAAACG CTGTTTCAGT TTCTTGATTA CGGCGACACC ATCAACATTG AGCTTCGTGA CGATGGGGAT ATTCGTCTGT TAACGCCCGT TGAAGGCGTG GAACATGAAG ATAACCTGAT CGTTCGCGCA GCGCGGTTAT TGATGAAAAC TGCGGCAGAC AGCGGGCGTC TTCCGACGGG AAGCGGTGCG AATATCAGCA TTGACAAGCG TTTGCCGATG GGCGGCGGTC TCGGCGGTGG TTCATCCAAT GCCGCGACGG TCCTGGTGGC ATTAAATCAT CTCTGGCAAT GCGGGCTAAG CATGGATGAG CTGGCGGAAA TGGGACTGAC GCTGGGCGCA GATGTTCCTG TCTTTGTTCG GGGGCATGCC GCGTTTGCCG AAGGCGTTGG TGAAATACTA ACGCCGGTGG ATCCGCCAGA GAAGTGGTAT CTGGTGGCGC ACCCTGGTGT AAGTATTCCG ACACCGGTGA TTTTTAAAGA TCCTGAACTC CCGCGCAATA CGCCAAAAAG GTCAATAGAA ACGTTGCTAA AATGTGAATT CAGCAATGAT TGCGAGGTTA TCGCAAGAAA ACGTTTTCGC GAGGTTGATG CGGTGCTTTC CTGGCTGTTA GAATACGCCC CGTCGCGCCT GACTGGGACA GGGGCCTGTG TCTTTGCTGA ATTTGATACA GAGTCTGAAG CCCGCCAGGT GCTAGAGCAA GCCCCGGAAT GGCTCAATGG CTTTGTGGCG AAAGGCGTTA ATCTTTCCCC ATTGCACAGA GCCATGCTTT AA
|
Protein sequence | MRTQWPSPAK LNLFLYITGQ RADGYHTLQT LFQFLDYGDT INIELRDDGD IRLLTPVEGV EHEDNLIVRA ARLLMKTAAD SGRLPTGSGA NISIDKRLPM GGGLGGGSSN AATVLVALNH LWQCGLSMDE LAEMGLTLGA DVPVFVRGHA AFAEGVGEIL TPVDPPEKWY LVAHPGVSIP TPVIFKDPEL PRNTPKRSIE TLLKCEFSND CEVIARKRFR EVDAVLSWLL EYAPSRLTGT GACVFAEFDT ESEARQVLEQ APEWLNGFVA KGVNLSPLHR AML
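The protein sequence follows from the gene sequence by translation with table 11, which assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons. The sketch below is a minimal illustration rather than the database's own pipeline: `translate` is a hypothetical helper, it ignores the whitespace used to group the sequence into blocks of ten, and it does not handle alternative start codons (not needed here, since the gene begins with ATG).

```python
# Codon-to-amino-acid assignments shared by the standard code and table 11,
# written in the conventional TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGCGGACAC AGTGGCCCTC TCCGGCAAAA"))  # prints: MRTQWPSPAK
print(translate("GCCATGCTTT AA"))                     # prints: AML
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to the same function should give the whole protein.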