Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0588 |
Symbol | |
ID | 3783986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 672751 |
End bp | 673635 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810670 |
Product | 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
Protein accession | YP_411288 |
Protein GI | 82701722 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGAA TACTGTCAAC GACTGAAGTG GATTCGCAGG CGGAACTGAG CTGCCCTGCG CCTGCCAAAC TCAATCTGTT CCTGCATGTG GTGGGACGGA GGGAGGATGG GTACCATCTT CTGCAAACCG TTTTCCGCCT GGTGGATTTC GCCGACCAGC TCCATTTCGG GCTGCGGGCG GACGGTGTGA TCAAGCTGCA TACGCCCACT CCGGGGGTGC CGGAAGAGCA GGATTTGTGC GTGCGCGCAG CAAAACTGCT GCAACGGGAA AGCGGTACTC CCTGGGGGGC CAATATCTTT CTGGAAAAGC GCATCCCGAT GGGTGGTGGC CTGGGAGGCG GCAGTTCGGA TGCAGCCACG ACATTGCTTG CGCTCAACCG CTTGTGGAAG CTGGGCTGGC GCCGGAATCA ACTTTTGAAA CTGGCCCCGG AACTGGGTGC GGATGTTCCC GTATTCGTTT TCAGTGAAAA TGCCTTTGCT GAGGGCATCG GCGAAAAACT CCTGCCGATT GCGTTACCCC CGGCATGGTA TCTGATACTC ACACCGCCCG TGCATGTCTC AACGGCAAAG GTTTTTTCAA GTAAGGAATT GACACGAAAC ACGATTCCGA TCAAAATACC GCCCTTTTCC ACCGAGCAAG GGCATAATGA TCTCGAGCCG GTGGTGTGTG CTTCATACCC CGAGGTAGCA CGCCACCTCG AGTGGCTGCG GCAGCTCGAA GGTGCAAGGA TGGCGGCCAT GACGGGTTCC GGCGCGTGCG TTTTTGCCGA GTTTGCGACC GAATCCGGGG CCAGAAGCGC ACTGGGGAAG ATTCCATACG GTATGAAGGG TTTTGTGGCG CAGGGACTTG ATCGCCATCC CTTGCATGAT TTTGCAGAAC AATAA
|
Protein sequence | MNGILSTTEV DSQAELSCPA PAKLNLFLHV VGRREDGYHL LQTVFRLVDF ADQLHFGLRA DGVIKLHTPT PGVPEEQDLC VRAAKLLQRE SGTPWGANIF LEKRIPMGGG LGGGSSDAAT TLLALNRLWK LGWRRNQLLK LAPELGADVP VFVFSENAFA EGIGEKLLPI ALPPAWYLIL TPPVHVSTAK VFSSKELTRN TIPIKIPPFS TEQGHNDLEP VVCASYPEVA RHLEWLRQLE GARMAAMTGS GACVFAEFAT ESGARSALGK IPYGMKGFVA QGLDRHPLHD FAEQ
|
| |