Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1782 |
Symbol | |
ID | 3918341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1880119 |
End bp | 1880976 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640444523 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_497056 |
Protein GI | 87199799 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.242238 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAGG CGTCGGGGCT TGCTCCGGCG CCTCTTCCGT TTCAGGGGAG GGCCATGAGT CTGCGCGAAA CCGCCTATGC CAAGATCAAC CTTGCCCTCC ACGTAAGGCG GCGCAGGGAT GACGGGTATC ACGAGCTGGA AACGCTGTTC GCGTTCGTCG ATGCGGGGGA TGAGCTGACG GCGAAGCCGG CTCCTGCGGA CTCCTTGCGA GTCTTCGGCG AGTTTGGCGG GGCGCTGGAC GATCCGTTCG GTAACATCGT AGCCAAGGCG CTCGGGACCT TGCCGCATGG CGAGGGCTGG GCGGTGACGC TGGAGAAGCA TCTCCCGGTC GCCGCGGGCC TCGGTGGAGG ATCCGCCGAT GCGGGGGCGG TCTTCCGCAT GGTGGAGCGC AGCCACGGCC TTCCGGCAGA CTGGCATGCC CGCGCTGCCA GGCTGGGCGC GGATGTCCCC GCCTGTGTCG AAAGCGCGGC CTGCATCGGT CGCGGCACGG GAACCGAACT CGAACCCATC CCTAACGACC TTGCCGGAAC GCCGGTTCTT CTCGTCAATC CGCGCATCCC GCTTGCGACC GGCCCGGTGT TCAAGGCCTG GGATGGCGTG GACCGGGGCG CGCTTGAAGG CGCCACCGCT CGCGAAGTCG CCTTTGCCGG ACGCAACGAC CTCGAGGCGC CCGCGCTCTC GCTCGTGCCG GAGATCGGCG CTGTCCTCGT TACCCTGCGC CAGACTGGCG GATGGCTCAC GCGCATGTCC GGCTCGGGCG CAACCTGCTT CGCGCTCTAC GACACGCCGG AGCAGCGCGA TCTGGCGCAG GCCGCGATGC CGCCATCGTG GTGGACACTG GGGGGAGCCC TGCGTTGA
|
Protein sequence | MQEASGLAPA PLPFQGRAMS LRETAYAKIN LALHVRRRRD DGYHELETLF AFVDAGDELT AKPAPADSLR VFGEFGGALD DPFGNIVAKA LGTLPHGEGW AVTLEKHLPV AAGLGGGSAD AGAVFRMVER SHGLPADWHA RAARLGADVP ACVESAACIG RGTGTELEPI PNDLAGTPVL LVNPRIPLAT GPVFKAWDGV DRGALEGATA REVAFAGRND LEAPALSLVP EIGAVLVTLR QTGGWLTRMS GSGATCFALY DTPEQRDLAQ AAMPPSWWTL GGALR
|
| |