Gene Information |
Locus tag | Caul_2169 |
Symbol | |
ID | 5899624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2353922 |
End bp | 2354803 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641562660 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001683795 |
Protein GI | 167646132 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
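The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2353922, 2354803)
print(length)                     # 882
print(protein_length_aa(length))  # 293
```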
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.103422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.102097 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCG ACGCCTTCGC CCCGGCCAAG GTCAACCTGT TCCTGCATGT CGGGGGCCCC GACGCGGCGG GCTATCACCC GATCTCCAGC CTGATGCTGT TCGCCGACGT CGGCGACCGG GTCAGCCTGC AGGCGGCCGA TGCGCCCAGC TTCGAGGCGA CAGGCTGGTT CGGCGCGGAG GTTCCGGTCG ATGACGGCAA TCTGGTGGTG CGCGCCGAGA TGGCCCTGCG CGCCCGGCTG GGCGGACCGA CCCCGCCGTT CCGCCTGATC CTCGACAAGG CCCTGCCGAT CGCCGCCGGC CTGGGCGGCG GCTCCAGCGA CGCCGGGGCG GCCCTGCGGC TGCTGCGCGA AGCCCTGGCG CCGGACCTGT CCGACGCCGA TCTGGAAGCC GTGGCCGGCG GCCTGGGCGC CGACGGCGCG GCCTGCCTGT GGGGCGCGCC GGTCATGGCG CGGGGCAGGG GGGAACGCCT GTCGCCGGCT CCGGCCTTGC CGGCCTTGCA CGCGGTGCTG GTCAATCCGC TGGTCCCGTC GCCGACCGGG GCGGTCTACC GCGCCTATGA CGCCGCCGTC GCGCCCGAGG GGGAAGCCCC GCCGCCGATG CTGGACGGGC TGGAGAGCAT CGAGGAGGTC TGCGCCTGGC TGGCCGGCTT CACCCGCAAC GACCTGCAGG CGCCCGCCGT GGCCCTGGAG CCGCGGATCG GCCAGGTGCT GGACCTGTTG GCCGACGAGC CCGAGACTCT GCTGGCCCGG ATGTCCGGCT CCGGCGCCAC CTGTTTCGCC CTCTGCGCCG GCGATATTGA GGCCGAGGGC CTGGCCGAGC GCATCGAGCA GATGCGGCCC GACTGGTGGG TCAAGCGCTG CCGGTTGGGC GGGCCGTTCT AG
|
Protein sequence | MRLDAFAPAK VNLFLHVGGP DAAGYHPISS LMLFADVGDR VSLQAADAPS FEATGWFGAE VPVDDGNLVV RAEMALRARL GGPTPPFRLI LDKALPIAAG LGGGSSDAGA ALRLLREALA PDLSDADLEA VAGGLGADGA ACLWGAPVMA RGRGERLSPA PALPALHAVL VNPLVPSPTG AVYRAYDAAV APEGEAPPPM LDGLESIEEV CAWLAGFTRN DLQAPAVALE PRIGQVLDLL ADEPETLLAR MSGSGATCFA LCAGDIEAEG LAERIEQMRP DWWVKRCRLG GPF
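The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that spacing, compute a GC fraction, and translate a coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard table (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). All names are hypothetical helpers, not a library API.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
print(translate("ATGCGCCTCG ACGCCTTCGC"))  # MRLDAF
print(gc_fraction("ATGCGCCTCG"))           # 0.7
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once its block spacing is removed in the same way.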
|
| |