Gene Caul_2169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2169 
Symbol 
ID5899624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2353922 
End bp2354803 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content75% 
IMG OID641562660 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_001683795 
Protein GI167646132 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.103422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.102097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCG ACGCCTTCGC CCCGGCCAAG GTCAACCTGT TCCTGCATGT CGGGGGCCCC 
GACGCGGCGG GCTATCACCC GATCTCCAGC CTGATGCTGT TCGCCGACGT CGGCGACCGG
GTCAGCCTGC AGGCGGCCGA TGCGCCCAGC TTCGAGGCGA CAGGCTGGTT CGGCGCGGAG
GTTCCGGTCG ATGACGGCAA TCTGGTGGTG CGCGCCGAGA TGGCCCTGCG CGCCCGGCTG
GGCGGACCGA CCCCGCCGTT CCGCCTGATC CTCGACAAGG CCCTGCCGAT CGCCGCCGGC
CTGGGCGGCG GCTCCAGCGA CGCCGGGGCG GCCCTGCGGC TGCTGCGCGA AGCCCTGGCG
CCGGACCTGT CCGACGCCGA TCTGGAAGCC GTGGCCGGCG GCCTGGGCGC CGACGGCGCG
GCCTGCCTGT GGGGCGCGCC GGTCATGGCG CGGGGCAGGG GGGAACGCCT GTCGCCGGCT
CCGGCCTTGC CGGCCTTGCA CGCGGTGCTG GTCAATCCGC TGGTCCCGTC GCCGACCGGG
GCGGTCTACC GCGCCTATGA CGCCGCCGTC GCGCCCGAGG GGGAAGCCCC GCCGCCGATG
CTGGACGGGC TGGAGAGCAT CGAGGAGGTC TGCGCCTGGC TGGCCGGCTT CACCCGCAAC
GACCTGCAGG CGCCCGCCGT GGCCCTGGAG CCGCGGATCG GCCAGGTGCT GGACCTGTTG
GCCGACGAGC CCGAGACTCT GCTGGCCCGG ATGTCCGGCT CCGGCGCCAC CTGTTTCGCC
CTCTGCGCCG GCGATATTGA GGCCGAGGGC CTGGCCGAGC GCATCGAGCA GATGCGGCCC
GACTGGTGGG TCAAGCGCTG CCGGTTGGGC GGGCCGTTCT AG
 
Protein sequence
MRLDAFAPAK VNLFLHVGGP DAAGYHPISS LMLFADVGDR VSLQAADAPS FEATGWFGAE 
VPVDDGNLVV RAEMALRARL GGPTPPFRLI LDKALPIAAG LGGGSSDAGA ALRLLREALA
PDLSDADLEA VAGGLGADGA ACLWGAPVMA RGRGERLSPA PALPALHAVL VNPLVPSPTG
AVYRAYDAAV APEGEAPPPM LDGLESIEEV CAWLAGFTRN DLQAPAVALE PRIGQVLDLL
ADEPETLLAR MSGSGATCFA LCAGDIEAEG LAERIEQMRP DWWVKRCRLG GPF