Gene Caul_2603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2603 
SymbolispDF 
ID5900058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2823454 
End bp2824605 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content70% 
IMG OID641563094 
Productbifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein 
Protein accessionYP_001684228 
Protein GI167646565 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.328797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCT CCGCCGTCAT CGTCGCCGCC GGTTCCGGAT CTCGCGCAGG CTCCGGCCAG 
GCCAAGCAAT GGCGGGTCGT GGCGGGGAAA CCCGTGTTGC GCTGGTCGGT CGAGGCCTTG
TTGAAGGCCG GCGCCCAAAA CCTTGTGATC GTGGCGGATC CCGCCGCTCG CGAGGCCCTG
GAAGACGCCC TCGACGGCCT TTCCGGCTGG ACCACGACGG CGGGCGGCGC GACTCGCGCG
CGCTCCGTCC AGGCCGGATT GGCGGCCTTG ACCGAGCGTC CCGGCGCCGA GCCGGTGCTG
ATCCATGACG CCGCTCGCCC CTTTCTCGGC GCCGCAACAA TCGCCTCGGT GCTGCGCGCC
CTCGACGACG CCGATGGTGC AATTCCAGCC TTGCCGGTGG CCGATACGCT GAAAAGCGGA
GCGCCCGACG CGGCCATTGT CACAAAATCG CGTGACGATC TGTGGCGCGC CCAGACCCCC
CAGGCCTTCC GCCGCGACCG CCTGCTGGCC GCCTACGCCG CCTGGACCGG ACCGGACGAA
CCGACCGACG ACGCCCAGGT GGTCGAGCGC CATGGCGGCC GCGTGGTCGT CACGCCGGGC
GACCCGATGC TGATGAAACT GACCTATCCG GAGGACTTCG CCATGGCTGA ACGACTGGCC
GGCGCGACGC GCGTCACCCG GATGGGCCAG GGCTTCGACG CCCACCGCTG GGGACCCGGC
GAGTCGGTCT GGCTGTGCGG CGTGCAGATC GCCCACGACG AGACCCTGAT CGGCCATTCT
GACGCCGACG CCGGGCTGCA CGCCCTGACC GACGCCATCC TCGGGGCGAT CGGCGAAGGC
GACATCGGCG ACCACTTCCC GCCCACCGAT CCCCAATGGA AGGGCGCGGC GTCCGATAAG
TTCCTGATCC ACGCCGTTGA TCTGGTTCGC CAACGTGGCG GGACCCTGGT CAATGTCGAC
GTGACCCTGA TCTGCGAGCG GCCGAAGATC AAACCGCACC GCGCGGCCAT GCGGCAGCGC
CTGGCCGATA TCCTCGACCT GCCGCTCGAC CGGGTGAGCG TCAAGGCGAC CACCACCGAG
GGCATGGGCT TCACCGGCCG TGGCGAAGGC CTGGCCGCCC AGGCCATCGC CGTGGTCGAG
ACGCCGGCAT GA
 
Protein sequence
MTFSAVIVAA GSGSRAGSGQ AKQWRVVAGK PVLRWSVEAL LKAGAQNLVI VADPAAREAL 
EDALDGLSGW TTTAGGATRA RSVQAGLAAL TERPGAEPVL IHDAARPFLG AATIASVLRA
LDDADGAIPA LPVADTLKSG APDAAIVTKS RDDLWRAQTP QAFRRDRLLA AYAAWTGPDE
PTDDAQVVER HGGRVVVTPG DPMLMKLTYP EDFAMAERLA GATRVTRMGQ GFDAHRWGPG
ESVWLCGVQI AHDETLIGHS DADAGLHALT DAILGAIGEG DIGDHFPPTD PQWKGAASDK
FLIHAVDLVR QRGGTLVNVD VTLICERPKI KPHRAAMRQR LADILDLPLD RVSVKATTTE
GMGFTGRGEG LAAQAIAVVE TPA