Gene Caul_2715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2715 
Symbol 
ID5900170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2949814 
End bp2950914 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content72% 
IMG OID641563207 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_001684340 
Protein GI167646677 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.525402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG GCATCGACGA GGATTTCTGG CGGGGCAGGC GGGTGCTGCT GACCGGACAT 
ACCGGTTTCA AGGGCGGCTG GATGGCCTTG TGGCTCGAGC GGCTGGGCGC GACCGTGCGC
GGCGTGGCCT TGCCCCCGGA CACCGAGCCG AACCTGTTCG ACGCGGCGCG GATCGGCGAC
GGCCTGGACA GCGTGATCGC CGACGTCCGC GATCCCCACG CGGTCGCGGC GGCCGTGCTC
GACTTCACGC CGTCCGTGGT CCTGCACATG GCCGCCCAGC CGCTGGTGCG GCGATCCTAT
GACGAGCCGC GCGAGACCTT CGCCACCAAC CTGATGGGCA CGGTCAATCT GCTGGACGCC
GTCCGCCGCC TGCCGCGCCC AGCCACCACG CTGATCGTCA CGACCGACAA GGTCTATGAG
AACCTTGAAC ACGGGCAGCC GTACCGGGAG GGTGACCGCC TGGGCGGACG GGACCCCTAC
AGCGCCAGCA AGGCCTGCGC CGAACTGGCG GTCAGGGCCT ATTTCGCCGC CTATCTGGGA
CCGGCCGGCG CGGCGGTGGG CGTGGCCCGG GCCGGCAACG TGATCGGCGG CGGCGACTGG
TCGCGCGATC GCCTGTTGCC GGACATCCTG TCGGCGTTCG CGCGCGGTGA GCCGGCGATC
CTGCGCAATC CCGGCGCCAT ACGCCCCTGG CAGCACGTGC TGGAACCGCT GCACGGCTAC
CTTCTGGCGA TCCAGGCCCT GGCCGCCTCG CCGGAAGCGG GTTTGCGCGC CTGGAACTTC
GGTCCCGAAG CGGACGGCGC GCGATCGGTC GGCGCGGTGG CGCGGCTGGC GGCGGACGCC
TGGGGCGAGG GCGCGCGCCT GGTCGAGCAG GTCGATCCGA ATGCGCCGCA CGAGGCGCGC
CTGCTGACCT TGAGCTCCGA TCTCGCCAAG GCCGAGCTGG GCTGGAGACC GCGCCTGGAT
CTGGAGACGG CCATCGCGCT GACGACGGAC TGGTGGCGTG GCGTCCTGGC CGGCCGCGAT
GCGCGCGCGG CGAGCCTGGA GCAGATCGAT CGCTACGTGG GGCGCGGCGT GTCAGCCAGC
CGCCCCGTCG CTGCGAGATA G
 
Protein sequence
MTIGIDEDFW RGRRVLLTGH TGFKGGWMAL WLERLGATVR GVALPPDTEP NLFDAARIGD 
GLDSVIADVR DPHAVAAAVL DFTPSVVLHM AAQPLVRRSY DEPRETFATN LMGTVNLLDA
VRRLPRPATT LIVTTDKVYE NLEHGQPYRE GDRLGGRDPY SASKACAELA VRAYFAAYLG
PAGAAVGVAR AGNVIGGGDW SRDRLLPDIL SAFARGEPAI LRNPGAIRPW QHVLEPLHGY
LLAIQALAAS PEAGLRAWNF GPEADGARSV GAVARLAADA WGEGARLVEQ VDPNAPHEAR
LLTLSSDLAK AELGWRPRLD LETAIALTTD WWRGVLAGRD ARAASLEQID RYVGRGVSAS
RPVAAR