Gene Caul_3853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3853 
Symbol 
ID5901315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4168341 
End bp4169531 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content72% 
IMG OID641564375 
Product3-oxoadipate enol-lactonase 
Protein accessionYP_001685477 
Protein GI167647814 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
[COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit 
TIGRFAM ID[TIGR02425] 4-carboxymuconolactone decarboxylase
[TIGR02427] 3-oxoadipate enol-lactonase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.126877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.866495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTCG CCACCCACGA CGGCGCGCGG ATCTACTGGC GGCTGGACGG CGCGGCCGAC 
AAGCCCGCCC TGGTGCTGCT GAACTCGATC GGCACGGACA TGGGGCTGTA CGACGCCGCC
GCCCCGCTGC TGCTGGCCGA CTTCCGCCTG CTGCGCATCG ACACGCGCGG CCATGGCGCC
TCGGACGCCC CGGCCGTCGA CTACACGCTG GACCAGTTGG CGGGCGACGC GCTGGCGGCG
ATGGACGCGG CGGGCCTGGC CACGGCCAGC GTCGTCGGCG TCTCGCTGGG CGGCATGGTC
GCCATGGCCC TGGCCCTGAA GGCGCCCGAG CGGGTCGAGG GGCTGGTCCT GGCCTGCACC
TCGGCGGCCA TGGACGTCGC CGCCTGGACC GCCCGCATCG CCACCGTCCG CGCCGAAGGC
ATGGCCGCCA TCGCCGAGAT GGCCCTGGGC CGGTTCTTCT CCGAGCCCTT CCGCGGCCAG
CATCCCGCCA CGGTCGAGAC CGTGCGCGCC GGTCTGCTGG CCATGAGCCC CGATGGCTAC
AGCGGCTGCG GAGCCGCGAT CCGCGACATG GACCTGCTGG CGCGGATCTC CGCCATCACC
GCCCCCACCC TGGTGATCGG CGGACGCAAG GACGTCTCGA CGCCGTTCGA GGGGAATGGC
GACCGGATCG TGGCGGCCAT CCCGGGCGCG ACCTCGGCCA TGCTCGACAC CGCTCACCTG
CCCAGCCTGG AAGACCCCAC CGCCTTCGCC GGCGCCGTGC GCAGCTTTCT GGCCAGCACC
CGCGACGGTT CGGGCGTCAG CGCGGCGGCG GACGTGCTGT TCGAGGCCGG CCTCGTCCAT
CGTCGCAAGG TGCTGGGCGA CGCCTGGGTC GACCGTTCGC TGGCCAAGCG CACCCCCTTC
ACCGCCGACT ACCAGGCGAT GATCACCCGC TATGCCTGGA ACGAGATCTG GGGCCGGCCG
GGCCTGGACC ATCGCACGCG TCGCCTGCTG GTGCTGGCCA TCTGCGCCTC GCTGGCCCGC
TGGGAAGAGT TTCGCCTCCA CGTCCGCGCC GGCCTGGAAC AGGGCGGCTT CACCCAGGAC
GAGCTCAAGG AGGTGCTGAT GCAGACGGCG ATCTATGCGG GGGTGCCCGC CGCCAACACC
GCCTTCACGG AGGCCGCCGA AATCATCGCC GAGTTGGGCG GGGCTGATTG A
 
Protein sequence
MAFATHDGAR IYWRLDGAAD KPALVLLNSI GTDMGLYDAA APLLLADFRL LRIDTRGHGA 
SDAPAVDYTL DQLAGDALAA MDAAGLATAS VVGVSLGGMV AMALALKAPE RVEGLVLACT
SAAMDVAAWT ARIATVRAEG MAAIAEMALG RFFSEPFRGQ HPATVETVRA GLLAMSPDGY
SGCGAAIRDM DLLARISAIT APTLVIGGRK DVSTPFEGNG DRIVAAIPGA TSAMLDTAHL
PSLEDPTAFA GAVRSFLAST RDGSGVSAAA DVLFEAGLVH RRKVLGDAWV DRSLAKRTPF
TADYQAMITR YAWNEIWGRP GLDHRTRRLL VLAICASLAR WEEFRLHVRA GLEQGGFTQD
ELKEVLMQTA IYAGVPAANT AFTEAAEIIA ELGGAD