Gene Caul_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3651 
Symbol 
ID5901106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3939503 
End bp3940510 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content70% 
IMG OID641564162 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_001685276 
Protein GI167647613 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.188386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCG CGTCTCTGAA GGGCGGCCGC GACGGCCGGC TGGTGATGGT CTCCAACGAC 
TTGGCCTGGT TCACCGACGC CGGAACGATC GCCCCGACCC TGCAAGCCGC TCTCGACGAC
TGGGAGCGCT GCGAGCCGAT GCTGCGCGCC CTGGCCGAGA GCCTCGAGCA CGGCGGCGTG
CCGCGCGAGC GCTTCCACGA GCACGAGGCC GCTAGCCCCT TGCCCCGCGC CTATCAGTGG
GTCGACGGCA GCGCCTATGT GAACCACGTG CAACTGGTGC GCCAGGCGCG GGGGGCGCAG
ATGCCGGAGA GCTTCTGGAC CGATCCCCTG ATGTACCAGG GCGCCTCGGA CGGGTTCCTG
GGGCCGCGCG ACGCGATCCC GCTGGCTGAC GAGGCCTGGG GCTGCGACCT GGAGGGCGAG
GTGGCCGTGG TCACCAGCGA CGTGCCGCTG GGCGCGAGCC GCGAGGAGGC CCTGGCGGCG
ATCCGCCTGG TGATGCTGGT CAACGACGTT TCCCTGCGCG CCCTGATCCC CGCCGAACTG
GCCAAGGGTT TCGGCTTCGT GCAGTCCAAG CCGGCCAGCG CCTTCTCGCC GGTCGCCGTC
TCGGTCGACG CCCTGGGCGA GGCCTGGAAG GACGGCAAGC TGTCGGGCGC TCTGCTGGTT
GAACTGAACG GCGAGGAGTT CGGCAGGGCC GATGCGGGGG TCGACATGAC CTTCGATTTC
GGAACCCTGG TGGCCCACGC CGCCAAGACC CGCGCCTTGG CCGCCGGCAC GATCGTCGGC
TCGGGCACGG TGAGCAACCG CGACGCGGAC GGCGGTCCCG GCAAGCCGGT GGCCGACGGC
GGCCTGGGCT ATTCGTGCCT GGCCGAGCTG CGCACGATCG AGACCTTGCA GACCGGCCAC
CCCAAGACGC CGTTCTTGAA GGTCGGCGAC ACCGTCCGCA TCGAGATGCG CGACGCCAAG
GGCCATACGA TCTTCGGCGC CATCGAGCAG GCGGTGGCTT CCATTTGA
 
Protein sequence
MKLASLKGGR DGRLVMVSND LAWFTDAGTI APTLQAALDD WERCEPMLRA LAESLEHGGV 
PRERFHEHEA ASPLPRAYQW VDGSAYVNHV QLVRQARGAQ MPESFWTDPL MYQGASDGFL
GPRDAIPLAD EAWGCDLEGE VAVVTSDVPL GASREEALAA IRLVMLVNDV SLRALIPAEL
AKGFGFVQSK PASAFSPVAV SVDALGEAWK DGKLSGALLV ELNGEEFGRA DAGVDMTFDF
GTLVAHAAKT RALAAGTIVG SGTVSNRDAD GGPGKPVADG GLGYSCLAEL RTIETLQTGH
PKTPFLKVGD TVRIEMRDAK GHTIFGAIEQ AVASI