Gene Caul_0412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0412 
Symbol 
ID5897686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp451568 
End bp452893 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content69% 
IMG OID641560898 
Productfumarylacetoacetase 
Protein accessionYP_001682047 
Protein GI167644384 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.784977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGC CTCCCATTGA TGAGACGCAC GATCCTTCAC GCCGCAGTTG GGTGAGCTCG 
GCCCAGGGCT CGGCGTTCCC GATCCAGAAC CTGCCGCTTG GCGTGTTCAG TCCCGCCGGC
GGACTTCCGC GCGGCGGGGT GGCGATCGGT GATCAGATCC TGGACCTGCT GGACGTCAGC
GAGTCGGGCT ATTTCGAGGG CGAGGCGCGC AGCGCCGCCG AGGCGGCGGC CCGACCCGAT
CTGACCAGCT ATCTGGGCCT GAAAGCCTCG GCGAGGCTGG CCTTGCGGCG GCGGCTTTCC
CAGCTGCTGT CAGATCCCAC GCATCAGGCG GCCTTAACGC CCTATCTGCA CGCGGCCAGC
GATTGCGCGC TGCTTATGCC GGTCCGCGTG GCCAACTACA CCGACTTCTT CGCCGGGGCC
CACCACGCCG TGAACGCGGG CAAGATGCTC CGCCCCGACG CGCCCTTGTC GCCCAACTAC
AAGCACGTGC CGATCGCCTA TCACGGACGC GCCTCATCCA TTCGGCCCAG TGGAACCCCG
GTGCGTCGTC CTTGGGGGCA AGTCCTGCCC GGCGGCGCCC AAGCCCCTGA ACTTAGGCAG
ACCCGGCGAT TGGACTACGA GTTCGAGCTG GGCCTTTGGG TGGGCGCGGG CAACGCCCTG
GGTGAGCCGC TCCCAATCGC CCAGGCCGGG GAGCATCTGG TCGGGCTGTG CATTCTCAAC
GACTTCTCCG CGCGGGACCT GCAAGCTTGG GAGGCTCAGC CGCTAGGGCC GTTCCTCAGC
AAGAATTTCC TTAGCGTGAT CTCCCCCTGG ATCGTCACCG TCGAAGCCCT GGCGCCGTTC
CGCGCCGCGC AGCGCGCAAG GCCGCAGGAG GATCCCCCGC CGCTCGCCTA CCTCTGGGAC
GAGGCTGATC AAGCCGGCGG CGCCTTGGCC ATCGCCCTGG AGGCGTTCCT GTCGACCTCG
GCCATGCGCC AGGCGGGCCT AGCGCCCGAT CGCATCGGCC TGGGACATGC CGCCCACCTC
TATTGGACGC CGGCCCAGAT GATCGCCCAT CACACCGTGG GCGGCTGTGA TCTGGGCCCT
GGGGACCTGC TGGGCACGGG AACGATATCG GCGCCGGACG CGCCGGACGT CTCGGGGGCC
GGCTGTCTGC TGGAAATGAC AAAGGCGGGG CGTCAGCCCA TCACGCTCTC GAGCGGCGAG
ACGCGCGGCT TCCTGGCGGA CGGTGACGAA ATCCTGCTGC GCGCGACCGC TCGGGCGGCG
GGATTTGTCG ATATCGGATT TGGTGAGTGC CGCGCGCAGG TGGCGCCGGC CAATGAGGAG
GCGTGA
 
Protein sequence
MLKPPIDETH DPSRRSWVSS AQGSAFPIQN LPLGVFSPAG GLPRGGVAIG DQILDLLDVS 
ESGYFEGEAR SAAEAAARPD LTSYLGLKAS ARLALRRRLS QLLSDPTHQA ALTPYLHAAS
DCALLMPVRV ANYTDFFAGA HHAVNAGKML RPDAPLSPNY KHVPIAYHGR ASSIRPSGTP
VRRPWGQVLP GGAQAPELRQ TRRLDYEFEL GLWVGAGNAL GEPLPIAQAG EHLVGLCILN
DFSARDLQAW EAQPLGPFLS KNFLSVISPW IVTVEALAPF RAAQRARPQE DPPPLAYLWD
EADQAGGALA IALEAFLSTS AMRQAGLAPD RIGLGHAAHL YWTPAQMIAH HTVGGCDLGP
GDLLGTGTIS APDAPDVSGA GCLLEMTKAG RQPITLSSGE TRGFLADGDE ILLRATARAA
GFVDIGFGEC RAQVAPANEE A