Gene Daci_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_0074 
Symbol 
ID5745610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp80757 
End bp82427 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID641295137 
Productbenzoyl-CoA-dihydrodiol lyase 
Protein accessionYP_001561106 
Protein GI160895524 
COG category[I] Lipid transport and metabolism 
COG ID[COG1024] Enoyl-CoA hydratase/carnithine racemase 
TIGRFAM ID[TIGR03222] benzoyl-CoA-dihydrodiol lyase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCCGT CCAGCCCCAC TGTCGATTAC CGCACCGAGC CCGCCCAGTA CCGCCACTGG 
TCGTTCACCG TCGAGGGCGC CATCGCCCGC ATGCAGCTGA ACATCGCCGA GGACGGCGGC
ATACGGCCCG GCTACAAGCT CAAGCTCAAC AGCTACGACC TGGGCGTGGA CATCGAGCTG
CACGACGCGC TCAATCGCAT CCGCTTCGAG CATCCGCAGG TGCGCACCGT CATCGTCACC
AGCGGGCGCG ACCGCATCTT CTGCTCGGGC GCCAACATCT TCATGCTGGG CGTGTCCAGC
CATGCCTGGA AGGTGAACTT CTGCAAGTTC ACGAACGAGA CGCGCAACGG CATCGAGGAC
AGTTCGCGCC ACTCGGGCCT GAAGTTCCTG GCCGCCGTCA ACGGGGCCTG CGCGGGCGGC
GGCTACGAGC TGGCCCTGGC CTGCGACGAC ATCGTGCTGA TCGATGACCG CTCCTCCTCC
GTCTCCCTGC CCGAGGTGCC GCTGCTGGGC GTGCTGCCCG GCACGGGCGG CCTCACGCGC
GTGACCGACA AGCGCCATGT GCGCCATGAC CTGGCCGACA TCTTCTGCAC CAGCGTGGAA
GGCGTGCGCG GCCAGCGCGC CGTGGACTGG CGCCTGGTCG ATGCCGTGGC CAAGCCCGCG
CAGTTCGAGG CCGTGGTGGC CCAGCGCGCG CAGCGCCTGG CCGAGGGCAG CACGCGCCCC
GCCAATGCGC AGGGCATCAC GCTCACGCGC CTGGAACGCG AAGCCACGGA CGACGCGCTG
CACTACCGCC ATGTCAGCGT GCAGATCGAC CGCGCGCGCC GCAGCGCCAC GCTGACCGTG
AAGGCGCCCA CGCTTGGGCA AGGCCCGCAG CCGGCCACCA TCGAAGCCAT CGAGCAGGCC
GGCGACGCCT GGTGGCCGCT GGCCATGGGC CGCGAGCTGG ACGACGCCAT CCTGCACCTG
CGCACCAATG AGCTGGACGT GGGCACCTGG CTGCTCAAGA CCGAAGGCGA TGCCGCCGCC
GTGCTGGCCG CAGACGCAGC CCTGCTGGCC CACCAGGACC ACTGGCTGGC GCGCGAGACG
CTGGGCCTGC TGCGCCGCAG CTTTGCGCGG CTGGATGTGT CCTCGCGCAC GCTGTTCGCG
CTGATCGAGC CCGGCTCCTG CTTCGTGGGC ATGCTGGCCG AGCTGGCCTT CGCGGCCGAC
CGCGCCTACA TGCTGGTGCT GCCCGATGAC ACCGAGCGCG CCCCGCGCAT CCAGCTGGAC
GAGTTCAATT TCGGCCTCCT GCCCCTGGTC AACGACCAGT CGCGCCTGCA GCGCCGCTTC
TACGAGGAGG CCGCGCCGCT GGAGGCCGCG CGCGCCGCCA CGGGCCGCGC GCTGGACGGC
GACCAGGCCC TGGCCCTGGG GCTGGTCACG GCCGCGCCCG ACGACATCGA CTGGGACGAC
GAGATCCGCA TCGCCATGGA AGAGCGCGCC GCCATGTCGC CCGATGCGCT CACGGGCCTG
GAGGCCAACC TGCGCTTTGC CAGCCGCGAG AACATGGTCA CGCGCATCTT CGGGCGCCTG
TCGGCCTGGC AGAACTGGAT CTTCAACCGC CCCAATGCCG TGGGCGACAA GGGCGCTCTC
AAGCTCTACG GCACGGGCCA GAAGGCCGGC TTCGACTTCA ACCGCGTCTG A
 
Protein sequence
MHPSSPTVDY RTEPAQYRHW SFTVEGAIAR MQLNIAEDGG IRPGYKLKLN SYDLGVDIEL 
HDALNRIRFE HPQVRTVIVT SGRDRIFCSG ANIFMLGVSS HAWKVNFCKF TNETRNGIED
SSRHSGLKFL AAVNGACAGG GYELALACDD IVLIDDRSSS VSLPEVPLLG VLPGTGGLTR
VTDKRHVRHD LADIFCTSVE GVRGQRAVDW RLVDAVAKPA QFEAVVAQRA QRLAEGSTRP
ANAQGITLTR LEREATDDAL HYRHVSVQID RARRSATLTV KAPTLGQGPQ PATIEAIEQA
GDAWWPLAMG RELDDAILHL RTNELDVGTW LLKTEGDAAA VLAADAALLA HQDHWLARET
LGLLRRSFAR LDVSSRTLFA LIEPGSCFVG MLAELAFAAD RAYMLVLPDD TERAPRIQLD
EFNFGLLPLV NDQSRLQRRF YEEAAPLEAA RAATGRALDG DQALALGLVT AAPDDIDWDD
EIRIAMEERA AMSPDALTGL EANLRFASRE NMVTRIFGRL SAWQNWIFNR PNAVGDKGAL
KLYGTGQKAG FDFNRV