Gene Daci_2036 details


Gene Information

Locus tag: Daci_2036
Symbol:
ID: 5747597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 2227992
End bp: 2229308
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 70%
IMG OID: 641297118
Product: fumarylacetoacetase
Protein accession: YP_001563061
Protein GI: 160897479
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
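
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length_bp(2227992, 2229308)
protein_aa = expected_protein_length_aa(length_bp)
print(length_bp, protein_aa)  # 1317 438
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.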


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.94384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.503936
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTGA ACGAAACCCA TGACGCAGGC TTGCGCAGCT GGGTGGCCTC GGCCAACACC 
GGCGCCAGCG ACTTTCCCAT CCAGAACCTG CCGTTTGCGG TGTTCCGCCG CGCAGGCAGC
CAGGAGGCCT GGCGCGGCGG CGTGGCCATT GGCGACCAGG TGCTGGACCT GGCGCGCGCA
AGCGCGATCA AGGCGCTGGG CGATGCCGTG CAGCCGCAGC TGGAAGCCGC TTCACAACAG
CACCTGAACG GCTTCATGGC CATGGGACCT GCCGCCTGGT CGGCCCTGCG CCTGGCGCTG
TCGCGCGCGC TGCGCGAAGG CGCTGCCGCA CAGGCGGCCC TGCAGGATTG CCTGGTGGCC
CAGTCCGACG TGGAGTACAC GGTGCCGGCC CAGGTGGGCG ACTACACGGA CTTCTATACC
TCGGTGCACC ACGCCACCAA CGTCGGCCAG CTGTTCCGCC CGGACAACCC GCTGATGGAG
AACTACAAGT GGGTGCCGAT TGGCTACCAC GGCCGTGCGT CCAGCCTGCG TGTGTCGGGC
GTGGACTTCC GCCGCCCCAT GGGCCAGCTG AAGGCGCCCG ACGCCACCGC ACCCGCGCTC
AAGCCCTGCG CACGCCTGGA CTATGAGCTG GAGATGGGCA TCTACACCGG CGCCGGCAAC
GCCTGGGGCG AGGCGATTTC CATGGACGAG GCAGAGAACC ACATCTTCGG CCTGTGCCTG
CTCAACGACT GGTCGGCGCG CGACATCCAG GCCTGGGAAT ACCAGCCGCT GGGCCCCTTC
CTGTCGAAGA ACTTCGCCAC CACGGTCTCG CCCTGGATCG TGACGCTGGA GGCGCTGGAG
CCCTACCGCA CGGCCTTCAC GCGGCCCGCC ACAGATCCCC AGCCCCTGCC CTACCTGAGC
TCGGCCGCCA ACTCCGAGCG CGGCGCGTTC GACGTGCAGT TGAGCGTGGC GCTGGAGACC
GGCCGCATGC GCGCCGAAGG CCAAGCCGCC CAGCAAATCA CCCACACCAG CTACCGCCAC
GCCTACTGGA CCATGGCACA GCTGGTGGCC CACCACAGCG TCAACGGCTG CGACCTGCAG
CCCGGTGACC TGCTGGGCAC GGGCACGCTG TCCGGCCCCA CCTCCAGCGA GGCCGGTGCG
CTGCTGGAGC TGACCGAAGG CGGCAAGAAG CCCGTGGCGC TGGCCAATGG CGAGAGCCGC
ACCTTCCTGC AGGATGGCGA TGCCGTGATC CTGCGCGGCT GGTGCGAGAA GCCGGGCGCC
GCGCGCATCG GCTTCGGCGA GTGCCGCGCC ACCGTGCTGC CCGCGCGCCA GGCCTGA
 
Protein sequence
MSLNETHDAG LRSWVASANT GASDFPIQNL PFAVFRRAGS QEAWRGGVAI GDQVLDLARA 
SAIKALGDAV QPQLEAASQQ HLNGFMAMGP AAWSALRLAL SRALREGAAA QAALQDCLVA
QSDVEYTVPA QVGDYTDFYT SVHHATNVGQ LFRPDNPLME NYKWVPIGYH GRASSLRVSG
VDFRRPMGQL KAPDATAPAL KPCARLDYEL EMGIYTGAGN AWGEAISMDE AENHIFGLCL
LNDWSARDIQ AWEYQPLGPF LSKNFATTVS PWIVTLEALE PYRTAFTRPA TDPQPLPYLS
SAANSERGAF DVQLSVALET GRMRAEGQAA QQITHTSYRH AYWTMAQLVA HHSVNGCDLQ
PGDLLGTGTL SGPTSSEAGA LLELTEGGKK PVALANGESR TFLQDGDAVI LRGWCEKPGA
ARIGFGECRA TVLPARQA
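
The relationship between the gene and protein sequences can be illustrated with a short Python sketch. It is an illustration only, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in permitted start codons), and it strips the 10-base spacing used in the listing. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the listing above.
print(translate("ATGTCCTTGA ACGAAACCCA TGACGCAGGC"))  # MSLNETHDAG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field of this record.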