Gene Dshi_3811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3811 
Symbol 
ID5714340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp18901 
End bp20142 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content71% 
IMG OID641276726 
Productfumarylacetoacetase 
Protein accessionYP_001542022 
Protein GI159046351 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.685186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.278222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGC TGATCCACTC CTGGGTGCCG GGGGCAAACG ACCCGGCGGG GGCTTTCCCG 
CTCAACAACC TGGCCTTCGG GGTGTTCTCG ACCGGGGATG GGCCGCGCTG CGCCGTGGCC
ATCGGCGACA AGGTGCTGGA CCTTGCCGCA TTGCAGGCCG CGGGACTGCT GCCCGATCAC
GGGTTCGACG CGCCCGCGCT CGACACCTTC ATGGGGCGCG GCCAGCCCGC CTGGCAGGCG
GTGCGCGAGG CCCTGACCGA GCTGCTCCGC GCGGGGGCGG AGACCGCGCC GGTGCGCGCC
GCCCTGCACG ACCGGGCCGG CGTCCGCCTG CATCTGCCCT TCACCCTTGC CGAGTTCACC
GACTTCTACG CCGGGCGGCA ACACGCGTTC AATGTCGGCT CTCTCTTCCG CGATCCGGCC
AATGCCCTGC CGCCCAACTG GCTGCACATG CCCATCGGCT ATAACGGCCG GGCCTCCACC
GTGGTGGTCT CGGGCACGCC GATCCACCGC CCGGCGGGCC AGATCAAGGA CCCCTCCGAC
CCGATGCCGC GCTTCGGCCC CTGCGAAAGG CTCGATTTTG AGCTGGAGCT GGGCGCGGTG
GTCGGCACCC CGTCGCAGAT GGGCGTCCCG GTCACGGTGG ATGAGGCCGA CGAGATGATC
TTCGGCTACG TGCTGCTGAA CGACTGGTCC GCGCGGGATA TCCAGGCCTG GGAATACGTC
CCCCTCGGCC CGTTTCAGGG CAAGGCGTTC GCCACCACGA TCAGCCCGTG GGTGGTCCCC
CGGGCCGCTC TCGCGCCGTT CCGCTGCGGC CCGCCGGTGC GGGAGGTGCC GCTGCTGCCC
CATCTGCGCG ACACCGGGCC GATGTTCCAC GATATCGACC TGGCGGTGAC CCTCGCGCCC
CCGGGCGGTG CCCCGACCGA GGTCTGCCGG ACCAATTCCA ACGCGCTCTA CTATTCCGCC
GCGCAGCTTC TGGCCCACCA CAGCACGTCG GGCTGCGCGA TGCGGACGGG CGATCTGCTG
GGCTCGGGCA CGATCTCGGG GCCGGAGAAG GGCATGTTCG GCTCTCTGCT AGAGATCACC
TGGGGCGGGC GCGATCCGGT CGCGCTGGCG GGCGGGGCGA CGCGCCGCTT TCTGGCCGAT
GGGGACACCG TCACGCTGAA GGGGGAAGCC CGGGGCGATG GCTACCGGAT CGGGTTCGGG
ACGTGTACCG GCACCATCCT GCCCGCGCCG AAACGGCCCT AA
 
Protein sequence
MSGLIHSWVP GANDPAGAFP LNNLAFGVFS TGDGPRCAVA IGDKVLDLAA LQAAGLLPDH 
GFDAPALDTF MGRGQPAWQA VREALTELLR AGAETAPVRA ALHDRAGVRL HLPFTLAEFT
DFYAGRQHAF NVGSLFRDPA NALPPNWLHM PIGYNGRAST VVVSGTPIHR PAGQIKDPSD
PMPRFGPCER LDFELELGAV VGTPSQMGVP VTVDEADEMI FGYVLLNDWS ARDIQAWEYV
PLGPFQGKAF ATTISPWVVP RAALAPFRCG PPVREVPLLP HLRDTGPMFH DIDLAVTLAP
PGGAPTEVCR TNSNALYYSA AQLLAHHSTS GCAMRTGDLL GSGTISGPEK GMFGSLLEIT
WGGRDPVALA GGATRRFLAD GDTVTLKGEA RGDGYRIGFG TCTGTILPAP KRP