Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3811 |
Symbol | |
ID | 5714340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | - |
Start bp | 18901 |
End bp | 20142 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641276726 |
Product | fumarylacetoacetase |
Protein accession | YP_001542022 |
Protein GI | 159046351 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.685186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.278222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGGC TGATCCACTC CTGGGTGCCG GGGGCAAACG ACCCGGCGGG GGCTTTCCCG CTCAACAACC TGGCCTTCGG GGTGTTCTCG ACCGGGGATG GGCCGCGCTG CGCCGTGGCC ATCGGCGACA AGGTGCTGGA CCTTGCCGCA TTGCAGGCCG CGGGACTGCT GCCCGATCAC GGGTTCGACG CGCCCGCGCT CGACACCTTC ATGGGGCGCG GCCAGCCCGC CTGGCAGGCG GTGCGCGAGG CCCTGACCGA GCTGCTCCGC GCGGGGGCGG AGACCGCGCC GGTGCGCGCC GCCCTGCACG ACCGGGCCGG CGTCCGCCTG CATCTGCCCT TCACCCTTGC CGAGTTCACC GACTTCTACG CCGGGCGGCA ACACGCGTTC AATGTCGGCT CTCTCTTCCG CGATCCGGCC AATGCCCTGC CGCCCAACTG GCTGCACATG CCCATCGGCT ATAACGGCCG GGCCTCCACC GTGGTGGTCT CGGGCACGCC GATCCACCGC CCGGCGGGCC AGATCAAGGA CCCCTCCGAC CCGATGCCGC GCTTCGGCCC CTGCGAAAGG CTCGATTTTG AGCTGGAGCT GGGCGCGGTG GTCGGCACCC CGTCGCAGAT GGGCGTCCCG GTCACGGTGG ATGAGGCCGA CGAGATGATC TTCGGCTACG TGCTGCTGAA CGACTGGTCC GCGCGGGATA TCCAGGCCTG GGAATACGTC CCCCTCGGCC CGTTTCAGGG CAAGGCGTTC GCCACCACGA TCAGCCCGTG GGTGGTCCCC CGGGCCGCTC TCGCGCCGTT CCGCTGCGGC CCGCCGGTGC GGGAGGTGCC GCTGCTGCCC CATCTGCGCG ACACCGGGCC GATGTTCCAC GATATCGACC TGGCGGTGAC CCTCGCGCCC CCGGGCGGTG CCCCGACCGA GGTCTGCCGG ACCAATTCCA ACGCGCTCTA CTATTCCGCC GCGCAGCTTC TGGCCCACCA CAGCACGTCG GGCTGCGCGA TGCGGACGGG CGATCTGCTG GGCTCGGGCA CGATCTCGGG GCCGGAGAAG GGCATGTTCG GCTCTCTGCT AGAGATCACC TGGGGCGGGC GCGATCCGGT CGCGCTGGCG GGCGGGGCGA CGCGCCGCTT TCTGGCCGAT GGGGACACCG TCACGCTGAA GGGGGAAGCC CGGGGCGATG GCTACCGGAT CGGGTTCGGG ACGTGTACCG GCACCATCCT GCCCGCGCCG AAACGGCCCT AA
|
Protein sequence | MSGLIHSWVP GANDPAGAFP LNNLAFGVFS TGDGPRCAVA IGDKVLDLAA LQAAGLLPDH GFDAPALDTF MGRGQPAWQA VREALTELLR AGAETAPVRA ALHDRAGVRL HLPFTLAEFT DFYAGRQHAF NVGSLFRDPA NALPPNWLHM PIGYNGRAST VVVSGTPIHR PAGQIKDPSD PMPRFGPCER LDFELELGAV VGTPSQMGVP VTVDEADEMI FGYVLLNDWS ARDIQAWEYV PLGPFQGKAF ATTISPWVVP RAALAPFRCG PPVREVPLLP HLRDTGPMFH DIDLAVTLAP PGGAPTEVCR TNSNALYYSA AQLLAHHSTS GCAMRTGDLL GSGTISGPEK GMFGSLLEIT WGGRDPVALA GGATRRFLAD GDTVTLKGEA RGDGYRIGFG TCTGTILPAP KRP
|
| |