Gene Information |
Locus tag | Dshi_0967 |
Symbol | |
ID | 5710658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 989049 |
End bp | 989933 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641266875 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_001532310 |
Protein GI | 159043516 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
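
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the 885 bp reported) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 989049
end_bp = 989933


def gene_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] in 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 885 294
```

Both results agree with the Gene Length and Protein Length rows, which suggests the record is internally consistent.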
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0987505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTTC TTGTTGGGCA TGTCCAAGGC CAGCCGGGCG TCTTTGCAAT CGTTGAAGAG GCGGCAACAA ACCTGACCGC CCTGATGCCG GAAATCGGCA CCGACCTGAT GGCGCTGACC CGTGATCCCG AGCTTGTGAC GAAGGCCGCA GGCCTCGTTG GACAGGGCCC GCAGACTCAA GTGGCCGACA TCACCCCCGC CCTGCCCATC GCGCATCCCG GCACCATCAT CTGCCTTGGC CTGAACTATG TCGAACACAT CCGCGAAGGT GGCTACGAGA TTCCGGGCTA TCCGGCCCTG TTCATGCGGG GGCGCAACTC GATCATGCCC TCCGGCGCCC CCATGGTGCG CCCCGCCTGC TCCGAGACAC TGGATTACGA AGCCGAACTG ATGCTGATCG TCGGCGAGGG CGGCCGCCAC ATCCCCGAGG ACGCGGCGCT GGAGCATGTC TTTGGATACA CCACCTTCAA CGACGGCTCC GTGCGGGAAT ACCAGCGCAA GACTCACCAA TGGACTCCGG GCAAGAATTT CGACGCGACC GGGGCCGTCG GCCCCATCGT GGTGACCCCG GACGAGTTGC CAGAAGGCGC GAAGGGGCTG AAGATAGAGA GCCGTGTGGG CAATGAGATC CTGCAATCCT CCAACACTGA AAACATGATC TGGGGCGCGG CCAAGACCCT CTCGATCATC TCGGAATACA CCACGCTGGA GCCCGGAGAC CTGATCGCGC TCGGCACACC GCCCGGCGTC GGCCATGCCA AGAAGCCCGG CCCCCGCTGG CTGCGTCCCG GAGAAGTGAT CGAGGTGGAA ATCGAAGGCA TCGGGACCTG CGCCAACCCG GTGGTGGACG AGACCGCCAT GGCGGCAAGA AAGGCGGCGG AGTGA
|
Protein sequence | MRFLVGHVQG QPGVFAIVEE AATNLTALMP EIGTDLMALT RDPELVTKAA GLVGQGPQTQ VADITPALPI AHPGTIICLG LNYVEHIREG GYEIPGYPAL FMRGRNSIMP SGAPMVRPAC SETLDYEAEL MLIVGEGGRH IPEDAALEHV FGYTTFNDGS VREYQRKTHQ WTPGKNFDAT GAVGPIVVTP DELPEGAKGL KIESRVGNEI LQSSNTENMI WGAAKTLSII SEYTTLEPGD LIALGTPPGV GHAKKPGPRW LRPGEVIEVE IEGIGTCANP VVDETAMAAR KAAE
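
As a rough way to relate the two sequence fields, the following sketch translates a nucleotide string and computes its GC fraction without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); alternative start codons are not handled here, which is adequate for this gene because it begins with ATG. Only the first 30 bases of the gene are embedded for brevity; the helper names are hypothetical.

```python
# Codon table built in the conventional TCAG order; the codon-to-residue
# mapping is shared by translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group residues in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence and first 10 residues of the protein,
# copied from the Sequence section.
gene_prefix = "ATGCGGTTTC TTGTTGGGCA TGTCCAAGGC"
protein_prefix = "MRFLVGHVQG"

print(translate(gene_prefix) == clean(protein_prefix))  # True
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete Protein sequence and the reported GC content.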