Gene Information |
Locus tag | Sare_4686 |
Symbol | |
ID | 5704313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5308187 |
End bp | 5309383 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274084 |
Product | fumarylacetoacetase |
Protein accession | YP_001539430 |
Protein GI | 159040177 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
|
|
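The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(5308187, 5309383)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Because the gene lies on the minus strand, the same span length applies; only the reading direction along the replicon differs.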
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.454046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.30172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGG TGACCGGCCT CGAGGGCTCG CCGTACGGGG TGCACAACCT GCCGTACGGG GTGTTCCGCA CCGCAGCCCG TGAGCCGCGG GTGGGCGTAC GCGTCGGTGC CCACGTGCTC GACCTCGGCG GTGCGGAGCA GGCTGGTCTG GTGTTTGCTG CCGGCACGTT GGGCCGCCCC CGGCTGAACG ACTTCATGGC GCTCGGTCGA CCGCAGTGGA CGGCGGTGCG GCAGCGGGTC ACCGAGCTGC TCACCGACAG CGCCCACCGG GCAGCGGTCG AGCCGCTGCT CCTGCCGCTG CTGGACGTCG AGCTGCTGCT TCCCTTCGAC GTGGCCGACT ACGTCGACTT CTACTCTTCC GAGCAGCACG CCACCAACGT CGGCAAGATC TTCCGCCCCG GTCAGCCACC GCTGTTGCCC AACTGGAAGC ACCTGCCGAT CGGCTACCAC GGCCGGGCCG GCACCGTCGT GGTCTCCGGC ACCCCGATCG TCCGGCCGTG CGGCCAGCGG GCCACCCCGC AGGGCCCGGT CACCGGTGCC TCCGTCCGCC TCGACATCGA GGCGGAGGTC GGCTTCGTGG TGGGAGTGCC CAGCCCGCTG GGCCACCGCG CCGCAGCCGG CGACTTCGCC GACCACGTGT TCGGCGTGGT ACTGGTCAAC GACTGGTCCG CCCGGGACAT CCAAGCGTGG GAGTACCAAC CTCTCGGCCC GTTCCTCGGT AAGTCCTTTG CCACGTCGAT CGCCGCCTGG GTGACGCCGC TGGAGGCTCT TGGCGCGGCG TTCGTACCAG CGCCGGACCA GGATCCGCCC GTCGCCGACT ATCTGCGCGA CGAGCCCCAC CTCGGATTGG ACCTGCGTCT GTCGGTGGAG TGGAACGGTG AGCGGGTGAG CGAGCCACCG TTCGCCGCGA TGTACTGGAC GCCGGCACAG CAACTAGCCC ACCTGACCAT CAACGGAGCC GCGTTGCGCA CCGGTGACCT GTACGCCTCC GGCACCGTGT CCGGGGACGA GCGGCACCAG GTCGGCTCGT TCCTCGAGCT GACCTGGGGC GGTACGGAGC CCGTGCGAGT TGGCGGCGAG GAACGGATGT TCCTGGCGGA CGGCGACACC GTCACCATCT CTGGTACCGC GCCCGGACCG GACGGTACGA CCGTCGGCCT CGGCGAGGTC ACCGGCACCA TCATCAGCCC CCGCTGA
|
Protein sequence | MTWVTGLEGS PYGVHNLPYG VFRTAAREPR VGVRVGAHVL DLGGAEQAGL VFAAGTLGRP RLNDFMALGR PQWTAVRQRV TELLTDSAHR AAVEPLLLPL LDVELLLPFD VADYVDFYSS EQHATNVGKI FRPGQPPLLP NWKHLPIGYH GRAGTVVVSG TPIVRPCGQR ATPQGPVTGA SVRLDIEAEV GFVVGVPSPL GHRAAAGDFA DHVFGVVLVN DWSARDIQAW EYQPLGPFLG KSFATSIAAW VTPLEALGAA FVPAPDQDPP VADYLRDEPH LGLDLRLSVE WNGERVSEPP FAAMYWTPAQ QLAHLTINGA ALRTGDLYAS GTVSGDERHQ VGSFLELTWG GTEPVRVGGE ERMFLADGDT VTISGTAPGP DGTTVGLGEV TGTIISPR
|
| |
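The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG) and uses the standard codon assignments, which translation table 11 shares with table 1; the two tables differ in the start codons they permit, which does not matter for an ATG start. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in the record above.
print(translate("ATGACCTGGG TGACCGGCCT CGAGGGCTCG"))  # MTWVTGLEGS
```

Passing the full gene sequence to `translate` and `gc_percent` is one way to cross-check the listed protein sequence and the reported GC content.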