Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1074 |
Symbol | |
ID | 3916370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1117707 |
End bp | 1118714 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443809 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_496353 |
Protein GI | 87199096 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0872589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCTTG CCTCCCTCCC GCAGGGACGG GATGGGCGGC TGATCGTCGT GTCCGACGAC CTGGCCTGGT ATGCCGATGC CGACCACATC GTCCCGACGA TGCAGGGTCT GCTGGACGAC TGGGACCGGC ATTTCCCGTT GATCGAGGCC TTGGCGATCG AGCTTGCGCA CGAAGCCATT CCGCTCAAGC GCTTTCACGA GCGCGAGGCC TGCGCACCCT TGCCGCGCGC CTTCCAGTGG GCAGACGGCA GCGCCTATGT GAATCACGTC GAACTGGTGC GGAAGGCGCG TGGGGCGGAG CTGCCTGAAA GCTTCTGGAC CGACCCGCTG ATGTATCAGG GCGGCAGCGA CGACATGCGC GGCGGGCGCG ATCCCCTCGT ACTGGCGGAC GAAGCCTGGG GTGGGGACTT CGAGGCGGAA ATCGTCGTCG TCACCGGCGA CGTGCCCCAG GGCGTCAGCC CCGAAGAAGC GCTGAAGCAT ATCCGCCTTG TCGGTCTGGT CAACGACGTG TCGTTGCGCA ATCTCATCCC CGGCGAACTG TCCAAGGGCT TCGGCTTCGT CCAGTCCAAG CCGGCCAGTC ACTTTTCCCC GGTCTTCGTG ACGCCGGAAA CGCTCGGTGA CGCATGGGCT GGCGGCAGAC TGTCGCAGAC GCTGATGGTC GACCTCAACG GCGAGCCGTT CGGGCGCATC GAGGCTGCGG AAGAGTGCAC GTTCGACTTC GGCGTGCTGA TCGCGCATCT TGCGAAGACG CGCAGCATCG GGGCCGGTTC GATCATCGGA TCGGGCACCG TGTCCAATCG CGATCCCGAC GGTTCGCCAG GACGCCCGGT GGCCCAGGGC GGGCGCGGCT ATGCCTGCAT CGCCGAACAG CGCATGGTCG AGACGATCGC ATCGGGCCAG CCGTCGACGG CCTTCCTGCG ATGGGGCGAT ACAGTCCGGA TCGAAATGCG CGACGACAAG GGCAAGAGCA TCTTCGGCGC CATCGAGCAG ACCGTCGTCC GCCCCTGA
|
Protein sequence | MKLASLPQGR DGRLIVVSDD LAWYADADHI VPTMQGLLDD WDRHFPLIEA LAIELAHEAI PLKRFHEREA CAPLPRAFQW ADGSAYVNHV ELVRKARGAE LPESFWTDPL MYQGGSDDMR GGRDPLVLAD EAWGGDFEAE IVVVTGDVPQ GVSPEEALKH IRLVGLVNDV SLRNLIPGEL SKGFGFVQSK PASHFSPVFV TPETLGDAWA GGRLSQTLMV DLNGEPFGRI EAAEECTFDF GVLIAHLAKT RSIGAGSIIG SGTVSNRDPD GSPGRPVAQG GRGYACIAEQ RMVETIASGQ PSTAFLRWGD TVRIEMRDDK GKSIFGAIEQ TVVRP
|
| |