Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3580 |
Symbol | |
ID | 5077729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 198148 |
End bp | 199557 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640481304 |
Product | carotenoid oxygenase |
Protein accession | YP_001165966 |
Protein GI | 146275806 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGACCA AAGGCGTCGT CGTCGTCTCC AGCTTCCGCC GCAGCCGCCA GGACGCGAAC CGCCCCCACG CTTTCCTCAC CGGCATCCAC GCCCCGGTGA AGGAAGAACG CACGATCGAA GATCTCGCCG TCACCGGCAC GATCCCGGCG GAGCTTTCGG GCCGCTACGT GCGCATCGGC CCCAATCCCT TCCGCGCCGA TCCGCGCGGG CACCACTGGT TCGTGGGCGA CGGCATGGTC CACGGCGTCT GCATGAAGGG CGGCAAGGCG CTGTGGTATC GCAACCGCTA TGTCCGCTCG CGCAACCTCC AGGATGCCGG AGGCCCAGCC GCCGCGCCCG GTCCGCGTCG CTCCACCTTC GACACGGTGA ACACCAACGT CATCCAGCAC GCCGGCCGCA CGTTCGCGCT GGTCGAAGCC GGGTCCTTTC CCGTCGAACT TACGCACGAT CTGGAAAGCT TCGCCTACTC CGACCTTGGC GGCACGCTGA AGGGGCCGTT CAGCGCCCAT CCGCATCTCG ATCCGCTGAC CGGCGAACTC CACGCCGTGA CCTATGACGG ACAGACGCTC GACACGGTCT GGCACGTCGT CGTCGACCGC GAGGGGCGCG TCCGGCGCGA AGAGCCGGTG CCGGTTGCGC ACGGCCCGTC GATCCACGAT TGCGCGATCA CCGCCAAGTA CGTCCTCATC CTCGACCTGC CGGTCACCTT CTCGATGGCC GCGCTCGTCG GCGGGGCGCG CTTTCCCTAT CGCTGGAACC CGGCGCACCG CGCCCGCGTC GGCCTGCTCC CGCGCGAAGG GACGGCGGCG GACGTGATCT GGTGCGACGT CGACGCGGCC TATGTCTTCC ACGTCGCCAA TGCCTTCGAC AATCCCGATG GCACGGTCAC GGTCGACCTG GCCGCTTACG AGACGATGTT CGCCCATGGC CCCGACGGGC CCAACGGCAA GTCCCTGGGT ATGGAGCGCT GGACGGTCGA CCCCGCTGCC CGCAAGGTCG CGCGCAAGAC GCTCGACGCC GCCCCGCAGG AATTCCATCG CCCGGACGAA CGCTTCTTCG GCCAGCCCTA CCGCTTTGCC TGGTCGATGG GCCTGCCCGC CGAAAACGCC GAGGACTTCC TCGGCCACGC CCCGATCTAT GGCTACGACC TCGCGACCGG CCAGCGCAGC GCCCATGATT TCGGCCCCGG CAAGATCCCC GGCGAGTTCG TCTTCATCCC GCGCAGGGCC GATGCGGAAG AAGGCGACGG GTGGCTGATG GGCTACGTCA TCGACCTCGC CTCGGAAACC ACCGACCTTG CGATCCTCGA TGCGCGCAAC CTCGCCGCCC CGCCCCTCGC CCTGATCCAC ATCCCGTGCC GCATTCCCCC CGGCTTCCAC GGCAACTGGC TCCCCGACGC GGCGGACTGA
|
Protein sequence | MVTKGVVVVS SFRRSRQDAN RPHAFLTGIH APVKEERTIE DLAVTGTIPA ELSGRYVRIG PNPFRADPRG HHWFVGDGMV HGVCMKGGKA LWYRNRYVRS RNLQDAGGPA AAPGPRRSTF DTVNTNVIQH AGRTFALVEA GSFPVELTHD LESFAYSDLG GTLKGPFSAH PHLDPLTGEL HAVTYDGQTL DTVWHVVVDR EGRVRREEPV PVAHGPSIHD CAITAKYVLI LDLPVTFSMA ALVGGARFPY RWNPAHRARV GLLPREGTAA DVIWCDVDAA YVFHVANAFD NPDGTVTVDL AAYETMFAHG PDGPNGKSLG MERWTVDPAA RKVARKTLDA APQEFHRPDE RFFGQPYRFA WSMGLPAENA EDFLGHAPIY GYDLATGQRS AHDFGPGKIP GEFVFIPRRA DAEEGDGWLM GYVIDLASET TDLAILDARN LAAPPLALIH IPCRIPPGFH GNWLPDAAD
|
| |