Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0802 |
Symbol | |
ID | 3915856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 852328 |
End bp | 853812 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640443533 |
Product | carotenoid oxygenase |
Protein accession | YP_496081 |
Protein GI | 87198824 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.957651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAAT TTCCGAACAC CCCCAGCTTC ACGGGATTCA ACACGCCGTC GCGGATCGAG GCGGATATCG CCGATCTGGC CCACGAAGGC ACGATTCCGC AAGGGTTAAA CGGCGCATTC TACCGCGTCC AGCCCGACCC GCAGTTTCCT CCCCGCCTCG ACGACGACAT CGCCTTCAAC GGCGACGGCA TGATCACCCG CTTCCACATC CACGACGGCC AGGTCGACTT CCGCCAGCGC TGGGCGAAGA CCGACAAGTG GAAGCTGGAG AACGCCGCCG GAAAGGCCCT GTTCGGCGCC TACCGCAACC CGCTGACCGA CGACGAGGCG GTCAAGGGCG AGATCCGTTC GACCGCCAAC ACCAACGCCT TCGTGTTCGG CGGCAAGCTG TGGGCGATGA AGGAGGACAG TCCCGCCCTC GTCATGGACC CGGCGACGAT GGAAACCTTC GGGTTCGAGA AGTTCGGCGG CAAGATGACC GGCCAGACCT TTACCGCCCA CCCCAAGGTC GATCCGAAGA CCGGCAACAT GGTCGCCATC GGCTATGCCG CAAGCGGGCT GTGCACCGAC GATGTGACCT ACATGGAAGT GAGCCCGGAG GGCGAGCTTG TCCGCGAAGT GTGGTTCAAG GTGCCGTACT ACTGCATGAT GCACGACTTC GGCATCACCG AGGATTACCT CGTGCTGCAC ATCGTGCCTT CCATCGGAAG CTGGGAAAGG CTGGAACAGG GCAAGCCGCA CTTCGGCTTC GACACGACCA TGCCGGTGCA CCTCGGCATC ATCCCGCGCC GCGACGGCGT GCGCCAGGAA GACATCCGCT GGTTCACGCG GGACAACTGC TTTGCCAGCC ATGTCCTGAA CGCCTGGCAA GAGGGGACCA AGATCCACTT CGTGACCTGC GAGGCGAAGA ACAACATGTT CCCGTTCTTC CCCGACGTCC ACGGCGCGCC CTTCAACGGC ATGGAGGCCA TGAGCCATCC GACCGACTGG GTGGTCGACA TGGCCAGCAA CGGCGAGGAC TTTGCCGGGA TCGTGAAGCT TTCCGACACA GCCGCCGAGT TCCCGCGCAT CGACGACCGC TTTACCGGCC AGAAGACCCG CCATGGCTGG TTCCTCGAAA TGGACATGAA GCGCCCGGTG GAATTGCGCG GCGGCAGCGC CGGCGGCCTG CTGATGAACT GCCTGTTCCA CAAGGACTTC GAAACGGGTC GCGAGCAGCA CTGGTGGTGC GGCCCGGTGT CGAGCCTTCA GGAGCCGTGC TTCGTGCCGC GCGCCAAGGA TGCCCCCGAA GGCGACGGCT GGATCGTGCA GGTTTGCAAC CGGCTGGAAG AGCAGCGCAG CGACTTGCTG ATCTTCGACG CGCTCGACAT CGAGAAAGGC CCGGTGGCCA CGGTCAACAT CCCCATCCGC CTGCGCTTCG GCCTTCACGG CAACTGGGCG AATGCCGACG AAATCGGCCT TGCCGAGAAG GTCCTGGCCG CATGA
|
Protein sequence | MAQFPNTPSF TGFNTPSRIE ADIADLAHEG TIPQGLNGAF YRVQPDPQFP PRLDDDIAFN GDGMITRFHI HDGQVDFRQR WAKTDKWKLE NAAGKALFGA YRNPLTDDEA VKGEIRSTAN TNAFVFGGKL WAMKEDSPAL VMDPATMETF GFEKFGGKMT GQTFTAHPKV DPKTGNMVAI GYAASGLCTD DVTYMEVSPE GELVREVWFK VPYYCMMHDF GITEDYLVLH IVPSIGSWER LEQGKPHFGF DTTMPVHLGI IPRRDGVRQE DIRWFTRDNC FASHVLNAWQ EGTKIHFVTC EAKNNMFPFF PDVHGAPFNG MEAMSHPTDW VVDMASNGED FAGIVKLSDT AAEFPRIDDR FTGQKTRHGW FLEMDMKRPV ELRGGSAGGL LMNCLFHKDF ETGREQHWWC GPVSSLQEPC FVPRAKDAPE GDGWIVQVCN RLEEQRSDLL IFDALDIEKG PVATVNIPIR LRFGLHGNWA NADEIGLAEK VLAA
|
| |