Gene Saro_3580 details


Gene Information

Locus tag: Saro_3580
Symbol:
ID: 5077729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 198148
End bp: 199557
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 69%
IMG OID: 640481304
Product: carotenoid oxygenase
Protein accession: YP_001165966
Protein GI: 146275806
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
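
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon; both are assumptions that happen to match the listed values, not statements taken from the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(198148, 199557)
print(length, protein_length_aa(length))  # 1410 469
```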


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGACCA AAGGCGTCGT CGTCGTCTCC AGCTTCCGCC GCAGCCGCCA GGACGCGAAC 
CGCCCCCACG CTTTCCTCAC CGGCATCCAC GCCCCGGTGA AGGAAGAACG CACGATCGAA
GATCTCGCCG TCACCGGCAC GATCCCGGCG GAGCTTTCGG GCCGCTACGT GCGCATCGGC
CCCAATCCCT TCCGCGCCGA TCCGCGCGGG CACCACTGGT TCGTGGGCGA CGGCATGGTC
CACGGCGTCT GCATGAAGGG CGGCAAGGCG CTGTGGTATC GCAACCGCTA TGTCCGCTCG
CGCAACCTCC AGGATGCCGG AGGCCCAGCC GCCGCGCCCG GTCCGCGTCG CTCCACCTTC
GACACGGTGA ACACCAACGT CATCCAGCAC GCCGGCCGCA CGTTCGCGCT GGTCGAAGCC
GGGTCCTTTC CCGTCGAACT TACGCACGAT CTGGAAAGCT TCGCCTACTC CGACCTTGGC
GGCACGCTGA AGGGGCCGTT CAGCGCCCAT CCGCATCTCG ATCCGCTGAC CGGCGAACTC
CACGCCGTGA CCTATGACGG ACAGACGCTC GACACGGTCT GGCACGTCGT CGTCGACCGC
GAGGGGCGCG TCCGGCGCGA AGAGCCGGTG CCGGTTGCGC ACGGCCCGTC GATCCACGAT
TGCGCGATCA CCGCCAAGTA CGTCCTCATC CTCGACCTGC CGGTCACCTT CTCGATGGCC
GCGCTCGTCG GCGGGGCGCG CTTTCCCTAT CGCTGGAACC CGGCGCACCG CGCCCGCGTC
GGCCTGCTCC CGCGCGAAGG GACGGCGGCG GACGTGATCT GGTGCGACGT CGACGCGGCC
TATGTCTTCC ACGTCGCCAA TGCCTTCGAC AATCCCGATG GCACGGTCAC GGTCGACCTG
GCCGCTTACG AGACGATGTT CGCCCATGGC CCCGACGGGC CCAACGGCAA GTCCCTGGGT
ATGGAGCGCT GGACGGTCGA CCCCGCTGCC CGCAAGGTCG CGCGCAAGAC GCTCGACGCC
GCCCCGCAGG AATTCCATCG CCCGGACGAA CGCTTCTTCG GCCAGCCCTA CCGCTTTGCC
TGGTCGATGG GCCTGCCCGC CGAAAACGCC GAGGACTTCC TCGGCCACGC CCCGATCTAT
GGCTACGACC TCGCGACCGG CCAGCGCAGC GCCCATGATT TCGGCCCCGG CAAGATCCCC
GGCGAGTTCG TCTTCATCCC GCGCAGGGCC GATGCGGAAG AAGGCGACGG GTGGCTGATG
GGCTACGTCA TCGACCTCGC CTCGGAAACC ACCGACCTTG CGATCCTCGA TGCGCGCAAC
CTCGCCGCCC CGCCCCTCGC CCTGATCCAC ATCCCGTGCC GCATTCCCCC CGGCTTCCAC
GGCAACTGGC TCCCCGACGC GGCGGACTGA
 
Protein sequence
MVTKGVVVVS SFRRSRQDAN RPHAFLTGIH APVKEERTIE DLAVTGTIPA ELSGRYVRIG 
PNPFRADPRG HHWFVGDGMV HGVCMKGGKA LWYRNRYVRS RNLQDAGGPA AAPGPRRSTF
DTVNTNVIQH AGRTFALVEA GSFPVELTHD LESFAYSDLG GTLKGPFSAH PHLDPLTGEL
HAVTYDGQTL DTVWHVVVDR EGRVRREEPV PVAHGPSIHD CAITAKYVLI LDLPVTFSMA
ALVGGARFPY RWNPAHRARV GLLPREGTAA DVIWCDVDAA YVFHVANAFD NPDGTVTVDL
AAYETMFAHG PDGPNGKSLG MERWTVDPAA RKVARKTLDA APQEFHRPDE RFFGQPYRFA
WSMGLPAENA EDFLGHAPIY GYDLATGQRS AHDFGPGKIP GEFVFIPRRA DAEEGDGWLM
GYVIDLASET TDLAILDARN LAAPPLALIH IPCRIPPGFH GNWLPDAAD
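
The protein sequence can be related back to the gene sequence with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares for all sense and stop codons; table 11 differs in which codons may act as start codons, and this sketch does not model that (the gene here begins with ATG, so it makes no difference for this record). The helper names `clean`, `gc_percent`, and `translate` are illustrative, not part of any database tool. The 10-base blocks shown above are joined by stripping whitespace.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last 30 bases of the gene sequence listed above.
head = "ATGGTGACCA AAGGCGTCGT CGTCGTCTCC"
tail = "GGCAACTGGC TCCCCGACGC GGCGGACTGA"
print(translate(head))  # MVTKGVVVVS
print(translate(tail))  # GNWLPDAAD
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the listed protein sequence and GC content to be compared against the nucleotide data.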