Gene Saro_2809 details

Gene Information

Locus tag: Saro_2809
Symbol:
ID: 3916969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3033028
End bp: 3034521
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 66%
IMG OID: 640445588
Product: carotenoid oxygenase
Protein accession: YP_498079
Protein GI: 87200822
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
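
The coordinate and length fields in this record are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 3033028
end_bp = 3034521


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a CDS of n_bp bases, excluding one stop codon (assumed)."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1494 497
```

Both results agree with the Gene Length and Protein Length fields reported in the record.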


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.220593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGCTT TTCCGCAGAC GATCTATTTC ACCGGCGCGA ATGCGCCGGT CGGCGAGGAA 
CCGGACCTCC GCGGCCTGAA GGTGGAAGGC GATCTGCCCG CCGAAGTGCG CGGCAGCTTC
TATCGCGCCA TCCCCGATCC CGCGTTTCCG CCCAGGTTCG AGAACGACCA CACCCTGTCC
GGCGACGGCA TGGTCAGCCG CCTTTCGTTC AACGGCGACG GCACGGCGGA CTTCATCCAG
AAGTACGTCG AGACGGCGCG CTACAAGGCG GAGAAGGCTG CGGGCAAGGC GCTGTTCGGC
AAGTACCGCA ATCCGTTCAC CGACGATCCG GAAGTGCAAG GCGTGGACCG CACCGTGGCG
AACACCACGC CGGTCTGGCA CGCGGGCCGG ATGCTGATGG CCAAGGAAGA CGGTCGGCCC
TATCGCGTCG ACCCCCGGAC CCTGGCGACG ATCGGCTCCT ACGATTTCGG CGGCGCGCTC
AAGAGCGAGA CGATGACCGC GCACGTACGG ATCGACGCTG GGACGGGCGA GCTGTTCTTC
TATGGCTACG AGGCGGACGG CCAGGCTTCG ACCAAGGTGG CGTACTGCAT TGTCGGCCCG
GATGGCGAAC TGAAGCGCGA GCAGTGGTTC GATGCGCCCT ATTGCGCGAT GATGCACGAT
TTCACGATCA GCGAGAACTA TGCGCTGTTC CCGATCTACC CGACCACGGC GGACCTCGAC
CGGCTGAAGG CGGGCGGGGA GCATTGGCAC CACCAGCCGG AGCTGGACTC GTGGCTGGGC
GTGATGCCGC GCTATGGCGA TGTTTCGGAG ATCAAGTGGT TCAAGGGCCC CAAGGGGTGC
CATTCGTACC ACATGATGAA TGCGTGGGAG GATGCCGACG GCATGCTCCA CTTCGACGCC
TGCCTCAACA ATACCAACGC CTTCGCCTTC ATCCGCGAAC CGTCGGGCAT CCACATGGGG
CCGCAGGATA TCAAGGGCGC GCTGACACGC TGGACTGTCG ATCCCCGGGC CGATGGCGGC
GACGTGGTGG AGACTGTCAT CGGGCCTCCG GGCGATTTCC CGGTGATCCC GGCGAAGTTG
CAGGGGCGCC CGTACAAGAC CGGCTGGATG CTGAGCATGA ATCCCGAACT TCAGGGGCCG
CCGCTCTTCG CCGGGCCGGT CGGGGTTAGC TTCAACCTGC TGCTGCGACT GGACGGGATG
GACACGCCCG CGCCGCAGGT CACGGGCGCG CTGGCGCTGC CGCCGATGGC GGGTTTCAAC
GAGCCGGTGC ATGTGCCTGC CGCCGATCCC GCGAAGGACG GCTGGCTGGT ATTCCTTGTC
GACCAGCAGG TTGGCGACAA TCAGTTCGTG CACGAAGCCT GGGTTGTCGA TGCGGGGAAC
ATCGGCGCGG GCGCTGTGGC CAAGGTGCAC ATCCCGACGC GGCTGCGACC CCAGGTCCAC
GGCTGGTGGG TGCCCCAGGC GCAACTGGAC GCGCTTGAAG GCTCCGCAGC GTGA
 
Protein sequence
MGAFPQTIYF TGANAPVGEE PDLRGLKVEG DLPAEVRGSF YRAIPDPAFP PRFENDHTLS 
GDGMVSRLSF NGDGTADFIQ KYVETARYKA EKAAGKALFG KYRNPFTDDP EVQGVDRTVA
NTTPVWHAGR MLMAKEDGRP YRVDPRTLAT IGSYDFGGAL KSETMTAHVR IDAGTGELFF
YGYEADGQAS TKVAYCIVGP DGELKREQWF DAPYCAMMHD FTISENYALF PIYPTTADLD
RLKAGGEHWH HQPELDSWLG VMPRYGDVSE IKWFKGPKGC HSYHMMNAWE DADGMLHFDA
CLNNTNAFAF IREPSGIHMG PQDIKGALTR WTVDPRADGG DVVETVIGPP GDFPVIPAKL
QGRPYKTGWM LSMNPELQGP PLFAGPVGVS FNLLLRLDGM DTPAPQVTGA LALPPMAGFN
EPVHVPAADP AKDGWLVFLV DQQVGDNQFV HEAWVVDAGN IGAGAVAKVH IPTRLRPQVH
GWWVPQAQLD ALEGSAA
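
The gene and protein sequences are printed in space-separated blocks of ten characters. As a hedged sketch of how the two relate, the snippet below strips that layout, translates codons with the amino-acid assignments of the standard genetic code (which translation table 11 shares), and computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The sketch does not handle ambiguous bases, and it does not apply the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        aa = AMINO[index]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGCGCTT TTCCGCAGAC GATCTATTTC"
print(translate(first_blocks))  # MGAFPQTIYF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.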