Gene Saro_0802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0802 
Symbol 
ID3915856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp852328 
End bp853812 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID640443533 
Productcarotenoid oxygenase 
Protein accessionYP_496081 
Protein GI87198824 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.957651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCAAT TTCCGAACAC CCCCAGCTTC ACGGGATTCA ACACGCCGTC GCGGATCGAG 
GCGGATATCG CCGATCTGGC CCACGAAGGC ACGATTCCGC AAGGGTTAAA CGGCGCATTC
TACCGCGTCC AGCCCGACCC GCAGTTTCCT CCCCGCCTCG ACGACGACAT CGCCTTCAAC
GGCGACGGCA TGATCACCCG CTTCCACATC CACGACGGCC AGGTCGACTT CCGCCAGCGC
TGGGCGAAGA CCGACAAGTG GAAGCTGGAG AACGCCGCCG GAAAGGCCCT GTTCGGCGCC
TACCGCAACC CGCTGACCGA CGACGAGGCG GTCAAGGGCG AGATCCGTTC GACCGCCAAC
ACCAACGCCT TCGTGTTCGG CGGCAAGCTG TGGGCGATGA AGGAGGACAG TCCCGCCCTC
GTCATGGACC CGGCGACGAT GGAAACCTTC GGGTTCGAGA AGTTCGGCGG CAAGATGACC
GGCCAGACCT TTACCGCCCA CCCCAAGGTC GATCCGAAGA CCGGCAACAT GGTCGCCATC
GGCTATGCCG CAAGCGGGCT GTGCACCGAC GATGTGACCT ACATGGAAGT GAGCCCGGAG
GGCGAGCTTG TCCGCGAAGT GTGGTTCAAG GTGCCGTACT ACTGCATGAT GCACGACTTC
GGCATCACCG AGGATTACCT CGTGCTGCAC ATCGTGCCTT CCATCGGAAG CTGGGAAAGG
CTGGAACAGG GCAAGCCGCA CTTCGGCTTC GACACGACCA TGCCGGTGCA CCTCGGCATC
ATCCCGCGCC GCGACGGCGT GCGCCAGGAA GACATCCGCT GGTTCACGCG GGACAACTGC
TTTGCCAGCC ATGTCCTGAA CGCCTGGCAA GAGGGGACCA AGATCCACTT CGTGACCTGC
GAGGCGAAGA ACAACATGTT CCCGTTCTTC CCCGACGTCC ACGGCGCGCC CTTCAACGGC
ATGGAGGCCA TGAGCCATCC GACCGACTGG GTGGTCGACA TGGCCAGCAA CGGCGAGGAC
TTTGCCGGGA TCGTGAAGCT TTCCGACACA GCCGCCGAGT TCCCGCGCAT CGACGACCGC
TTTACCGGCC AGAAGACCCG CCATGGCTGG TTCCTCGAAA TGGACATGAA GCGCCCGGTG
GAATTGCGCG GCGGCAGCGC CGGCGGCCTG CTGATGAACT GCCTGTTCCA CAAGGACTTC
GAAACGGGTC GCGAGCAGCA CTGGTGGTGC GGCCCGGTGT CGAGCCTTCA GGAGCCGTGC
TTCGTGCCGC GCGCCAAGGA TGCCCCCGAA GGCGACGGCT GGATCGTGCA GGTTTGCAAC
CGGCTGGAAG AGCAGCGCAG CGACTTGCTG ATCTTCGACG CGCTCGACAT CGAGAAAGGC
CCGGTGGCCA CGGTCAACAT CCCCATCCGC CTGCGCTTCG GCCTTCACGG CAACTGGGCG
AATGCCGACG AAATCGGCCT TGCCGAGAAG GTCCTGGCCG CATGA
 
Protein sequence
MAQFPNTPSF TGFNTPSRIE ADIADLAHEG TIPQGLNGAF YRVQPDPQFP PRLDDDIAFN 
GDGMITRFHI HDGQVDFRQR WAKTDKWKLE NAAGKALFGA YRNPLTDDEA VKGEIRSTAN
TNAFVFGGKL WAMKEDSPAL VMDPATMETF GFEKFGGKMT GQTFTAHPKV DPKTGNMVAI
GYAASGLCTD DVTYMEVSPE GELVREVWFK VPYYCMMHDF GITEDYLVLH IVPSIGSWER
LEQGKPHFGF DTTMPVHLGI IPRRDGVRQE DIRWFTRDNC FASHVLNAWQ EGTKIHFVTC
EAKNNMFPFF PDVHGAPFNG MEAMSHPTDW VVDMASNGED FAGIVKLSDT AAEFPRIDDR
FTGQKTRHGW FLEMDMKRPV ELRGGSAGGL LMNCLFHKDF ETGREQHWWC GPVSSLQEPC
FVPRAKDAPE GDGWIVQVCN RLEEQRSDLL IFDALDIEKG PVATVNIPIR LRFGLHGNWA
NADEIGLAEK VLAA