Gene Saro_3659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3659 
Symbol 
ID5077807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp288676 
End bp290097 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content64% 
IMG OID640481382 
Productcytochrome P450 
Protein accessionYP_001166044 
Protein GI146275884 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.539017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAG CTGCGACTGC GGCCGGTAAT GGCCTTCCCT TGCTCGATGG AGGCGTGCCG 
CTCCTCGGGC ATCTCGCACA GTTCTTCCGC GATCCGGTTT CGGTACTCAA GCGCGGATAC
CGCTCGAAGG GGCGGCTCTT CGCGATGAAC TTCATGGGCC AGCGCATGAA CGTGATGCTG
GGTCCGGAAC ACAACCGCTT CTTCTTCGAG GAGACGGACA AGCTGCTCTC GATCCGGGAG
TCGATGCCGT TCTTCCTCAA GATGTTCTCG CCCGAGTTCT ATTCGTTCGC GGAAATGGAC
GAGTACCTGC GCCAGCGCTC GATCATCATG CCCCGCTTCA AGGCGGCATC GATGAAGCAG
TACGTGCCGG TCATGGTCGA GGAATCGCTT AACCTGGTCG AGCGGCTGGG CGAGGAAGGC
GAGTTCGACC TGATCCCGAC GCTGGGCCCG GTGGTAATGG ACATCGCCGC GCACAGCTTC
ATGGGACGCG AGTTCCACGA GAAGCTGGGG CATGAGTTCT TCGAACTCTT CCGCGATTTT
TCGGGAGGCA TGGAATTCGT CCTGCCGCTG TGGCTGCCGA CACCCAAGAT GGTCAAGTCA
CAGCGCGCGA AGAGGAAGCT CCACGCCATC CTGCAATCGT GGATCGACAA GCGCCGCGCC
GCCCCGCTCG ATCCGCCCGA TTTCTTCCAG ACGATGATCG AGACGAAGTA TCCCGATGGC
CGCCCGGTGC CCGACGAGAT CATCCGCCAC CTGATCCTCC TTCTCGTCTG GGCAGGGCAC
GAGACGACCG CCGGGCAGGT GAGCTGGGCG CTGGCGGACC TCCTTCAGAA CCCGGACTAC
CAGAAGGTGC TGCGCGGCGA GATATCGTCG CTGCTGGGCG GCAGCGACGG GCGCGACCTT
GGCTGGGAAC AGGCCGTGGC GATGGAGAAG ATGGACCTTG CCCTGCGCGA GACCGAGCGG
CTCCATCCGG TCGCCTACAT GCTCAGCCGC AAGGCGCGGG CCGATATCGA GCGCGACGGC
TATGTCATCC GCAAGGGCGA GTTCGTGCTG CTTGCGCCTT CGGTCAGCCA CCGCATGGAA
GAGACGTTCC GCAATCCCGA TGCCTATGAC CCGGAACGCT TCAACCCGGC CAACCCCGAT
GCGCAGATCG AAAGCAATTC GTTGATCGGC TTTGGCGGGG GTGTCCACCG CTGCGCGGGC
GTGAACTTCG CGCGGATGGA GATGAAGGTG CTGGTGGCGA TCCTGCTCCA GAACTTCGAC
ATGGAGCTGA TGGACGAAGT GCGGCCCATC GCGGGCGCAT CGACCTACTG GCCCGCCCAG
CCCTGCCGGG TGCGCTATCG GCGGCGCAAG CTCGACGGGT CGGAGGCAGG TGCGGACATG
GCGGCGCTGG CCCGAGCCGC CGGCTGCCCG GCGCATACGT GA
 
Protein sequence
MARAATAAGN GLPLLDGGVP LLGHLAQFFR DPVSVLKRGY RSKGRLFAMN FMGQRMNVML 
GPEHNRFFFE ETDKLLSIRE SMPFFLKMFS PEFYSFAEMD EYLRQRSIIM PRFKAASMKQ
YVPVMVEESL NLVERLGEEG EFDLIPTLGP VVMDIAAHSF MGREFHEKLG HEFFELFRDF
SGGMEFVLPL WLPTPKMVKS QRAKRKLHAI LQSWIDKRRA APLDPPDFFQ TMIETKYPDG
RPVPDEIIRH LILLLVWAGH ETTAGQVSWA LADLLQNPDY QKVLRGEISS LLGGSDGRDL
GWEQAVAMEK MDLALRETER LHPVAYMLSR KARADIERDG YVIRKGEFVL LAPSVSHRME
ETFRNPDAYD PERFNPANPD AQIESNSLIG FGGGVHRCAG VNFARMEMKV LVAILLQNFD
MELMDEVRPI AGASTYWPAQ PCRVRYRRRK LDGSEAGADM AALARAAGCP AHT