Gene Saro_3798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3798 
Symbol 
ID5077946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp445320 
End bp449651 
Gene Length4332 bp 
Protein Length1443 aa 
Translation table11 
GC content66% 
IMG OID640481521 
Productcytochrome P450-like protein 
Protein accessionYP_001166183 
Protein GI146276023 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGAAC CGAAATACCT GCGCATGCAG TGGCTTGGCG CACGCCGCAA GAAAGAGCGC 
GGGTCGATGG ACTTCTCGCT GGATTTCATG CGGCCGAAGG GCGTGGCGGG GCAGGTGGCA
GCGGCGCTTT TCCGACGGCT TCTGGTGCCG GTGCTCTTCG TGCTCCAGTG CTTCTGGCCG
GTGCTACGCT GGCGTCGCTT CCTGCTGGTC ACGCGCGCCG ATGACGTGGC TGCGATCCTG
GGTGATCCTG AAGGTTTTCC GGTACCGTTC GGCCCTGAAA TGCAGAGCCT TGGCGCAGGG
GCAACCTTCC TTCTTGGTCT CGAGGGTCCG GATCACGACC GGCAGCGCCG GATCATCACC
AGCGTCGTCA AGCGAAGCGA CCTTGCCGGG CTGGAGGCGC AGGCGGACAG CTTTGCCAAG
GCGCTGCTCG AATCCTCCAA GGGCCGGATC GACGCCCAGC GCGACCTCGT CCAGCGCGTG
GCTGCCGAGA CCTGTGCCCG CTATTTTGGC CTGCCCATCG ACGATCCGGA CCTGTATGCC
GAATGGACGA TCGCCTGCTC GCAGCTTCTG TTTGCCGATC CGCTCGGCGA TCCGGTCGCG
CGAGACATGG CCGAAGCGGC TGCGGCACGC ATCGGCGCAC TGATCGACCG GACAGTAGCG
GCAACGCAAT CCGGTGGCCA TGAACATGCC GCGCCGCCCG AGACGCTCGT CGCGCGGCTT
GTCGAATTGC AGGCCGCAGG GGGAGCGGAC GCACCGACGA ATGAAGAGAT CCGGGCCATC
CTTCTCGGCC TGTCTGTCGG CTTCGTGCCG ACCAACAGCA TGGCGGGCGG CAAGATTCTC
GATCTACTTT CGCGCAAGGG CGAAGCGCGG CGCGAGGCCA TCGAGGCCGC GCGCGCCGGC
GATGCGCAGC ACTTCGAGCA GGTGCTGCGC GAAGCGATGA GGCTGGCGCC GGCCATCGCC
CCCGGCCAGT GGCGCTGGAC GCGGCAGGAT ACGCAGTGGA TCCGTCCGGG CGGTCGAACC
TTTCGCGTCA GGCGCAACAC GCTGGTCATG GTCGCCACGC AGATCGCTCT GCGCGACCCT
CGCAAGGTCG TTCGGCCGTG GCGTTTCGAG TATGACCGTA CCGATACGCC GCCCCTCGTC
TTCGGCAATC ATGCGCATTC CTGCATCGGC GAACATATCG CGATTGCCCA GTTGCGCGGC
AGCCTGATGC CGTTGCTGGC GCAGGATGGC ATCGAGGATT CCCTGAACAG GCTTCGCGTG
AAATGGCTGG GGGCGTTTCC CGAGCACGCC TGGCTGCAAT TCGACCACGA CGAAGGTGCC
CAAAAGGGAC AGTTTATCGT GATCGAGGCC AAGGCCGGCA CCACGGCCGA GCTGAATGCC
CGGATTGCGG AAATCGACGC GCGTCTCGAA CGGTTGCGAC CGCGGCTCGA TCAGGCGGGG
TTGCTTCACT TCTGCTCGAT GACGGCCATC GACGCCCCCA CCACGCGGCC GCCAGAGGAA
CGCGGCGACG GGCGCACCAC CGATACGCTG CTGGTGATCG AGGTGAATGG CGATGGCGAA
GGGACGCGCG CCATCGCTGC CTTCGTTCGT GCAATGAATG CCGAGCTTCG GGCGCTGCTG
GCAGCAGCCG GCGTGGAGCC GGGACTGGAC CTGGTAACTT ATCTGCTCGA CAAGCGGCTC
GACCTGACCA GCGCCCCATG GGGCGCGACG GCGCTGCAGT TCTATGGTCA CCAAGGTCTG
AGCGCGCGCG ACATTGCGGC CCAGGATGAG TTGGCCGGCA TTGCCAGCGA GGCGCTTCAG
GCGACCTTGC AGGCCAATAT CGGCCTCGTG GCCAGCGCGT CTGCCACCTT GCGCGAGGTC
CGGCGACGCA TCGTGGCCCG GCCGGACGGC GCCCGCTGGG CGGCGTACAT GCTCAAGCCG
TCGCAGGCCA GGATGGACCT CAGCTGGTGG GACGATCCGC TTGATGGCGG TTTCGTGCTG
CGCCTGCTGC GCACCCGTGC GATGAAGCGG GTGGGGCTTG TCCTTGTCGC GGTGGCGCTG
GCATCGGCAT GGGCGATCGG TCGGAGCGAA GGCTTCACAT GGGGTGGCCT TGTCAGTCTG
CCGCACCTTC TATGGCTCGG CATTGCCGGC GTCATGGTCT CGCTTTCGGG CGGTGCGCTG
GTGGCGGGCG GGTTCGTTGC GATGCTTCGC CTCAAGGAAA GCCGTGATGT GCCCGATGGC
GCAGCGCCCT CGCTCGACCA TCTGCGCGAA TGCGCGGCGC ACGAGGACAA GCCCGACCAC
GTCCACAACC ACATCCTGGT CGTCACGCCG TTGAAGAAAG GATTGATCCG CCGACTGGCG
CTTGCACTGG CGCTGTGGGG GATCGCGATG ATCGTGACCT TCCGCTTCCG GCCCGGCTTC
GTGCTCAACA TGGGCACGAT CCACTTCGCC AGGTGGGTGA GATTGCCCGG CCGCGACACC
ATGGTCTTCC AGTCGAACTA CGACGGTAGC TGGGAAAGTT ACCTCGAAGA CTTCATTACC
CGAGCGCACT GGGGGCAGAG CGCGGCCTGG TCGAACGGGG AGGGCTTCCC GCGAACGCGC
TTCCTGATCT CCGGCGGCGC CGAGGACGGC GACAATTTCA AGCGCTATGT CCGGCGCAAG
CAGGTGCTGA CCCGGTTCTG GTACGCCCGC TTTCGCAACC TGACGAGTGC GGCGATGCGG
CGCAATGCGC TGATCCATGA CGGGCTGGCG CGGGCGCGCA CGGAAAGCGA GGCGCGCAGT
TGGCTGGCCC TCTTCGGGTC GGGACAGCTT ACCCGCGACC GGCTCGAAAG CGACGAAATC
CAGTCGCTGG TCTTTACCGG CTTCGGCAAG CTGCGGCATT CGACTGCGCT TCTCATCCGG
TTCGGCGAGC AGGCCGATGA CAATCGCAAG TGGTTGCGCA ACCTGACCGG CTTCCGCAAC
GCAACGCCCG ACAAGTCGCT CCTGTTCGAG ATGGACGAAG TTCGCGAACG CCGCCTGGGC
CGCGTCACCT TTGGCGAGGC GGAAAGGTTC GGTACAGCAG TGAGCCTCGG CCTGAGCGCC
GAAGGCCTTC GCGTGCTGGA GATGCCCGCA TCCCATCCCG AGCGCGGGTT CAACGCGCTG
CCCGGGCCGT TCGCCTTCGG CATGACCAAC CGCGCACGAA GGCTGGGCGA TGATGGGGAG
GAAGCTCCGC AGAACTGGCG CTGGCGCGAC GCCGAGGATG GCGCCGTCCA CGCGATACTG
CTCCTCTACG CTGCCGACGA GGAAGGGCTG GCGCGGCTCG CGGCACTGCA CCGGCGCTTT
GCGGAACGGG CAGGGGTCAG CGTGGTCGAT GAAGTGCCCA CCACGGCGCT TCCTCAGAAC
GGCCACGCCT ACGACCACTT CGGCTTCCGC GACGGGATCG TGCAACCGGT GATCGCCGGG
ACGGCCAAGG CCGCGTTGAA CCGCGTGCCC AGCGATATCA TCCCGCCCGG CGAAATGGTG
CTCGGCTACG CCAACGCGCA GGGCTATCTT ACGCCCGGTA TTCCGGTGGA CAAGGCTGAC
GACCCCTTCG ACCGCCTGCC GGAAATGCCG CGCGAGCCAC AGCGCTATCC GCGCTTCGGC
GGCGATGAAG GTGCGCGCGG GCCCCGCGAT TTCGGGCGTA ACGGGGCATT CCTCGCCGTC
CGGCAGCTTG AACAGCATGT CGTCCGCTTC AGTGAGGCAA TGGAAGCGGC CGCCAATCAG
ATACACGACA ACTATCCCGA ACTGCCCTCG CTTCTGGGGC ATGATGTCGA TGCGAACTGG
GTGGCGGCAC GTCTGGTCGG GCGCTGGAAG AACGGCGCGC CGCTCATGCG CAATCCGCTT
GAACCGGACA GGAAGAGCCC CGAACTGCCC ATGCTGTTCG GCGCCGACGA TCCTTCCGGC
CTCCAGTGCC CGCTTGGCGC ACACGTGCGC CGCGCCAATC CGCGCGACAG CTTCGAGCCG
GGCGATCCGA CCGAACTGGG GATCGTCAAC CGTCACCGCC TTGTCCGGCG CGGCCGCAGC
TATGAAAGGC CTGCCAGTGA TGGCGGCGCG CCCGAACAGG GCCTGCTGTT CATGGCGGTT
TGCGAAGATC TGGAGAGGCA GTTCGAATTC GTGCAGCGTA GCTGGCTGGA TTCTCCGGCG
TTCCACGATC TGGACCGCGA AAACGACCCG ATTGTCGGGC GGTGTCCTGC CGGACACAAG
CGCAGCTTCA GGATACCGAC GTCGAGCGGA CCTATCCAGG TCGATGGGCT GCCCCAGTTC
GTGACCCTGC GCGCGGGCGG CTACTTCTTC CTGCCGAGCC GCAGCGCCAT GCTCTGGCTG
GCGCGGGTCT GA
 
Protein sequence
MFEPKYLRMQ WLGARRKKER GSMDFSLDFM RPKGVAGQVA AALFRRLLVP VLFVLQCFWP 
VLRWRRFLLV TRADDVAAIL GDPEGFPVPF GPEMQSLGAG ATFLLGLEGP DHDRQRRIIT
SVVKRSDLAG LEAQADSFAK ALLESSKGRI DAQRDLVQRV AAETCARYFG LPIDDPDLYA
EWTIACSQLL FADPLGDPVA RDMAEAAAAR IGALIDRTVA ATQSGGHEHA APPETLVARL
VELQAAGGAD APTNEEIRAI LLGLSVGFVP TNSMAGGKIL DLLSRKGEAR REAIEAARAG
DAQHFEQVLR EAMRLAPAIA PGQWRWTRQD TQWIRPGGRT FRVRRNTLVM VATQIALRDP
RKVVRPWRFE YDRTDTPPLV FGNHAHSCIG EHIAIAQLRG SLMPLLAQDG IEDSLNRLRV
KWLGAFPEHA WLQFDHDEGA QKGQFIVIEA KAGTTAELNA RIAEIDARLE RLRPRLDQAG
LLHFCSMTAI DAPTTRPPEE RGDGRTTDTL LVIEVNGDGE GTRAIAAFVR AMNAELRALL
AAAGVEPGLD LVTYLLDKRL DLTSAPWGAT ALQFYGHQGL SARDIAAQDE LAGIASEALQ
ATLQANIGLV ASASATLREV RRRIVARPDG ARWAAYMLKP SQARMDLSWW DDPLDGGFVL
RLLRTRAMKR VGLVLVAVAL ASAWAIGRSE GFTWGGLVSL PHLLWLGIAG VMVSLSGGAL
VAGGFVAMLR LKESRDVPDG AAPSLDHLRE CAAHEDKPDH VHNHILVVTP LKKGLIRRLA
LALALWGIAM IVTFRFRPGF VLNMGTIHFA RWVRLPGRDT MVFQSNYDGS WESYLEDFIT
RAHWGQSAAW SNGEGFPRTR FLISGGAEDG DNFKRYVRRK QVLTRFWYAR FRNLTSAAMR
RNALIHDGLA RARTESEARS WLALFGSGQL TRDRLESDEI QSLVFTGFGK LRHSTALLIR
FGEQADDNRK WLRNLTGFRN ATPDKSLLFE MDEVRERRLG RVTFGEAERF GTAVSLGLSA
EGLRVLEMPA SHPERGFNAL PGPFAFGMTN RARRLGDDGE EAPQNWRWRD AEDGAVHAIL
LLYAADEEGL ARLAALHRRF AERAGVSVVD EVPTTALPQN GHAYDHFGFR DGIVQPVIAG
TAKAALNRVP SDIIPPGEMV LGYANAQGYL TPGIPVDKAD DPFDRLPEMP REPQRYPRFG
GDEGARGPRD FGRNGAFLAV RQLEQHVVRF SEAMEAAANQ IHDNYPELPS LLGHDVDANW
VAARLVGRWK NGAPLMRNPL EPDRKSPELP MLFGADDPSG LQCPLGAHVR RANPRDSFEP
GDPTELGIVN RHRLVRRGRS YERPASDGGA PEQGLLFMAV CEDLERQFEF VQRSWLDSPA
FHDLDRENDP IVGRCPAGHK RSFRIPTSSG PIQVDGLPQF VTLRAGGYFF LPSRSAMLWL
ARV