Gene Saro_1492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1492 
Symbol 
ID3916157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1534481 
End bp1536040 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content61% 
IMG OID640444234 
Productvanillyl-alcohol oxidase 
Protein accessionYP_496768 
Protein GI87199511 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGC TACTTGCCCA AGGCGTTTCC CCCGCGCAAT TTGCACAAGC GCTGGATGCG 
ATGCGCGCTG TCGTTGGCCC ACAGTGGCTT TTTGCCAGCG AAGAAGACAT CGCGGCCTAC
AGTGATCACT TCGCCTTCGA AGACGTGACT GCCAACATGG CATCGGCGAT CGTCGCCCCG
CTAGGCCTCG ACCAGATCAC CAGGATCATC GGCATTGCAC GCGATCACCG GATCCCGGTC
TGGGCGATTT CGACCGGTCG CAACCTTGCC TACGGCGGAT CCGCGCCGCG TAAGCACGGC
ACGCTGACGC TGGATCTCAA GCGCAACAAC CGCATCCTCG AGGTTAACGA AGAGTTGGCC
TACGCGGTGG TCGAGCCGGG GGTATCGTTC TTCCAACTGT ATCGCCACTT GCGCGAAACT
GGCTCCAAGC TCTGGATCGA CACGCCATCG CCAGGCTGGG GCGGTATTAT GGGCAATATG
CTCGAACGCG GGGTCGGCTA CACGCCCTAC GGCGACCGCT TCATGTGGCA ATGCGGGATG
CAGGTGGTGC TGGCCGACGG TACGGTTGTG GACACCGGCA TGGCCGCGCA GGAAGGTGCG
CCCGGCAATC ACACGTATCG TTATGGCGGC GGCCCGTGGA TCGACGGAAT TTTCACGCAG
TCCAATTTCG GCATCGTCAC CAAGGTCGGC ATCCAGTTGA TGCCTGAACC ACCGGGATAT
CGCCCGTTCC TTGTCACCTT TGCTGAAGAT GATGACATCG AGCCGGTCTC AGACCTGATC
CGCCCGCTCA AGATGACGCA CATCATCCCC AACGCGGCCG TCACCTGCAG CCTCAACCTG
GAAGCAGCCA CATCGCTGGA CCGGACGAAG TACCATTCAG ATTCTGGCCC CGTGCCAGAA
GCGGGCCGCC GCCGGATGAT GGAGGATCTG GGCGTCGGCA AATGGAACTT TTACGCCGCC
CTCTATGGCC CTGAACCGGT CATGGATGCC CATTGGGAAG TAATCCGCGA CAGCTTCTCT
TCGGTGAAGG GCGCGCGCTT CTTCACCGAA GAAGACCGCA AAAACGATGT CGTGTTCGGA
TATCGCACCC AGTTGATGCG CGGAGAACCG AACATGACCG AGTTCGGTAT CCTCAACTGG
ATGCCGAATG GTGCCCACCT CGGTTTTTCG CCTGTGGCCC CGGTCGACGG CAAAACCGCG
CTCGACCAGT ACCGCCTAGC CGAAGCAATC TGTAACCGGC ATGGCTTCGA CTATACCGGC
ATGTTCATCG TCGGCTTCCG CGCGATGCAC CACATCGTCG AACCGATCTT CTCGCGTAGC
GATGAGGATC AGCGCGGTCG AGTGGTCAGC ATGGTCACCG AACTGATCGA TGAGGCCGCC
AAGCGTGGCT ACGGCGAATA TCGCGGTCAC CTCAGCTTCA TGGATCAGAT CGCCGGTACT
TATGGCTGGG GCGACGACGC GCTCATGAAG CTTAGCCAAC GCATCAAGCG CGCACTGGAC
CCTTCGGGCA TCATGGCCCC CGGGAAGAGT GGCATCTGGT CGGATGGAGC GTCCTCATGA
 
Protein sequence
MSALLAQGVS PAQFAQALDA MRAVVGPQWL FASEEDIAAY SDHFAFEDVT ANMASAIVAP 
LGLDQITRII GIARDHRIPV WAISTGRNLA YGGSAPRKHG TLTLDLKRNN RILEVNEELA
YAVVEPGVSF FQLYRHLRET GSKLWIDTPS PGWGGIMGNM LERGVGYTPY GDRFMWQCGM
QVVLADGTVV DTGMAAQEGA PGNHTYRYGG GPWIDGIFTQ SNFGIVTKVG IQLMPEPPGY
RPFLVTFAED DDIEPVSDLI RPLKMTHIIP NAAVTCSLNL EAATSLDRTK YHSDSGPVPE
AGRRRMMEDL GVGKWNFYAA LYGPEPVMDA HWEVIRDSFS SVKGARFFTE EDRKNDVVFG
YRTQLMRGEP NMTEFGILNW MPNGAHLGFS PVAPVDGKTA LDQYRLAEAI CNRHGFDYTG
MFIVGFRAMH HIVEPIFSRS DEDQRGRVVS MVTELIDEAA KRGYGEYRGH LSFMDQIAGT
YGWGDDALMK LSQRIKRALD PSGIMAPGKS GIWSDGASS