Gene Saro_3811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3811 
Symbol 
ID5077959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp465785 
End bp467293 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content67% 
IMG OID640481534 
Productaldehyde dehydrogenase (acceptor) 
Protein accessionYP_001166196 
Protein GI146276036 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.505255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTT CCGAAGGTCA GTTGTCGAAG CTGGGTGAAG CGGCCCGAAC ATTTCTTGCC 
ACGCGGCCGG GCCTGCTGAT CGCCGGACGT AACACGCCCG CCGCCAGCGG CGCGCGGCGC
GACGTGCTCG ATCCATCGAC CGGGCTTGTG GTCACCGACG TGGCCGAAGG CGGAGCGGCG
GACGTCGACG CGGCGGTGGC AGCTGCCCGC GCGACCTTCC AGAGCGCCGA GTGGCGCGAC
ATGCGTCCCC TGGCGCGCGA ACGCCTGCTT CACCGCCTGG CCGACCTGAT CGAACGCGAC
GCCGATATCC TTGCGGAACT CGAAGTGATC GACACCGGCA AGATCCTGCC GATGGCGCGT
CATGGCGATC TCGTCATCGC TCTGGATTCC CTGCGCTACA TGGCAGGCTG GACCACCAAG
ATCGAAGGGA CGACGATCGA TCCCTCGTTC TCCTACATTC CCCCGATCCG CTTTTCTGCC
CGCACCGTGC GCCAGCCGGT CGGCGTTGTC GGGCAGATCA TCCCGTGGAA CTTCCCGCTG
GTCATGGCCA TCTGGAAAAT CGCGCCGGCG CTTGCCGCAG GCTGCACCGT CGTGCTGAAG
CCCGCCGAGG ACACCCCGCT CACCGCGCTC TACCTTGGCC GCCTGATCGC GGAAGCCGGG
TTCCCGGCGG GAGCCGTCAA CATCGTGACC GGCGGGCGCG AGGTCGGCAT GGCGATCGTC
GAGCATCCCG GCATCGACAA GATTGCCTTT ACCGGATCGA CCGCCGCCGG CCAGGACATC
CAGCGTAGGG CCGCCGCCAC GATGAAGCGC CTCAGCCTGG AACTCGGTGG CAAGAGCCCG
GTCGTTATCC TCGAGGATTG CCCGGTGCCG ATGGCGGTGG AAGGCGCGGC GGGCGCGATC
TTCTTCAACC ACGGCCAGGT CTGCACCGCC GGTTCACGCC TGCTGGTCCA CCGCAGCATC
TACGAGGACG TCGTGCAGGG GCTAGCCCAT GCGGCGAATG GCATGGTGCT GGGCGAAGGG
ATGGACCCGG CCGGCCAGAT GGGGCCCTTG ATCTCCGCCC GCCAGCGGGA CCGTGTTGCC
GGTTACGTGC AGGGTGCGCT CGATCAGGGC GCGCGGCTGC TGGCGGGCGG TGAAGCGCCG
GACCGCGACG GGTTCTTCTA CCGGCCGACA GTGCTTGCCG ATGGCACGCC TTCCATGACC
ATCTTCCAGG AGGAAGTGTT CGGTCCGGTG GTCATTGCCA TGCCCTTCGA TACGGAAGAG
GAAGCGCTCG CGCTTGCCAA CGATTCCTGC TTCGCGCTGG GCGCCAGCGT CTGGACGCAG
AACCTTGCGG CGGCCAACCG CTTCGCCGGC GCGCTGCGCT CGGGCAACGT CTGGATCAAC
GCGCACAACA TCCTCGACCC GGCGGTGCCG TTCGGTGGGT GGAAGATGTC GGGATATGGC
CGCGAACTGG GACACAGCGC GGTCGAACTC TATACCGAGG CCAAGTCCAT CACGATGCCG
CTCCTCTGA
 
Protein sequence
MASSEGQLSK LGEAARTFLA TRPGLLIAGR NTPAASGARR DVLDPSTGLV VTDVAEGGAA 
DVDAAVAAAR ATFQSAEWRD MRPLARERLL HRLADLIERD ADILAELEVI DTGKILPMAR
HGDLVIALDS LRYMAGWTTK IEGTTIDPSF SYIPPIRFSA RTVRQPVGVV GQIIPWNFPL
VMAIWKIAPA LAAGCTVVLK PAEDTPLTAL YLGRLIAEAG FPAGAVNIVT GGREVGMAIV
EHPGIDKIAF TGSTAAGQDI QRRAAATMKR LSLELGGKSP VVILEDCPVP MAVEGAAGAI
FFNHGQVCTA GSRLLVHRSI YEDVVQGLAH AANGMVLGEG MDPAGQMGPL ISARQRDRVA
GYVQGALDQG ARLLAGGEAP DRDGFFYRPT VLADGTPSMT IFQEEVFGPV VIAMPFDTEE
EALALANDSC FALGASVWTQ NLAAANRFAG ALRSGNVWIN AHNILDPAVP FGGWKMSGYG
RELGHSAVEL YTEAKSITMP LL