Gene Saro_3855 details

Gene Information

Locus tag: Saro_3855
Symbol:
ID: 5077466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 23208
End bp: 24692
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 64%
IMG OID: 640480964
Product: aldehyde dehydrogenase
Protein accession: YP_001165626
Protein GI: 146275465
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase
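
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the 1485 bp gene length includes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 23208
END_BP = 24692
GENE_LENGTH_BP = 1485
PROTEIN_LENGTH_AA = 494


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1485, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 494, matches Protein Length
```

Under these assumptions the three figures agree: 24692 - 23208 + 1 = 1485 bp, and 1485 / 3 - 1 = 494 aa.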


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.146546
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTCAA CTTCCTCCTC TCCGGTCACC GAAACTATCC TGAACTTCAT TGACGGTTCC 
TATCGCAAAG GTAGCGAGGG TAAGTCGTTT CCCAACGTCA ATCCGGCCAC CGGCACCGCG
ATCGGGGTTG TGCACGAAGC AAGCCAGGCC GACGTCGCAG ACGCCGTGGC TGCGGCCAAG
GCAGCGCTTA CCGGGCCATG GGGCAAGATG ACCACGGCCG AGCGGGTCAA GCTGATCGCC
GCCGTGGCGA CCGAGATCGA ACGCCGAGCG GATGATTTCC TGGCTGCCGA AGTGGCCGAC
ACCGGCAAGC CGCGCCATGT CGCGTCCCAT ATCGACATTC CGCGCGGGGC CGCCAACTTC
CGCATGTTCG CGGATGTCGT CTCGACAATG CCGGGCGAAA GCTTCAACAC ATCGACCCCC
GATGGCGGCC AGGCGCTCAA CTATACCGTG CGCAAGCCCA AGGGTGTGGT CGCGGTGGTC
TGCCCATGGA ACTTCCCGCT GCTGCTGATG ACCTGGAAGG TTGGCCCGGC GCTGGCCTGC
GGCAATACCG TGGTGGTCAA GCCGTCCGAG GAAACGCCTC GGACTGCTGC CCTGCTGGGT
GAAGTAATGA ACGCGGTGGG CATGCCCAAG GGTGTCTACA ACGTCGTCCA CGGATTCGGT
CCGGGTTCGG CCGGCGAATT CCTCACGTCC AACCCCGATG TCGATGCCAT CACCTTCACC
GGCGAGACCG GCACCGGACA GGCGATCATG CAGAAGGCCG CGATCGGCGT TCGCGACATT
TCGTTCGAAC TCGGTGGCAA GAACCCGGCG ATTGTGTTCG CCGATGCCGA CCTCGACAAG
GCGGTCGAGG GTCTGTCGCG CTCGGTCTTC CTGAACACCG GGCAGGTCTG CCTCGGAACC
GAGCGGGTCT ATGTCGAACG ACCGATCTTC GACGCCTTCG TGGCGCGGAT GGCGGCGGCG
GCGCAGGGCT TCAAGCCGGG CGTGACCGGT GATCGCGCCT ATCTCGGCCC GCTGATCAGC
GCCGAGCACC GCGAGAAAGT GCTGGGTTAC TATCGCCGTG CGGTCGAGGA CGGGGCCACC
GTGGTCACCG GCGGCGGCGT TCCCGAAATC TCGGGTGCGG AAGCCGGTGG CTTCTTCGTG
GAACCGACGC TGTGGATCGA CGTCGCCCAC GGCGATACCG TGATGCGCGA GGAAATCTTC
GGACCGTGCT GCGGTATCGT GCCGTTCGAC AGCGAGGACG AGGTGATCGC ACTTGCAAAC
GATACGGTTT ACGGCCTGTG CGCCTCGATC TGGACCGAAA ACATGTCCCG CGGACACCGC
GTGGCGGCGG CGATGGACGT GGGGGTGTGC TGGGTCAATT CCTGGTTCCT GCGCGATCTG
CGCACGGCTT TCGGCGGGTC CGGCCATTCC GGCATCGGCC GGGAAGGCGG GGTGCACAGC
CTCGAATTCT ACACCGAGAT CACCAACATT TGCGTGAAGC TGTAA
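
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a rough sketch, GC content can then be computed as the share of G and C bases. The helper names below are hypothetical, and the example runs only on the first displayed line, not on the full gene, for which the record lists 64%.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (60 bases).
first_line = "ATGACCTCAA CTTCCTCCTC TCCGGTCACC GAAACTATCC TGAACTTCAT TGACGGTTCC"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))           # 50.0 for this line only
```

Passing the complete gene sequence to `gc_percent` gives the value to compare against the GC content field of the record.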
 
Protein sequence
MTSTSSSPVT ETILNFIDGS YRKGSEGKSF PNVNPATGTA IGVVHEASQA DVADAVAAAK 
AALTGPWGKM TTAERVKLIA AVATEIERRA DDFLAAEVAD TGKPRHVASH IDIPRGAANF
RMFADVVSTM PGESFNTSTP DGGQALNYTV RKPKGVVAVV CPWNFPLLLM TWKVGPALAC
GNTVVVKPSE ETPRTAALLG EVMNAVGMPK GVYNVVHGFG PGSAGEFLTS NPDVDAITFT
GETGTGQAIM QKAAIGVRDI SFELGGKNPA IVFADADLDK AVEGLSRSVF LNTGQVCLGT
ERVYVERPIF DAFVARMAAA AQGFKPGVTG DRAYLGPLIS AEHREKVLGY YRRAVEDGAT
VVTGGGVPEI SGAEAGGFFV EPTLWIDVAH GDTVMREEIF GPCCGIVPFD SEDEVIALAN
DTVYGLCASI WTENMSRGHR VAAAMDVGVC WVNSWFLRDL RTAFGGSGHS GIGREGGVHS
LEFYTEITNI CVKL
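
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own method: it builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The `translate` function is a hypothetical helper.

```python
from itertools import product

# Codons enumerated in TCAG order at each position, paired with the
# standard amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGACCTCAA CTTCCTCCTC TCCGGTCACC"))  # MTSTSSSPVT
print(translate("TGCGTGAAGC TGTAA"))                  # CVKL
```

The two outputs match the first ten and last four residues of the protein sequence listed in this record.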