Gene Saro_2869 details


Gene Information

Locus tag: Saro_2869
Symbol:
ID: 3915508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3090707
End bp: 3092221
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 66%
IMG OID: 640445648
Product: aldehyde dehydrogenase (acceptor)
Protein accession: YP_498139
Protein GI: 87200882
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
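
The listed gene and protein lengths follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the stop codon is not counted in the protein length. Both are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic (the function names are illustrative, not part of any database API):

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(3090707, 3092221)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of the record.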


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.179484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACA TGACCACCAT CTCGCGCACG CAGCGCGAAT ACTCGGAGGC CGCCAAGGCC 
TTCCTCGCGC GCAAGCCGCA GTTGTTCATC AACAACGAGT GGGTCGACAG CAGCCACGAC
GCCGTGATCG AGGTGGAAGA CCCCTCGAAC GGCAGGATCG TCGGTCATGT CGTCGATGCC
TCGGACAAGG ACGTCGACCG GGCGGTTGCC GCGGCGCGCG CCGCGTTCGA CGATGGCCGC
TGGTCCAACC TGCCGCCAAT GGTCCGCGAT CGCACCATGA ATCGCCTGGC CGACCTGCTT
GAAGCCAACG CCGATCTCTT TGCCGAGCTC GAAGCGATCG ACAACGGCAA GCCCAAGGGC
ATGGCCGGCG CCGTCGACAT CCCCGGCGCG ATCAGCCAGC TCCGCTTCAT GGCCGGCTGG
GCCAGCAAGG TCGCGGGCGA GACGACGCAG CCCTACACGA TGCCCAACGG CACCGTGTTC
AGCTACACCG TCAAGGAACC CGTCGGCGTC TGCGCGCAGA TCGTGCCGTG GAACTTCCCG
CTGCTGATGG CCTCGCTCAA GATCGCCCCG GCGCTGGCGG CTGGCTGCAC CCTGGTGCTG
AAGCCCGCCG AACAGACCTC GCTTACCGCG CTCAAGCTTG CCGATCTCGT GGTCGAGGCC
GGCTTCCCTG CGGGCGTGAT CAACATCATC ACCGGCAACG GCCACACCGC CGGTGACCGC
ATGGTCAAGC ATCCCGACGT CGACAAGGTC GCCTTCACCG GCTCGACCGA GATCGGCAAG
CTGATCAATC GCAACGCCAC CACCACGCTC AAGCGGGTCA CGCTCGAACT GGGCGGGAAG
AGCCCCGTCG TGGTCATGCC CGACGTCGAC GTGGCGCAGA CCGCGCCTGG CGTTGCCGGC
GCGATCTTCT TCAACGCGGG CCAGGTCTGC GTTGCCGGTT CGCGTCTCTA TGCGCACCGT
TCGGTGTTCG ATTCCGTGCT CGAAGGCATG ACCCAGACCG CGCCGTTCTG GGCGCCGCGC
CCCTCGCTGG ATCCCGAAGC CCACATGGGC CCGTTGGTCA GCAAGGAGCA GCACGACCGC
GTGATGGGCT ACATCGAGGC GGGCAAGCGC GATGGCGCCA GCGTCGTCAT GGGCGGCGAT
TGCCCCAGCG CCGATGGCGG GTACTACGTC AACCCGACGA TCCTTGCAGA CGTGAACCCG
CAGATGTCGG TCGTGCGCGA GGAAATCTTC GGCCCCGTCG TCGTCGCCCA GCGCTTCGAC
GATCTCGATG AAGTGGCGAA GATGGCCAAC GACACCTGCT TCGGCCTCGG CGCGGGCGTG
TGGACGCGCG ATGTCGCGGT GATGCACAAG CTTGCCTCGA AGATCAAATC GGGCACCGTG
TGGGGCAACT GCCACGCCCT GATCGATACC GCGCTGCCCT TTGGCGGCTA CAAGGAATCG
GGCCTGGGCC GCGAACAGGG GCGCGCCGGC ATCGACGCCT ACCTCGAGAC CAAGACCGTC
ATCATCCAGA TGTAA
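
The gene sequence is printed in space-separated blocks of ten bases, so the blocks need to be joined before any analysis. The sketch below strips the whitespace and computes GC content as the percentage of G and C bases, which is presumably how the GC content field was derived (an assumption). The helper names are hypothetical, and only the last line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Sample input: the final line of the gene sequence above.
sample = clean_sequence("ATCATCCAGA TGTAA")
print(sample)
print(round(gc_percent(sample), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` allows the reported GC content to be checked against the sequence itself.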
 
Protein sequence
MNDMTTISRT QREYSEAAKA FLARKPQLFI NNEWVDSSHD AVIEVEDPSN GRIVGHVVDA 
SDKDVDRAVA AARAAFDDGR WSNLPPMVRD RTMNRLADLL EANADLFAEL EAIDNGKPKG
MAGAVDIPGA ISQLRFMAGW ASKVAGETTQ PYTMPNGTVF SYTVKEPVGV CAQIVPWNFP
LLMASLKIAP ALAAGCTLVL KPAEQTSLTA LKLADLVVEA GFPAGVINII TGNGHTAGDR
MVKHPDVDKV AFTGSTEIGK LINRNATTTL KRVTLELGGK SPVVVMPDVD VAQTAPGVAG
AIFFNAGQVC VAGSRLYAHR SVFDSVLEGM TQTAPFWAPR PSLDPEAHMG PLVSKEQHDR
VMGYIEAGKR DGASVVMGGD CPSADGGYYV NPTILADVNP QMSVVREEIF GPVVVAQRFD
DLDEVAKMAN DTCFGLGAGV WTRDVAVMHK LASKIKSGTV WGNCHALIDT ALPFGGYKES
GLGREQGRAG IDAYLETKTV IIQM
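
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table for translation table 11 from the NCBI-style amino acid string. For sense and stop codons this assignment is the same as the standard code, because table 11 differs from it only in the permitted start codons. It assumes the gene sequence is given 5' to 3' on the coding strand, as the ATG start and TAA stop suggest. The function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the final line of the gene sequence above.
print(translate(clean_sequence("ATGAACGACA TGACCACCAT CTCGCGCACG")))
print(translate(clean_sequence("ATCATCCAGA TGTAA")))
```

The two sample inputs correspond to the first ten and last four residues of the listed protein. Applying `translate` to the full cleaned gene sequence is a simple way to confirm that the two sequences in this record agree.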