Gene Saro_1668 details

Gene Information

Locus tag: Saro_1668
Symbol:
ID: 3918777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1748389
End bp: 1749789
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 67%
IMG OID: 640444409
Product: aldehyde dehydrogenase
Protein accession: YP_496942
Protein GI: 87199685
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
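
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the record does not state either convention). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1748389, 1749789)
print(length_bp)                  # 1401
print(protein_length(length_bp))  # 466
```

Both values agree with the Gene Length and Protein Length fields of this record.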


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTTTG AACGCATCAA TCCGATGACC GGCGCCGTCG CCTCGCAGGC AGAGGCCATG 
AAAGCGTCGG ACATTCCTTC CATTGCTGCC CGCGCAGGAC AGGCCTTTCC GGCGTGGGCC
GCGATGGGCC CCAACGCACG TCGCGGCGTA CTGATGAAGG CGGCTGCGGC GCTCGAAGCG
CGGGCCGACG CTTTCGTCGA GGCCATGATG GGCGAGATCG GCGCGACCAG GGGCTGGGCG
CTGTTCAACC TCGGCCTTGC CGCCAGCATG GTGCGCGAAG CCGCCGCGCT GACCACGCAG
ATCTCGGGAG AGGTCATTCC CTCGGACAAG CCGGGCTGCA TTTCGATGGC TCTGCGCGAA
CCGGTTGGCG TGATTCTGGG CATCGCGCCG TGGAATGCGC CGATCATCCT TGGCGTGCGC
GCCATTGCCG TGCCGCTCGC CTGCGGCAAC GCGGTGATAC TCAAGGCCAG CGAAACCTGT
CCGCGCACCC ACGCGCTCAT CATCGAGGCC TTCGCCGAAG CCGGCTTCCC CGAAGGCGTG
GTCAATGTCG TGACGAACGC GCCTGCCGAC GCAGCGGAAG TGGTCGGCGC GCTGATCGAT
GCGCCGGAAG TGCGCCGCAT CAACTTCACC GGCTCGACCA ATGTCGGCAG GATCATCGCA
AAGCGCGCGG CCGAGCATCT CAAGCCCTGC CTGCTCGAAC TGGGCGGCAA GGCACCGCTG
ATCGTCCTGG ACGATGCGGA CCTCGACGAA GCGGTCAAGG CCGCGGCTTT CGGCGCCTTC
ATGAACCAGG GCCAGATCTG CATGTCGACG GAGCGGATCA TCGTGGTCGA TGCCGTTGCC
GATGCCTTCG CCGATAAGTT CAAGGCCAAG GTCGCCTCGA TGGCTGTCGG CGACCCGCGC
GAGGGCACGA CCCCGCTCGG CGCCGTCGTC GACGCGAAGA CCGTCGCTCA CTGCCGCAGC
CTGATCGACG ATGCCCTGGC CAAGGGCGCC CGTCTGCTGA CCGGCGGCGA AACCACGCAC
AACGTGCTCA TGCCCGCCCA TGTCGTCGAT GGCGTGACGC AGGACATGAA GCTGTTCCGC
GACGAGAGCT TCGGCCCCGT GGTCGGCGTG ATCCGCGCGC GCGACGAAGC CCATGCCATC
GAACTGGCGA ACGACAGTGA ATACGGCCTG TCGGCGGCCG TTTTCACCCG CGACACCGCG
CGCGGCCTGC GCGTCGCCCG CCAGATCCGT TCGGGCATCT GCCATGTCAA CGGCCCGACC
GTCCACGACG AGGCGCAGAT GCCTTTCGGT GGAGTGGGCG CGTCCGGCTA CGGCCGCTTT
GGCGGCAAGG CCGGCATCGA CAGTTTCACC GAGCTGCGCT GGATCACGAT GGAAACCCAG
CCCGGCCACT ATCCCATTTG A
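
The record lists a GC content of 67% for this gene. As a rough sketch of how such a figure could be recomputed from the blocked sequence above, the helpers below strip the grouping whitespace and count G and C bases. The function names are hypothetical, and the short example uses only the final line of the gene sequence, so its result is not the gene-wide value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final line of the gene sequence above, used here only as a short example.
tail = clean_sequence("CCCGGCCACT ATCCCATTTG A")
print(len(tail))                   # 21
print(round(gc_percent(tail), 1))  # 57.1
```

Passing the full 1401 bp sequence through `clean_sequence` and `gc_percent` would give the gene-wide percentage for comparison with the GC content field.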
 
Protein sequence
MQFERINPMT GAVASQAEAM KASDIPSIAA RAGQAFPAWA AMGPNARRGV LMKAAAALEA 
RADAFVEAMM GEIGATRGWA LFNLGLAASM VREAAALTTQ ISGEVIPSDK PGCISMALRE
PVGVILGIAP WNAPIILGVR AIAVPLACGN AVILKASETC PRTHALIIEA FAEAGFPEGV
VNVVTNAPAD AAEVVGALID APEVRRINFT GSTNVGRIIA KRAAEHLKPC LLELGGKAPL
IVLDDADLDE AVKAAAFGAF MNQGQICMST ERIIVVDAVA DAFADKFKAK VASMAVGDPR
EGTTPLGAVV DAKTVAHCRS LIDDALAKGA RLLTGGETTH NVLMPAHVVD GVTQDMKLFR
DESFGPVVGV IRARDEAHAI ELANDSEYGL SAAVFTRDTA RGLRVARQIR SGICHVNGPT
VHDEAQMPFG GVGASGYGRF GGKAGIDSFT ELRWITMETQ PGHYPI
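
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this, assuming the codon-to-amino-acid assignments of table 11 match those of the standard genetic code (the two tables differ in which codons may act as start codons, which this sketch does not model). The `translate` helper is illustrative, expects an in-frame sequence, and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()  # drop grouping whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last in-frame stretches of the gene sequence in this record.
print(translate("ATGCAGTTTG AACGCATCAA TCCGATGACC"))  # MQFERINPMT
print(translate("CCCGGCCACT ATCCCATTTG A"))           # PGHYPI
```

The two results match the first ten and last six residues of the protein sequence listed above.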